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ATTITUDE CONTENT AND AGREEMENT SET IN SCALES 
OF AUTHORITARIANISM, DOGMATISM, ANTI-SEMITISM, . 
AND ECONOMIC CONSERVATISM? 


DEAN PEABODY? 
Columbia University 


TTITUDES cannot be measured directly. 
Instead, an individuals attitude 
content must be inferred from his 

responses, In the Likert-type of attitude scale, 
the inference is from responses.of agreement or 
disagreement with the statements of the scale 
to two types of attitude content: attitudes 
favorable to the attitude object scored posi- 
tively by the scale (here called “procontent” 
attitudes); and attitudes opposed to the at- 
titude object and scored negatively (here called 
“anticontent” attitudes). If the. individual 
has no definite attitudes of either kind, then his 
response must be based on "response set" 
(Cronbach, 1946, 1950), and any inference to 
attitude content is invalid. Attitude scales 


derived from The Authoritarian Personality. 


(Adorno, Frenkel-Brunswik, Levinson| & 
Sanford, 1950) go against the recommenda- 
tions of Likert (1932) in two respects that make 


response sets particularly important: the items — 


are deliberately made ambiguous, the items 
are all in one direction—worded favorably to 
thé attitude object with agreement scored 
positively (here called “plus” statements)— 
instead of including statements worded 
unfavorably to the attitude object with agree- 
ment scored negatively (*minus" statements). 
As a result, agreement response set is sys- 
tematically confounded with procontent at- 
titudes. 7 

Considerable evidence now supports the 


! This report represents a revision of a doctoral dis- 
sertation at Harvard University. The collection of data 
in England was aided by a grant from the Social Science 
Research Council; the replication in the United States 
by a grant from the Laboratory of Social Relations at 
Harvard. Support for rewriting was given by the Coun- 
cil for Research in the Social Sciences at Columbia. 

J For help in obtaining subjects, I am indebted to N. 
Birnbaum and to R. A. F. Harcourt and the members 
of his department in England; and to H. Zamansky and 
A. J. Ferretti in the United States. I àm particularly 
grateful to G. W. Allport for his continuing advice and 
support, and to R. Christie for many helpful discus- 
sions. 

? Now at the University of Oregon. 


possibility that agreement set is an important 
factor in scores on such scales, particularly the 
F Scale of “authoritarianism.” Indirect evi- 
dence includes findings of positive relations 
between F Scale scores and responding “agree” 
or "true" to a wide variety of items (see review 
in Christie, Havel, & Seidenberg, 1958; also 
Couch & Keniston, 1960). Direct evidence 
comes from studies that attempt to reverse 
the wording of F Scale items to give minus 
versions. Scores on these reyersals are then 
correlated with scores on originals of other 
items. At present there are reports of such 
correlations for over 20 different samples. 
Many of these studies are summarized in 
Christie et al. (1958), and in Chapman and 
Bock (1958). The correlations are all sub- 
stantially smaller than would be expected if 
either attitude content or agreement set were 
operating alone. The correlations are generally 
slightly in the content direction. The apparent 
interpretation is that both attitude content 
and agreement set are important factors on 
the F Scale, content being somewhat more 
important. It will be shown below why this 
interpretation can be misleading. 

Why should another study be directed to 
this problem? The purpose of the present study 
is to put the problem in a different perspective 
in four major respects: À 

1. One purpose is to broaden the problem 
by considering the role of agreement set not 
only within the F Scale, but also in its relations 
with other scales. The relations considered 
are those that are theoretically important to 
the formulation of The Authoritarian Per- 
sonality and the reformulation by Rokeach 
(1956, 1960). The Authoritarian Personality 
argues for positive relations between authori- 
tarianism as measured by the F Scale, measures 
of prejudice (the present study uses items from 
the Anti-Semitism scale rather than the more 
heterogeneous Ethnocentrism scale), and 
conservative politico-economic attitudes as 
measured by the PEC scale. Rokeach argues 
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that while the F and prejudice scales correlate 
with a conservative or “right” politico-eco- 
nomic position, his own Dogmatism scale is 
independent of the right-left dimension and 
thus a more general measure than the F Scale, 
which measures “authoritarianism of the 
right." Accordingly, the present study 
examines the roles of response set and attitude 
content within and between four measures: 
items from the F, Dogmatism, and Anti- 
Semitism scales, and items referring to Con- 
servatism or right-left positions on economic 
and political issues. 

Hyman and Sheatsley (1954, pp. 73-74) 
point out that many items of the PEC scale 
do not refer directly to economic and political 

issues, and may be measuring something else. 
Similarly, Rokeach’s measure of the Right- 
Left dimension is complicated by an attempt 
to measure "'opinionation" simultaneously. 
Accordingly the present study required con- 
structing a set of Conservatism items referring 
directly and solely to economic and political 
issues. 

2. The use of several scales makes it possible 
to compare results for different scales that may 
represent different degrees of ambiguity. The 
Conservatism items—unlike the items from 
the F, Dogmatism, and Anti-Semitism scales— 
are intended to be like traditional Likert 
scales in avoiding ambiguity. They should 
provide a base line for indicating the effect 
of ambiguity, 

3. Another purpose is to gain a cross- 
national perspective by use of an English as 
well as an American sample. Cross-national 
comparison provides a check on the generality 
of the relations of the other scales to right-left 
attitudes in a culture where economic positions 
are generally to the left of those in the United 
States. Also provided is a cross-national check 
on the nature of agreement set itself. For 
example, Berg and Rapaport (1954) suggest 
that the prevalence they find for responses of 
“true,” “agree,” and “yes” over “false,” 
“disagree,” and “no” may be due to the 
prevalence in the American culture of the 
“Yes man.” 

4. Perhaps the most important objective of 
the present study is to redefine the problem 
of agreement set and use a design that permits 
separating (a) between agreements and dis- 


agreements with the original items, (b) be- 
tween the two types of attitudes usually in- 
ferred from these responses—procontent and 
anticontent attitudes. 

The reason for separating the two kinds of 
attitudes that are usually combined comes 
from the general evidence regarding response 
sets. While individuals show consistent differ- 
ences in response sets such as responding 
“agree” or “disagree” to ambiguous state- 
ments, most people tend toward “acquies- 
cence” (responding “agree” more often than 
“disagree,” etc.), Evidence using impossibly 
difficult information items (Fritz, 1927); or 
imaginary items (Bass, 1957; Berg & Rapaport, 
1954) shows average percentages of acquies- 
cence of 62%-69%—roughly twice as many 
Tesponses of “true” as “false,” “agree” as 
“disagree,” etc. 

From this independent evidence, it follows 
that on attitude scales where agreement is 
always scored positively, response sets are 
more likely to affect the high scores usually 
attributed to procontent attitudes, rather than 
the low scores attributed to anticontent at- 
titudes. This asymmetry in the effects of re- 
sponse set is of theoretical as well as methodo- 
logical importance, since interpretation usually 
centers on the high scorers as “authoritarians,” 
“dogmatists,” etc., with only indirect reference 
to the attitudes of low scorers. In fact, high 
scores are more likely to be due to agreement 
set instead of procontent attitudes; and low 
Scores to anticontent attitudes opposed to 
authoritarianism, dogmatism, and anti-Semi- 
tism. The situation is analogous to true-false 
tests where acquiescence makes true items 
less valid and false items more valid (Cronbach, 
1941, 1942). 

The usual design in previous studies has 
been to construct reversals or minus versions 
of some of the items and correlate these re- 
versals with originals of other items. Such 
correlations combine both types of attitude 
content—whatever overall attitudinal con- 
sistency the correlations indicate is likely to 
be primarily due to the consistency of dis- 
agreements on originals (i.e., to anticontent 
attitudes). The correlations disguise the 
inconsistency of agreements with original 
items, which are likely to be due to agreement 
set rather than procontent attitudes, 
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Accordingly, a preferable design should 
consider separately agreements and disagree- 
ments with the original versions, for example, 
by comparing responses to original and re- 
versed versions of the same items. In contrast 
to the single session needed for the usual de- 
sign, this design requires two sessions with an 
interval between presentations of the two 
versions of each item—to prevent direct com- 
parison by the subjects. The basic evidence 
for either type of attitude content is agreement 
with one version and disagreement with 
the other version: agreement with the original 
and disagreement with the reversal for a pro- 
content attitude; disagreement with the origi- 
nal and agreement with the reversal for an 
anticontent attitude. 

The last statement provides the rationale 
used here for selecting reversals. The reversal 
should be such that (a) a person with a pro- 
content attitude should disagree; (b) a person 
with an anticontent attitude should agree. 
Thus there are two requirements for reversals: 

The first requirement of a reversal is that 
it reverse. It is clear that logical reversal by 
itself is not enough. Many of the comments of 
Christie et al. (1958, pp. 146-147) are relevant 
here. Specifically, for the present study 
adequate reversal was intended as reversal 
of the theoretical rubrics the original items 
are supposed to represent (e.g., the theoretical 
rubrics for authoritarianism are the charac- 
teristics Conventionalism, Authoritarian sub- 
mission, etc.). In whatever sense agreement 
with the original implied the characteristics 
described by the rubrics, agreement with the 
reversal should imply their absence. 

The second principle used here is that the 
reversal should represent the range of possible 
anticontent positions. (This principle would 
include the recommendations of Christie 
et al., 1958, against extremity of wording for 
the reversal) To differentiate anticontent 
attitudes from any tendency to respond 
“disagree,” a reversal should not restrict itself 
to an extreme counterposition, but should 
allow those with moderate anticontent at- 
titudes to agree. 


METHOD 
Subjecis 
The subjects were American and English engineering 
college students, An effort was made in selecting the 


two schools so that the subjects would be as comparable 
as possible except for their difference in nationality. 
Eliminating those subjects who omitted a large number 
of items, and those who were foreign students and mem- 
bers of minority groups, gave final samples of 88 Ameri- 
can and 75 English subjects. 


Scales and Items 


Despite the difficulties in adequately reversing items, 
it was desirable to have a considerable number of items 
from each scale used to permit reliable results. Four sets 
of items were used—each item being worded in a “plus” 
and a “minus” version: 

1. 28 of the 30 items from the final form of the F 
Scale (Adorno et al., 1950, pp. 255-257, 260) omitting 
Items 22 and 30/28. 

2. The 40 items of the final version of the Dog- 
matism scale (Rokeach, 1956, Form E). 

3. 20 items selected from the Anti-Semitism scale. 
Preference was given to those items retained in later 
forms of the Anti-Semitism and Ethnocentrism 
scales, and to keeping some balance between items 
supposed to represent the different theoretical rubrics 
(subscales) of Anti-Semitism. 

Reversed or minus versions for these three sets of 
items were constructed, following the two principles 
of reversal presented above. (F Scale reversals from 
previous studies were used where possible.) 

4, 16 “Conservatism” items relevant to consèrva- 
tism or right-left issues. In developing these items, the 
objectives were to refer directly to economic and 
political issues (and so meet the objections of Hyman 
& Sheatsley, 1954, to the PEC scale), and to give 
comparability for England and the United States by 
a series of issues that would tend to split opinion both 
within and between the two countries. Where pos- 
sible items were adapted from previous sources (e.g., 
Centers, 1949; Eysenck, 1954). For comparability 
with the other three scales, each issue was stated in 
both plus (Conservative) and minus (non-Conserva- 
tive) versions. 

Half of the Conservatism items referred to the role 
of the government in the economy: issues concerning 
the proper extent of control or ownership (e.g., Item 2+ 
“The government should not enter into the production 
of power and energy such as electricity wherever private 
industry can profitably develop it”); or government re- 
sponsibility for recession and unemployment (e.g., Item 
7— “If signs of an economic recession appear, the gov- 
ernment should take action at once, such as a program 
of construction of highways, schools, and hospitals"). 
The remaining items referred to government responsi- 
bility for social welfare (e.g., Item 10+ ‘Provision for 
medical care should be the responsibility of the individ- 
ual and not of the government”); and to issues of eco- 
nomic and social equality (e.g., Item 13— “Large in- 
equalities in wealth and income should be prevented by 
taxation").* 

? Tables giving plus and minus versions of the items 
used from the F, Dogmatism, Anti-Semitism, and Con- 
servatism scales have been deposited with the American 
Documentation Institute. Order Document No. 6826 
from ADI Auxiliary Publications Project, Photodupli- 
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Procedure 


Each of the four sets of items was divided into 
halves. The original or plus versions of one half and the 
reversed or minus versions of the other half were 
assigned to one of the two questionnaires. Thus each 
questionnaire included 104 statements: originals or plus 
versions for 14 F, 20 Dogmatism, 10 Anti-Semitism, 8 
Conservatism items; and the same number of reversals 
or minus versions of the remaining items. The order of 
the statements within each questionnaire was arranged 
to separate those of similar content. 

"The subjects took the questionnaires in groups. The 
interval between questionnaires was 2 weeks for the 
American subjects and varied from one week to a month 
for different groups of English subjects. Each question- 
naire was taken first by about half the subjects in each 
sample. 


RESULTS 


The subjects responded to the statements 
in terms of six response categories, indicating 
three degrees of agreement or three degrees of 
disagreement. The resulting composite scores 
on each set of items reflect not only the basic 
dichotomy of agreement vs. disagreement, but 
also the extremeness of agreement or dis- 
agreement, However the composite scores 
tend to correlate .90 or above with simpler 
dichotomized scores giving the proportion of 
items where the subject responded “agree” 
rather than “disagree,” regardless of ex- 
tremeness. 

Since nearly all previous studies report only 
composite scores, for comparative purposes 
the present results give composite scores at 
the beginning of Table 2 (Sections I and II). 
However, further analysis here is simplified 
by using the dichotomized agreement scores, 
Further details of the justification for using 
dichotomized scores are given elsewhere (Pea- 
body, in press). 

Considering agreement or disagreement with 
the two versions of each item gives the four 
response-combinations indicated in Table 1. 

Assuming that the minus versions generally 
succeeded in adequately reversing the plus 
versions, then the response-combinations can 
be interpreted as follows: 

Double agreement or double disagreement 
with both versions of an item are evidence for. 


cation Service, Library of Congress; Washington 25, 
D. C., remitting in advance $1.75 for microfilm or 
$2.50 for photocopies. Make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 


TABLE 1 
POSSIBLE RESPONSE COMBINATIONS 
‘Resnonse! Response to Plus Version 
to Minus 
Version!) Disagree Agree 
Agree “anticontent | double agree- (total) agree- 
response” ment ment with 
minus versions 
Disagree | double disa- | "procontent re- 
greement sponse” 
(total) agree- 
ment with 
plus versions 


response tendencies to “agree” or “disagree” 
in the absence of any definite attitude. 

The basic evidence for attitude content is 
agreement with one version and disagreement 
with the other. A response-combination in- 
volving agreement with the plus version and) 
disagreement with the minus version is con- 
sistent with a procontent attitude and is 
called a “procontent response." A response- 
combination involving disagreement with the 
plus version and agreement with the minus 
version is consistent with an anticontent 
attitude and is called an “anticontent re- 
sponse.’’ 

Considering the proportion of items in each 
scale answered with each response-combina- 
tion, then the marginal totals of Table 1 give 
the overall proportion of agreement with the 
plus or minus versions. Results in terms of 
these agreement proportions are given in 
Sections III and IV of Table 2. 

Table 2 gives results for the four scales for 
the American sample (abbreviated US) and 
for the English sample (abbreviated UK). 


Comparison with Other Samples 


If expressed in the same terms used in 
previous studies, the results for the present 
samples are very similar. The means of 
the composite scores on the original (plus) 


* Although these response-combinations are the basic 
evidence for attitude content, they could evidently re- 
sult from response set as well. Hence the absolute fre- 
quencies of these response-combinations tend to be too 
large an estimate of attitude content, but are used in 
the present report as a first approximation. A mathe- 
matical model giving a closer approximation is devel- 
oped elsewhere (Peabody, 1959). 
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TABLE 2 
RESULTS WITHIN SCALES 
Scale 
F Dogmatism Anti-Semitism Conservatism 
US UK US UK Us UK Us UK 
I. Composite scores on plus 
versions 
3.73 4.10 | 3.71 3.98 | 2.79 3.31 | 4.72 4.13 
SD .53 .80 E yer .95 1.01 .62 .87 
Reliability* .69 .82 .62 .68 .87 .78 .51 .T9 
II. Composite scores on minus 
versions 
M* 5.11 9:254 5 3:10 5.27 | 5.50 5.34 | 3.65 4.54 
SD Al 39 .33 .40 .69 .85 .60 78 
Reliability .42 E iSt -64 74 .86 .58 .78 
III. Agreement scores on plus 
versions 
.45* .50 .45** .50 .24** .35 .64** 252 
SD .12 .18 .10 +13 119 .23 43 .18 
Reliability .56 .78 E! -67 78 .81 .28 12 
IV. Agreement scores on minus 
versions 
k Ss bg .78 .76* -79 .81 .81 .46** .63 
SD .09 .08 .07 .09 +13 .16 13 AT. 
Reliability .48 si .33 .63 54 .81 AT Nr 
V. Correlation between plus 
and minus? Y 
Composite scores —.18 —.36 | —.21 —.20 | —.65 —.19 | —.61 —.74 
Agreement scores —.07 —2|-—. —.17 | —.57 —.68 | —.43 —.65 
VI. Response-combinations: 
Mean proportions 
double agreement .31 .36 .29 .35 215 .23 .20 124 
“procontent response" 14 .14 .16 15 09 12 44 .30 
“anticontent response" 44 .42 AT 44 .66 .58 26 AL 
double disagreement ll .08 .08 .06 .10 .07 .10 .07 
VII. % double agreement in 
agreement with plus 68 72 65 70 62 65 32 42 
VIII. % "anticontent response" 
in disagreement with plus | 81 85 85 88 87 89 72 85 


* All reliabilities are for split-halves separated by an interval, usually 2 weeks. 
b For this table, agreement with minus versions was scored positively, rather than negatively as in many reports. The results presented 
can be conyerted by subtracting the means in Section II from 8.00, or those in Section IV from 1.00; and by reversing the signs of the 


correlations in Section V, making them positive. 
* Nationality difference significant at .05 level. 
** Nationality difference significant at .01 level. 


versions of the F, Dogmatism, and Anti- 
Semitism scales are within the usual range for 
college students; so the present samples have 
as many high scorers as other studies. There 
is somewhat higher agreement with the're- 
versed (minus) versions than in other studies 
using F Scale reversals—presumably due to 
the deliberate effort here to use reversals ac- 
ceptable to those opposed to the originals. As 
in previous studies, the standard deviations 
and reliabilities are larger for the plus versions. 

For the F Scale—as in previous studies—the 
correlations between plus and minus versions 
are small but in the direction of some overall 


consistency for both types of attitude content 
combined. 

Thus where comparison with previous sam- 
ples is possible, the present ones appear quite 
comparable. 

Analysis by Response-Combinalions 

The method of analysis advocated here, 
however, requires considering separately evi- 
dence for procontent and anticontent at- 
titudes. Proportions for the four response- 
combinations are given in Section VI of Table 
2. As expected, on the F, Dogmatism, and 
Anti-Semitism items these results show that 
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anticontent attitudes are far more significant 
than procontent ones: For both samples, anti- 
content responses are the most frequent re- 
sponse-combination; procontent responses on 
the other hand are considerably less frequent 
than double agreement. 

On the Conservatism items, in contrast, both 
anticontent responses and procontent re- 
sponses are more common than double agree- 
ment. 

Besides the absolute frequencies of the 
response-combinations just discussed, it is 
useful to consider their relative frequencies 
within agreement and disagreement with 
original or plus versions. Sections VII and 
VIII of Table 2 give the percentages of double 
agreement out of agreement with plus versions, 
and of anticontent responses out of disagree- 
ment with plus versions. For both samples, 
the first three scales give closely similar results: 
(a) Agreement with original F, Dogmatism, 
and Anti-Semitism items tends to be incon- 
sistent (i.e., due to agreement set rather than 
procontent attitudes) since the same person 
agrees with a reversal of the same item more 
often than not (6295-7295 of the time); (b) 
Disagreement with original items is usually 
consistent with anticontent attitudes rather 
than indicating a tendency to disagree. An 
alternative (the reversal) will usually be 
accepted (8176-8976 of the time); (c) If com- 
bined, these two effects would show some over- 
all consistency—as in previous studies report- 
ing overall correlations. Such correlations 
disguise the inconsistency of agreement with 
original items, which usually represents double 
agreement rather than procontent attitudes. 

On the Conservatism items, both agreement 
and disagreement with the plus versions 
usually represent consistent “content re- 
sponses.” 


Types of Subjects 

The results presented pooled all items in 
each scale, and all subjects in each sample. 
Further analyses not presented here show that 
the main results hold for the great majority 
of individual items and individual subjects, 
e.g., double agreement is more common than 
procontent responses for most items of the F, 
Dogmatism, and Anti-Semitism scales, but 
not for the Conservatism items. Similarly, 


TABLE 3 
NuumER or Supyects or Eacu Tvpr* 


Response-Combination F 
Rana Types) 


“anticontent responses" | 66 | 42 | 76 | 45 | 81 | 59 | 19 |42 
(anti-authoritarians, 

antidogmatists, etc.) 
double agreement (double | 22 | 32 | 12 | 28 | 5| 11| 7| 6 
agreers) 
“procontent responses” | — | 1| —| 2| 2| 5| 62] 27 
(authoritarians, dog- 
matists, anti-Semites, 
conservatives) 

double disagreement —|—-|-—-|-—-i|-|-|-—|- 
(double disagreers) | 


* A subject's “type” on each scale is defined by the response- 
combination he makes most frequently, 


| 

| 
whether subjects agree with few or many of | 
the original versions of the first three scales, l 
they tend to agree with reversals of the same | 
items. This suggests that the primary factor 
is the ambiguity of the original items, and not | 
a peculiar susceptibility to double agreement | 
by any particular group of subjects. 

The main trends are highlighted if each 
subject is characterized by the response- | 
combination he makes most frequently. The 
resulting “types” are given in Table 3. | 

On the first three scales, anticontent re- | 
sponses are the most common response- 
combination for the great majority of subjects | 
in both samples. Nearly all of the remaining 
subjects are most commonly “double agreers.” | 
In contrast, of the 163 subjects in both samples, | 
only 1 is characterized as Authoritarian, 2 as 
Dogmatists, and 7 as anti-Semites. 

On the Conservatism scale, however, sub- 
jects characterized by either kind of content 
response are more common than double 
agreers. There is a considerable difference 
between samples, more of the Americans being | 
characterized as Conservatives and more of 
the English as anti-Conservatives on these 
issues. 


Comparison between Scales 

The consistently different results for the 
Conservatism scale compared with the F, 
Dogmatism, and (to a somewhat smaller | 


r 
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extent) the Anti-Semitism scales, suggest 
strongly that agreement response set is not an 
automatic mechanical reaction, but depends 
on the ambiguity of the statements in the 
different scales. The even larger consistent 
differences between items within the scales 
would support the same conclusion. 

Results for the Conservatism scale are 
generally in the direction that should be 
expected of a measure of attitudes. Although 
agreement set is a prominent factor, both 
kinds of attitude content are more prominent. 
Agreement with the plus versions usually 
represents a procontent response, and dis- 
agreement an anticontent response. 

Results for the F and Dogmatism scales 
are similar to each other and contrast with the 
Conservatism scale: while disagreement with 
the plus versions usually represents an anti- 
content response, agreement usually repre- 
sents agreement response set. The close 
similarity between results for the F and Dog- 
matism scales is perhaps not surprising in 
view of the similar ambiguity of their items. 

Perhaps more surprising are the results 
indicating a comparable degree of agreement 
set for the Anti-Semitism items. Though it 
seemed reasonable to expect that prejudice 
items would prove less ambiguous than those 
of the F and Dogmatism scales (cf. Christie & 
Cook, 1958, p. 175), there is nearly as large 
a percentage of double agreement in agree- 
ments with originals for the Anti-Semitism 
items. The distribution of Anti-Semitism scores 
is considerably skewed in comparison to the 
other scales. The great majority of these 
college students agree with few original Anti- 
Semitism items, and tend to show double 
agreement when they do. However, at least 
Some of the few subjects who agree with an 
appreciable number of original items indicate 
a consistent anti-Semitic position by dis- 
agreeing with the reversals (cf. Table 3). Fhis 
small minority provides the correlational 
consistency for the entire samples on the Anti- 
Semitism items. 


Cross-National Comparison 


On the whole the correspondence between 
the results for the American and English 
samples is more striking than their differences. 


This cross-validation indicates the generality 
of the main results. 

Significant national differences do exist, 
however, for most sets of items, as indicated 
for the means of the agreement scores in 
Sections III and IV of Table 2. On both ver- 
sions of the Conservatism items, the English 
sample is significantly to the left of the Ameri- 
can sample—agreeing less with the plus ver- 
sions and agreeing more with the minus ver- 
sions. All the remaining significant differences 
show the English sample agreeing more often 
than the American sample; in particular the 
English subjects agree significantly more with 
both versions of the F and Dogmatism items. 
This result does not support the suggestion of 
Berg and Rapaport (1954) that acquiescence 
relates to the prevalence of the “Yes man" in 
the American culture. 


Relations between the Scales 


It is evident that insofar as agreement set is 
an important factor within the individual 
scales, it may also be an important determi- 
nant of the correlations usually found between 
the scales. The few available studies combine 
procontent and anticontent attitudes for each 
scale by the use of correlations: between re- 
versed F Scale and original E Scale items 
(Cohn, 1953; Jackson & Messick, 1957), or 
between originals and reversals of both F 
Scale and E Scale items (Chapman & Camp- 
bell, 1959). These studies indicate that both 


TABLE 4 


CORRELATIONS BETWEEN SCALES FOR 
Prus VERSIONS 


Scale 


Scale F 


US | UK | US | UK | US| UK 


Composite scores 


Dogmatism .61 | .63 
Anti-Semitism .46 | .54 | .34 | .44 
Conservatism .01 | .07 | .04 | .12 | .02)—.11 


Dogmatism .52 | .61 
Anti-Semitism «47 | .52 | .35 | .48 
Conservatism .03 | .04 | .10 | .16 | .07|—.08 
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agreement set and attitude content are factors 
in the correlations between the scales, but do 
not distinguish the two kinds of attitude con- 
tent. 

The present results begin with the correla- 
tions between plus versions of the different 
scales. For comparability with other studies, 
correlations are given in Table 4 for the usual 
composite scores as well as dichotomized agree- 
ment scores. As would be expected from the 
high relation between these two scores, they 
have highly similar correlations. 

The correlations for the Conservatism items 
are all negligibly different from zero—despite 
the possibility of common agreement sets with 
the other scales. This result suggests that there 
is no relation between conservative attitudes 
on economic and political issues and scores on 
the F, Dogmatism, or Anti-Semitism scales. 
The positive correlations usually found with 
the F and prejudice scales appear to be entirely 
due to the contamination of other measures 
of “politico-economic conservatism” with 
variables not directly related to economic and 
political issues, The results support Rokeach’s 
claim that the Dogmatism scale is unrelated 
to the right-left dimension, but not his claim 
that the F and prejudice scales in contrast 
reflect the political right. 

The remaining correlations between E, 
Dogmatism, and Anti-Semitism scores are 
closely comparable to the values reported for 
other samples. From the results presented 
above for the individual scales, a. reasonable 
expectation would be that these correlations 
are primarily due to two factors: 

1. Low scores on the original items due to 
anticontent attitudes. Those who reject 
original statements and accept reversals on the 
F Scale tend to do the same on the Dogmatism 
and Anti-Semitism scales. 

2. High scores due to agreement set. Those 
who have no definite attitudes on the F Scale 
and agree with both versions tend to do the 
same on the Dogmatism and Anti-Semitism 
scales. 

On the other hand, it would be expected 
that the correlations are not substantially 

due to covariation between procontent at- 
titudes of authoritarianism, dogmatism, and 
anti-Semitism. 

The correctness of these expectations could 


TABLE 5 
Number or Supjects or EAcH Types; 
RELATION BETWEEN THE F AND 
DocMaTISM SCALES 
(Both samples combined) 


F Scale 
Dogmatism scale | Anticontent | pou 
Responses. | Agreement 
itarians) | (agreers) 
anticontent re- 95 26 
sponses (anti- 
logmatists) 
double agreement 12 27 
(agreers) 
procontent re- il 1 
sponses (dog- 
matists) 


2 A subject’s “type” on each scale is defined by the respon e 
combination he makes most frequently. [ 


be shown in several different ways. One illus 
tration is to compare between scales the type 
of subjects used for the separate scales if 
Table 3. Such a comparison is given for the E 
and Dogmatism scales in Table 5, which 
combines the 163 subjects in both samples, 

Table 5 shows both the relative unimpore 
tance of procontent responses and the asso: 
ciation between the two scales for anticontent 
responses and for double agreement. Most 
anti-authoritarians tend to be antidogmatists, 
and double agreers on one scale tend to bé 
double agreers on the other. Similar associa: 
tions exist for the Anti-Semitism scale witl 
the F and Dogmatism scales, but since there 
are fewer double agreers on the Anti-Semitism 
items there are necessarily many persons 
characterized by double agreement on the F 
or Dogmatism scale and by  anticontent 
responses on the Anti-Semitism items. 

Thus there tend to be two general types of 
subjects on the three scales: Those getting 
higher scores due to agreement set, suggestin| 
an absence of definite attitudes; Those getting 
low scores due to a general opposition to 
authoritarianism, dogmatism, and anti-Semi- 
tism, suggesting a generally “liberal” ideology. 
But this ideology is unrelated to economic or 
political “liberalism” as measured by the 
Conservatism scale. There is no evidence for 
the contrasting ideology usually discussed’ 


h 


involving attitudes favorable to authori- 
tarianism, dogmatism, and anti-Semitism. 
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Discussion 


Previous studies using correlations between 
originals and reversals of the F Scale suggest 
that attitude content and agreement set are 
both important factors on the F Scale, attitude 
content being somewhat the more important 
factor. The present results support this con- 
clusion, but with a major qualification: the 
attitude content is primarily anticontent— 
attitudes opposed to authoritarianism, dog- 
matism, and anti-Semitism. The high scores 
that are usually attributed to the more dra- 
matic procontent attitudes of authoritarianism, 
dogmatism, and anti-Semitism are in fact 
usually due to agreement set. 

This result confirms the conclusion of the 
earlier analysis: on scales where items are 
ambiguous and all scored in one direction, the 
bias of response set toward "agree" rather 
than “disagree” responses has a differential 
effect on high and low scores and on inferences 
Írom the scores to the two kinds of attitudes. 
Inferences from low scores to anticontent 
attitudes are likely to be valid. Inferences from 
high scores to procontent attitudes are likely 
to be invalid, since these high scores are more 
often due to agreement set. 

The present study is relevant to several 
common interpretations: 

Assumption that agreement set is a mechanical 
process. A persistent assumption, with an 
early appearance in The Authoritarian Per- 
sonality itself (p. 59), is that agreement set 
ought to show up equally on every response. 
Then it is argued that agreement set cannot be 
important since subjects give different re- 
sponses on different items. 

As against this mechanical interpretation of 
agreement set, the present results show con- 
sistent differences for items on the different 
Scales, and support the position of Cronbach 
that response sets operate only insofar as the 
subject is uncertain. This uncertainty should 
be a joint function of the ambiguity of the 
Stimulus and the person's lack of definite 
attitudes. This interactive nature of response 
Set is brought out by the similar definitions 
given for “loaded question" by Payne (1951) 
and for "response set" by Cronbach (1946). 

Assumption that agreement sel is entirely 
general. One implicit form of this assumption 
is to treat correlations with a measure of 


general acquiescence as indicating the full 
operation of response set on a given scale. 
Although the present results give evidence for 
some generality of agreement set between 
scales, this does not fully represent the role 
of agreement set on the individual scales. 
Similarly, the evidence reviewed by Cronbach 
shows that the generality of response sets is 
low compared to their effect on given material. 

For the same reason it is doubtful whether 
response set on a given scale can be fully 
eliminated by a partial correlation technique 
using a measure of general response set (e.g., 
Webster, 1958). 

Assumption that high F scores represent 
authoritarians in any case. Several forms of 
this assumption appear to come full circle. 
Many of the very studies that have shown 
agreement set to be an important factor in the 
F Scale have then proceeded to explain these 
agreement tendencies as characteristics of 
authoritarians. 

In fact for the great bulk of studies of 
authoritarianism, agreement with original F 
Scale items is the only evidence for identifying 
authoritarian attitudes. Where results indicate 
that such agreement is usually due to response 
set rather than authoritarian attitudes, it 
becomes circular to interpret agreement set 
as a characteristic of “authoritarians.”” 

On the contrary, the present results suggest 
a reinterpretation of many of the correlates of 
high scores as actually reflecting an absence of 
any definite attitudes. An example would be 
the relation of high scores to lack of intellectual 
sophistication and to cognitive simplicity (cf. 
Jackson, Messick, & Solley, 1957). Such a 
reinterpretation also fits with the findings of 
Sanford (1950) and others (see Christie, 1954, 
p. 148), that those who get higher “authori- 
tarian” scores are actually apathetic and in- 
active politically. They would presumably be 
less resistant than those with consistent anti- 
authoritarian attitudes to a fascist movement, 
as they would to any movement. 


SUMMARY 


This study examined the roles of attitude 
content and agreement set in attitude items 
by using each item in two versions, stated in 
opposite directions. The items are relevant to 
the theoretical formulation of The Aulhori- 
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tarian Personality and Rokeach’s reformula- 
tion: items from the F, Dogmatism, and 
Anti-Semitism scales, where the original ver- 
sions are all worded in one direction and agree- 
ment always scored positively; and items 
constructed to refer directly to Conservatism 
on economic and political issues. 

From the known asymmetry of agreement 
response set (ie. most subjects tend to 
tespond “agree” rather than “disagree” when 
uncertain), it was argued that Tesponse set 
mainly affects high scores when agreement is 
always scored positively. Accordingly the 
design required separating agreements and 
disagreements with the original items, rather 
than using overall correlations that disguise 
the differential tendency for Tesponse set to be 
responsible for agreements. As expected, 
results indicated that for scales derived from 
The Authoritarian Personality, agreement with 
the originals could usually be attributed to 
agreement set rather than attitudes favorable 
to authoritarianism, dogmatism, or anti- 
Semitism. Disagreement could usually be 
attributed to attitudes opposing authori- 
tarianism, dogmatism, or anti-Semitism. In 
contrast, on the Conservatism items, responses 
consistent with both kinds of attitudes were 
more frequent than agreement set. The con- 
sistent differences of the items from the 
different scales suggests that a primary cause 
of agreement set is the ambiguity of the 
original items. 

Cross-national comparison and replication 
are provided by samples of 88 American and 75 
English engineering students. The two samples 
show close correspondence on the main results, 
As regards differences, except for the Con- 
Servatism items, where the English sample 
showed significantly less Conservatism on both 
versions, significant differences showed the 
English sample agreeing more with most sets 
of items, including both originals and reversals 
of the F and Dogmatism scales. 

Relations between scales were also exam- 
ined. The measure of economic Conservatism 
Showed no correlation with the other Scales; 
the correlations usually found for the F and 
prejudice scales appear due to the contamina- 
tion of other measures of "politico-economic 
conservatism" with other variables. The 
correlations between the F, Dogmatism, and 
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Anti-Semitism scales appear due to covaria ion 
of the same factors important within he 
separate scales: agreement set and attitude 
opposing authoritarianism, dogmatism, and 
anti-Semitism; but not to attitudes favorin 
authoritarianism, dogmatism, and anti-Semi: 
tism, 

The implications of the results were dis 
cussed, Revision is needed for those interpre 
tations treating agreement set as a mechanical 
Process, as entirely general, or as itself a 
authoritarian characteristic. Reinterpretation 
was suggested for many correlates of the F 
Scale measure of “authoritarianism” that 
may actually reflect a lack of any definite 
attitudes. 
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ACQUISITION OF AVOIDANCE DISPOSITIONS BY SOCIAL LEARNING: 


NORMAN MILLER? 
Northwestern University 


N their book on social learning, Miller and 
I Dollard (1941) effectively focused atten- 
tion upon acquisition of response tend- 
encies by one organism which unambiguously 
matched those of another. However, acquisi- 
tion of knowledge available for use independent 
of another animal's behavior was specifically 
“controlled out" of these studies. Only recently 
(Church, 1957) has attention again centered 
upon this other, and perhaps more important, 
aspect of social learning, namely, the acquisi- 
tion of response tendencies that can be 
exhibited in the absence of any model animal. 
While the “imitation studies” are necessarily 
concerned with the transmission of similar 
response tendencies from one organism to 
another, the present study is concerned with 
responses distincily different from those of the 
model animal. In addition, it employs aversive 
stimuli. Interestingly, negative results seem to 
characterize laboratory studies of animal 
social learning using aversive stimuli, both in 
instances where the correct response was one 
that matched a response of the model animal 
(Bayroff & Lard, 1944; Brogden, 1942; 
Solomon & Coles, 1954) and in a situation 
where the correct response did not bear close 
resemblance to a response of the model 
animal (Church, 1959). 

The present study also considers the role of 
overt responses in learning. Can learning occur 
simply by observing the stimulus response 
contingencies of another organism? Or is it 
necessary for some specific overt instrumental 
response to be elicited from the observer 


1 This paper is based on a dissertation submitted to 
the Department of Psychology, Northwestern Uni- 
versity, in partial fulfillment of the Tequirements for 
the degree of Doctor of Philosophy. 

Donald T. Campbell, Carl P. Duncan, Harold 
Guetzkow, Winfred F. Hill, and Janet A. Taylor made 
helpful contributions to the dissertation on which this 
paper is based. 

? Now at Yale University. 

*In a study in which cats observed models learn to 
solve puzzles for food reward (Herbert & Harsh, 1944), 
methodological errors leave the results largely uninter- 
pretable. 
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following observation? The implications of 
acquiring knowledge about the environment 
by simply observing the  reinforcements 
received by another organism in that environ- 
ment would seem to have striking importance 
for any comprehensive social psychological 
theory (cf. Campbell, in press). The individual 
organism is relieved of the burden of acquiring 
the necessary knowledge for each situation by. 
trial and error learning. Once having learned 
"the meaning" of pain cries, the observing 
organism can acquire response dispositions 
vicariously. An ecological map of “good and | 
bad" or "where the hunter places his traps" 
can be constructed from the misfortunes of 
others. 


METHOD 
Design 


Polar values of each of three variables, shock, 
confinement, and color, created the eight subgroups in 
the factorial design. For all subjects distinctive alley 
cues, white for half the subjects, black for the other 
half, were paired with a “stimulus animal’s” pain 
responses (Stage II). At no point during any part of the 
training procedure did a subject receive shock in the 
presence of these alley cues. While half of the subjects 
could make an escape response from the pain cries, 
the others were confined, preventing escape responses. 
Half the subjects in each of these four groups were 
given previous experience in a neutral environment in 
which a stimulus animal's pain response served as a CS 
for shock to the subject (Stage I). 

If all Confined subjects are confined with pain cries 
for a uniform length of time, subsequent performance 
differences between Escape and Confined subgroups 
may be attributable to differential exposure to squeal- 
ing. Therefore, each of the Confined animals in each 
subgroup was paired (yoked) with an Escape animal 
in the analogous subgroup in terms of classification on 
the volor and shock variables. Confinement lasted the 
same length of time on every trial that his yoked mate 
in the Escape group remained in the presence of squeal- 
ing stimuli on that trial before making his escape 
response. 

On termination of pretraining and pairing of squeals 
with color cues all subjects were given acquisition 
trials in a two-choice situation (Stage III). A black 
choice was punished for all subjects. Thus, dispositional 
strength toward alley cues previously paired with 
squealing was measured in terms of positive and nega- 
tive transfer to a new acquisition situation. 
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Subjects 


Albino rats 95-115 days old at the start of the ex- 
periment served as subjects. Of the total 64 subjects, 
56 were males and 8 females. Subjects were procured 
from any source in batches of 10, allowing 1 animal 
for each of the eight cells of the design with 2 spares in 
case of death or illness. Animals in each batch were 
randomly assigned to the eight conditions of the study. 
Thus, differences due to shipment or past experience 
were orthogonal to the design. Weights at the conclu- 
sion of the study ranged from 340 to 363 g. A three-way 
analysis of variance of weights for the eight subgroups 
yielded seven F ratios smaller than 1. 

Subjects were maintained in individual cages with 
a constant supply of Purina laboratory blox and water 
available 5 days before the onset of training and con- 
tinuing throughout the training period. Three days 
before the start of training, all subjects were given 10 
minutes of handling per day. 


Apparatus 


Two pieces of apparatus, a pretraining apparatus 
used throughout the entire experiment, and a modified 
type of straight alley used only in the latter half, were 
located in a small room illuminated by a 60-watt over- 
hanging light. A fan provided a masking noise. 

Pretraining box. In external dimensions, the pre- 
training box was 9" X 9" in length and width and 8" 
in height. All surfaces were patterned with .75" black 
and white horizontal stripes. The box was divided in 
half by Plexiglas extending vertically from the floor 
to within .5" of the ceiling. thereby permitting sounds 
in one half to be easily heard in the other, Nails, pro- 
truding vertically 2.5" from the floor and connected in 
pairs by copper wire, formed separate grids on each 
side of the partition. The two compartments were con- 
stantly illuminated by a 100-watt bulb in a shielded 
Socket mounted 3" above the lid of the box. Light 
entered through 150 34” holes in the lid. A screened 
opening on one wall, 5" wide and 8" high, allowed the 
experimenter to observe both compartments. 

Straight alley apparatus. A gridded straight alley 
was divided into five compartments with separate 
Plexiglas covers. The alley, 38" long, 7" wide, and 6.5" 
deep, was fitted with a grid floor of 6" steel bars set 
-5" apart. Two guillotine doors, created the 4.5" long 
center compartment when in their lowered position. 
On each side of the center compartment were two 
compartments, each 8.5" long, separated from one 
another by a Plexiglas partition. The entire surface 
area of the 2.5 compartments comprising one half of 
the alley, starting from the middle of the center com- 
partment and including both sides of one guillotine 
door, was flat white inside and extremely rough in 
texture. The other half was flat black and smooth. In 
addition, the two extreme black compartments each 
contained sets of five 34” holes which emitted flickering 
light from two screened 60-watt bulbs. The outside 
surfaces were grey. In the center of the box, at the 
junction of the black and white portions of the small 
center compartment, a grey door could be raised 
through the grid floor to divide the black and white 


sides of the apparatus. Two treadles covered the grid 
floor on each half of the center compartment formed by 
the lowered guillotine doors. The treadles mechanically 
operated electric timers. 

Stimulus rats. A large number of rats were main- 
tained for use as part of the apparatus. These “stimulus 
rats” were used in both the pretraining box and the 
straight alley to produce the squeals which served as 
crucial cues in parts of the experiment. During all 
trials in the pretraining apparatus, a stimulus rat was 
in one of the two compartments; during all trials in the 
straight alley, they were in the extreme black and white 
compartments, The single criterion of loud reliable 
squealing to the stimulus of shock was used in selecting 
stimulus rats. 

Shocking equipment. A 250 y. ac current was fed 
through a 1 meg. variable resistor into a shock scram- 
bler. The separate grids of both the pretraining ap- 
paratus and the straight alley could be independently 
charged. On all grids shock avoidance was impossible 
unless the rat stood on one leg. Shock intensity for any 
given rat was set by a criterion of “violent agitation 
without paralysis” as determined by pilot trials on rats 
of the same weight as the subject. The shock intensity 
in the more central black compartment, where a black 
choice was to be punished in the final acquisition stage 
of the experiment, was set at the lowest level available 
on the variable resistor. At this intensity, onset pro- 
duced slight but always perceptible agitation, and some- 
times squeals of low intensity. 


Procedure 


The order in which subjects were run was varied 
daily, within the limits imposed by the restriction that 
Escape animals be run before their yoked Confined 
animals, All subjects were given a 3-minute adaptation 
period in the experimental room before the daily session 
was begun. 

Stage I (Pretraining). Following taming, all subjects 
received 10 minutes of pretraining a day for 5 consecu- 
tive days. For the four Nonshock subgroups, the sub- 
ject was placed in one of the two compartments of the 
pretraining box for the scheduled period with a stimulus 
animal in the adjacent compartment. The four Shock 
subgroups, also placed in the pretraining box with a 
stimulus animal, received 20 pairings of the squealing 
from the stimulus animal with shock. Pairings were 
presented at a rate of approximately one every $5 
minute with a variable interval between pairings. A 
1-second interval separated the onsets of shock to the 
stimulus animal and the subject. Termination of shock 
to the stimulus animal was concurrent with onset of 2 
seconds shock to the subject. 

Stage II. Following the 5 days of 10-minute pre- 
training sessions, these sessions were reduced to 2.5- 
minute periods. For the Shock subgroups, pairings of 
squeal with shock were reduced from 20 to 10 per day, 
doubling the average number of pairings per unit of 
time. With this switch to shorter pretraining sessions, 
Stage II of the training was introduced. Training in the 
straight alley consisted of 10 1-minute trials per day 
with a 30-second intertrial interval. Upon termination 
of the daily session of pretraining, the subject was 
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placed in one of the two compartments adjacent to the 
center compartment, black or white depending on the 
subject’s color subgroup. Stimulus animals were always 
present in the two extreme compartments. The subject's 
home cage was 2’ behind the source of the squeals. If 
the subject was in an Escape subgroup, 3 seconds after 
placement in the apparatus, the grid under the stimulus 
animal was charged, eliciting loud squealing. Simul- 
taneously, the guillotine doors were raised, eliminating 
the small center compartment. If the subject made an 
escape response to the white (or black) compartment, 
defined as stopping the timer by pressure on the 
treadle in the oppositely colored center portion and 
full entry of subject (excluding the tail) into the 
oppositely colored section of the apparatus, the guillo- 
tine doors were closed behind him and he remained 
there for the remainder of the minute. Timing was 
automatically started with the onset of shock to the 
stimulus animal. If the subject did not make an escape 
response within 30 seconds after onset of shock to the 
stimulus animal, the subject was manually removed 
by the experimenter and placed in the oppositely 
colored compartment for 30 seconds. Shock to the 
stimulus animal was terminated by the subject’s escape 
response. For all trials, 1 minute after the subject was 
initially placed in the straight alley, he was returned to 
his home cage for the 30-second intertrial interval. 

As previously indicated, each subject in the Confined 
subgroups was given the identical treatment his mate 
in the Escape subgroup received with a single excep- 
tion: the guillotine doors were never raised to permit 
an escape response. Instead, after exposure to the 
stimulus animal’s squealing for the same amount of 
time that his mate in the Escape group exposed himself 
on that trial number, he was manually transferred by 
the experimenter to the oppositely cued compartment 
for the remainder of the 1-minute trial. Placement 
head or tail first was varied over trials and the subject’s 
feet were firmly on the grid before he was released. 

Stage III. On the sixth day after the start of Stage 
TI training (the eleventh day after the first day of 
Stage I), upon completion of 2.5 minutes of pretraining 
and three Stage IT training trials, Stage III was begun. 
All subjects were placed in the black-white center com- 
partment. Three seconds later, both guillotine doors 
were raised starting the timer. If the subject made no 
choice within 6 seconds, the center gray door was raised 
through the floor gently prodding the subject into 
responding without biasing choice. Thirty seconds 
after initial placement in the center compartment, the 
subject was returned to his home cage for the 30-second 
intertrial interval. Each subject was given, consecu- 
tively, 20 such two-choice trials. For all subjects, a 
black choice was always punished by 2 seconds of mild 
delayed shock delivered 2 seconds before termination 
of that trial. Thus, delay of punishment was an inverse 
function of the subject’s latency on that trial. During 
Stage III training, number of correct choices (white 
choices) and number of forced choices were recorded. 


RresuLTS 
Stage II 


Latency of the escape response for the 
Escape subgroups, measured in thousandths 
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Fic, 1. Mean log reciprocal of mean latency of 
escape response for Escape subgroups as a function of 
days of training. 


of a minute, was the only Tesponse measure- 


ment made during Stage II. Long latencies, | 
as well as a number of trials on which some | 


Nonshock subjects made no escape responses 
at all, created extreme heterogeneity of 
variance in raw latencies and precluded 
evaluation of latency data by a single trend 


analysis. However, separate Bartlett tests of | 
Shock and Nonshock subgroups, after trans- | 
forming mean raw latencies over blocks of | 
10 trials into log reciprocals, yielded insignifi- | 


cant x* values when applied over subgroupings 
by blocks of 10 trials (Shock groups: x 
6.43, df = 7, 50> p > .30; Nonshock groups: 
x? = 8.60, df = 7, 30> p > .20). Normality 


for both trend analyses was evaluated as 


Satisfactory by inspection. Results of the 
Separate trend analyses (Edwards, 1950) of log 
reciprocals of mean raw latencies for Shock 
and Nonshock groups are as anticipated from 
inspection of Figure 1. A significant F was 
found in the Shock 
8.85, df = 4 p< -001) but not for color or 
color X trials interaction, Among Nonshock 
groups, no F ratio reached significance. 


‘While the significant first trial differences in 


groups for trials (F = | 
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TABLE 1 


ANALYSIS OF VARIANCE OF ARCSINE OF PERCENTAGE 
or Correct CHOICES IN STAGE III 


TABLE 2 


COMPARISON OF MEANS WITHIN SHOCK SUBGROUPS 
ON PERCENTAGE OF CORRECT CHOICES 


Source af| MS F ? 
Shock 1| 16.777 | .007® 
Confinement 1| 35.943 | .39b 
Color 1 | 3460.683 | 1.43* 
Shock X Color 1 | 2428.018 |26.26° «.001 
Confinement X Color 1| 487.163 | 5.275. | .05 >p>.02 
Shock X Confinement | 1| 320.736 | 3.475 | .10 >p>.05 
Shock X Color X 1| 449.047 | 4.86 | .05 >p>.02 

Confinement 

Within groups 57| 92.465 


® Shock X Color MS used as error term. 
b Within groups MS used as error term. 


Stage III 


Correct choices. Table 1 presents an analysis 
of variance of percentage of correct choices 
transformed to arcsines. Since it was doubtful 
whether the required assumption of homo- 
geneity of variance was met (Bartlett’s test 
yielded a x? of 16.45, df = 7, .05 > p > .02), 
Lindquist’s (1953) suggestion of using a more 
stringent significance level was followed. 
Setting a at .01, only the F for interaction 
between shock and color remains significant 
(p < .001). However, the F value of 5.27 for 
interaction between confinement and color 
approaches the F of 5.29 required for signifi- 
cance at the .02 level (df — 60). 

Table 2 presents means, differences, and / 
tests between the Black and White subgroups 
and Escape and Confined subgroups com- 
prising the Shock group. These comparisons 
indicate the extent to which differential 
transfer occurred as a function of manipulation 
of color and confinement. Significant differ- 
ences obtain not only between Black and 
White Escape groups, but between Black and 
White Confined groups as well. However, the 
comparisons of Escape Black to Confined Black 
and Escape White to Confined White suggest 


latencies for Shock and Nonshock groups can readily 
be attributed to sensitizing effects of shock received in 
the pretraining box by the Shock groups, the improve- 
ment in their performance over days can hardly be 
attributed to this source. Note that they had received 

_ 110 shocks, distributed over 6 days, prior to their first 
placement in the straight alley. Since the number of 
preliminary shocks in the pretraining apparatus remain 
constant on successive days of straight alley training, 
there is no reason to expect progressive increases in 
sensitization. 


Com- | Mean Arcsine |Difer| , b | Com- 
parison* % Correct ence ? bined » 
Black | White 
EB > EW? | 78.39 40.55 |--37.84 | 6.23 | <.0005 «.001 
CB» CW*| 70.55 | 54.34 |-+16.21 | 2.91 | <.01 ^ 
Escape| Confined 
EB > CB* | 78.39 70.55 | +7.84 | 2.16 | <.025 <. 
EW < CW°| 40.55 54.34 |—13.79 | 1.86 | <.05 $ 


® The conditions are represented in the tables in abbreviated 
form: E = Escape, C = Confined, B = Black, W = White, S = 
Shock, and NS = Nonshock. 

> One-tailed test. 

© Predicted direction of difference. 


TABLE 3 
CowPARISON OF MEANS WITHIN NONSHOCK 
SUBGROUPS ON PERCENTAGE OF CORRECT 


CHOICES 
j Mean Arcsine | Differ- Com- 
Comparison % Correct ene | * | * bind 
Black | White 
EB and EW | 66.27 | 63.66 | +2.61| .97 |».20 | < o5 
CB and CW | 60.08} 57.90 | +42.17] .52|>.20] ^" 
Escape| Confined 
EBandCB | 66.27 | 60.08 | +6.19| 1.81 |€-10 | > o5 
EW and CW | 63.66 | 57.91 | +5.75 | 1.60 |<.20| ^" 
* Two-tailed test. 


superior learning in Stage II under Escape 
conditions. This superiority is shown by the 
higher percentage of correct responses for 
Black and lower for White when the Escape 
groups are compared to the Confined groups. 
Tf one is willing to consider comparison of the 
Escape subgroups and comparison of the 
Confined subgroups as independent tests of 
the same hypothesis about the effect of the 
color variable, use of the combined p value 
(Baker, 1952) in Table 2 is legitimate. A 
combined ? is also presented for the two 
comparisons of Escape and Confined groups. 

Table 3 parallels Table 2 but presents 
comparisons between Nonshock subgroups. 
None of the comparisons is significant. 

Table 4 presents means, differences, and / 
tests comparing Shock and Nonshock sub- 
groups. In summary, the results indicate that 
Shock groups show more transfer (positive or 
negative depending upon color) than Nonshock 
groups. 1 

Forced choices. The analysis of the forced 
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TABLE 4 


COMPARISON OF MEANS OF SHOCK AND NONSHOCK 
SUBGROUPS ON PERCENTAGE OF CORRECT CHOICES 


Black White 


i Com- 
Comparison ` 
Differ- bie 


Differ-| , Aue 


ence f 


12.12 |3.40|<.01 |—23.11 |4.12|«.01 |<.001 
+10.47 2.99.01 | —3.57 | .59|«.60 |«.05 
11.29. 4.14|«.001|—13.34 |3.02|<.01 |<.001 


SE and NSE 
SC and NSC 
AILS and NS 


“ Two-tailed test. 


TABLE 5 


ANALYSIS OF VARIANCE OF ARCSINE OF PERCENTAGE 
or Forcep Cxorces IN Srace III 


Source dí| MS F r 
Shock 1 471.9 | 1.52 
Confinement 1 48.7 16 
Color 1| 2163.9 | 1.17 
Shock X Color 1 95.4 | 3.31 10» 2».05 
Confinement X Color 1| 1027.5 | 5.94 4025» 95.01 
Shock X Confinement | 1 | 1843.4 31 
Shock X Color X Con-| 1 214.27 69 
finement. 
Within groups E 304.91 
TABLE 6 


CoMPARISON OF MEANS WITHIN SHOCK SUBGROUPS 
ON PERCENTAGE OF FORCED CHOICES 


ison | Mean Arcsine | Differ- Com- 
Vane > a Fareed’ | ence | |. 1? bined p 
Black | White 
EB «EW* | 26.25 | 60.29 |—34.04 |4.54| <.o00s> * 
CB«CW | 3646| 41.71 |—s.25 | 49 «. | €" 
Con- 
Escape fined 
EB < CB^ 26.25 | 36.46 |—10.21 |3.08| <.o0s> 
EW and CW | 60.29 | 41.71 |+18.58 |2.35| <.os¢ | € 


? Predicted direction of difference. 
> One-tailed test. 
* Two-tailed test. 


Choice data exactly parallels that of the 
percentage of correct choices. Arcsine trans- 
formations of the percentage of forced choices 
produced homogeneous variance over sub- 
groups. Barlett’s test yielded a x? of 6.24, 
df = 7,.70 > p > .50. Normality was satisfied 
by inspection. Analysis of variance (see 
Table 5) indicated a significant interaction 
between confinement and color variables 
(pe < .025). 

Table 6, paralleling Table 2, presents means, 


differences, and / tests between the Black an 


The comparisons of Black subgroups to White 
subgroups yield differences in the direction of 
fewer forced choices for Black than White, 
the difference obtaining significance only for 
the Black and White Escape groups. Compari- 
sons of Escape and Confined groups reflect the 
significant interaction between the confinement 
and color variables reported in the analysis of 
variance of forced choices. In the Black 
groups, where better learning in Stage II 
should yield more positive transfer to Stage 
III, the Escape group has significantly fewer 
forced choices than the Confined group. In 
the White groups, just the opposite obtains, 
Here, the better learning of the Escape group, 
which results in more negative transfer, is 
accompanied by more forced choices than the 
Confined White group. 

Table 7 parallels Table 6 but presents 


TABLE 7 


COMPARISON OF MEANS WITHIN NONSHOCK 
SUBGROUPS ON PERCENTAGE OF FORCED 


CnoicEs 

Comparison | MeimArsine |Digerenee| , | j^ | em, 

Black | White 
EB and EW | 30.05] 40.74 | —10.69 | 1.15 |>.20 » 
CBand CW | 37.82| 34.36 | +43.46| .48 |>.20 

Escape Mist 
EBand CB | 30.05| 37.82} —7.77 | .91|».20 >.05 
EW and CW | 40.74] 34.36 | +6.38 | .79 |>.20 i 

* Two-tailed test. 


TABLE 8 


COMPARISON OF MrANs OF SHOCK AND NONSHOCK 
SUBGROUPS ON PERCENTAGE OF Forcen CHOICES 


Black White 
c s Com- 
Differ- Differ- Wu 
ence M ence t » 
SEand NSE | —3.80 |1.24|<.30 |--19.55 |2.83/<.02 | <.05 
SCandNSC | —1.36 | .145.20 | +7.35 | .82]>.20 | 5.05 
AllSandNS | —2.58 | .38|>.20 |--13.31 |2.21/<.05 | 5.05 


* Two-tailed test, 
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comparisons between Nonshock subgroups. 
No comparisons approach significance. 

Table 8 presents means, differences, and / 
tests comparing Shock and Nonshock sub- 
groups. The only subgroup difference that 
reaches significance is that between Shock 
and Nonshock Escape white subgroups. 


Discussion 
Nonshock Groups 


The design supplies a clear test of whether 
pain squeals have aversive properties for rats. 
The results confirm Church’s (1959) finding 
that there is nothing aversive about squeals 
for naive white rats. Not even within the first 
block of 10 trials do escape latencies for 
Nonshock animals show any tendency to 
decrease. For those intent on more thoroughly 
exploring the possibility of innate social 
behavior at levels on the phylogenetic scale 
as far removed from human beings as the 
rat, the present study would not be definitive, 
nor should it be. A first point of concern 
might be the effect of the prolonged inbreeding 
of the albino rat (cf. Richter, 1959). Of interest 
is the success of Frings and Jumber (1954) in 
clearing starlings’ roosting areas by broad- 
casting tape recordings of the starling fear 
cry. Specificity of the effective stimulus is 
indicated by results with controls for intensity 
of stimulation. However, the extent to which 
prior learning may be required to establish the 
particular pattern of sound as aversive 
remains indeterminate. 


Shock Groups 


Two aspects of the acquisition by Shock 
groups of response dispositions toward environ- 
mental cues constitute the most striking 
features of the study. First, this acquisition 
was mediated by social responses, viz., 
another rat’s pain response was elicited in*the 
Presence of these cues. And second, the 
response disposition transferred to a new 
learning situation regardless of whether or 
not the subject was permitted to learn an 
Overt escape response to these cues. While the 
relative performances of Shock and Nonshock 
groups may not have been predictable in 
advance in that little information was available 
concerning the innate aversiveness of the 


pain responses of another animal to an ob- 
server, behavior theory does contain well 
established laws for confidently predicting the 
effect of the color and confinement variables 
on the performance of the Shock groups. 

The pairings of squeals with shock estab- 
lishes squeals as aversive. The noxiousness of 
squeals for those subjects shocked in pre- 
training was evidenced by the decreasing 
escape latencies to the squeal stimulus by 
Shock Escape subjects. Squeals maintained 
their aversive properties for an extended 
period of time in stimulus conditions discrimi- 
nably distinct from those under which initial 
pairings with shock were carried out. 

Correct choices. To predict the effect of the 
color variable on correct choices in Stage III, 
two laws must be stated. Alley cues paired 
with cues possessing acquired aversive proper- 
ties acquire aversive properties themselves. 
When white is defined as a correct choice and 
black as an incorrect choice, speed of learning 
is facilitated by past experience which estab- 
lished black as aversive and white as reward- 
ing; similarly, speed of learning is depressed by 
past experience which established white as 
aversive and black as rewarding. The aversive 
strength acquired by alley cues paired with 
squealing was exhibited by the predictable 
transfer effects to the new acquisition situ- 
ations of the different response tendencies 
previously learned toward these cues. Transfer, 
as exhibited in arcsine percentage of correct 
choices, was positive for White groups and 
negative for Black groups. 

The probability of a running response 
being elicited by the relevant cues on any test 
trial is a function of the number of times the 
running response has been elicited by these 
cues in the past. Confinement, which did not 
allow a running response to become attached 
to alley cues, restricted the amount of transfer, 
whether positive or negative. That is, while 
Confined groups showed learning, the learning 
was not as great as that for the Escape group. 
The design rules out differential acquisition of 
competing responses as the factor yielding 
performance differences between Confined 
and Escape groups, for both spent an equal 
amount of time on every trial in that straight 
alley compartment which was paired with 
squealing. Differences seem to be attendant 
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upon differential reinforcement of the running 
response and attachment of this running 
response to alley cues such that the alley 
cues come to elicit the running response. 

Forced choices. In predicting the number of 
forced choices, two laws require joint con- 
sideration. The number of forced choices is an 
inverse function of the discrepancy in the 
attractiveness of the alternative choices. 
Also, the number of forced choices is an 
inverse function of the number of times the 
available cues have elicited a running response 
in the subject's past history. For Black groups, 
Stage II training established black as a 
noxious cue by pairing it with squealing. In 
Stage III, failure to make a correct choice 
results in delayed shock in the presence of 
black. Errors in Stage III only confirm the 
dispositions toward black already acquired in 
Stage II and, thus, maintain high discrepancy 
between the attractiveness of the alternative 
choices. For White groups, Stage II established 
white as a noxious cue, but Stage III estab- 
lished black too as a noxious cue, thereby 
creating the requisites for what is termed the 
avoidance-avoidance conflict situation (or 
double approach-avoidance if one wants to 
consider secondary reward effects). Thus, the 
obtained results of more forced choices for 
White groups as opposed to Black groups is 
clearly predictable. 

However, the consideration of the effect of 
Escape and Confinement on number of forced 
choices within each color classification leads to 
more complexity. For the Black groups, the 
expectation of more forced choices for Confined 
subjects as opposed to Escape subjects is 
clearcut since color cues elicit no running 
response at the start of Stage III. Simply 
orienting their bodies away from the black 
and towards the white would effectively 
eliminate the presence of aversive black cues. 
"Therefore they are more likely to require 
"forcing of a response" before leaving the 
center compartment.5 


5 On the other hand different predictions for Escape 
and Confined White groups may not be possible. The 
Confined subjects would be more prone to stand still in 
the center compartment in that they were not trained 
to make an escape response at the sight of white. But 
the Escape subjects, who have a stronger tendency to 
approach black, may be in a greater conflict situation; 
the two competing response tendencies, approach black 


Norman MILLER 


SUMMARY 


To determine whether rats can acquire 
response dispositions toward an environment 
by exposure to pain responses of other rats 
that environment, 64 albino rats were placed in 
distinctive compartments with stimulus rats 
in identical adjacent compartments emit: ing 
pain cries. For half the subjects, pain cries 
were paired with black, for the other half 
white. Polar points on two additional variables, 
confinement and shock, created the eight 
cells of the factorial design. Half of the subjects 
in each color group were permitted to make 
escape responses from the squeal compartment 
to an oppositely cued nonsqueal compartment, 
The Confined half of the subjects were paired? 
with the Escape subjects and retained in the) 
presence of squealing for a time interval equal 
to the latency of the escape response of thei 
mate in the Escape group on that trial number, 
For half of both the Escape and Confined. 
subjects, squealing had previously served asa 
CS for shock in a neutral environment. : 


white was measured by transfer to a new 
two-choice acquisition situation in which a 
black choice was punished for all subjects. 
For Nonshock groups, pain responses of 
other rats had no aversive properties. Pain 
responses can acquire aversive properties 
Shock groups, both Escape and Confined, 
acquired aversions toward those environments” 
paired with squealing, though Escape groups 
showed better learning than Confined groups. 
Performance of Shock groups in terms of 
Percentage of correct responses and percentage 
of forced responses in the final acquisition 
Situation was predictable from laws of condi- 
tioning, transfer, and conflict. f 


learned in Stage II and avoid black learned in Stage 
III, may be more persistently similar in strength. The. 
tying of a running response to the aversive properties 
of white may make the reluctance to enter white more 
difficult to extinguish for the Escape subjects through 
out Stage IIT. The question of which case, that of the 
White Confined group with less habit strength for run- i 
ning from the center compartment, or that of the | 
White Escape group with greater probability of high 
degree of conflict, is more likely to lead to a greater 
percentage of forced choices is an empirical matter 
that is probably dependent upon the particular combi- 
nation of circumstances. In the present instance, results 
were in the direction of considerably more forced 
choices for the White Escape group. 


AES 
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RIGIDITY, NEGATIVE REACTION TENDENCIES, AND COSATIATION 
EFFECTS IN NORMAL AND FEEBLEMINDED CHILDREN: 


PATRICIA SHALLENBERGER 
University of Missouri 


Te theoretical positions have been 
advanced to explain the differences in 
the amount of rigidity manifested by 
familial feebleminded subjects and normal 
children of the same MA. A view that has had 
considerable influence on the training and 
treatment of feebleminded individuals is that 
of Lewin (1936) and Kounin (1941, 1948), 
who attributed the greater rigidity of the 
feebleminded to the lessened permeability of 
the boundaries separating intrapersonal 
regions, Other investigators (O'Connor & 
Hermelin, 1959) have explained the differences 
by the hypothesis that feebleminded subjects 
employ verbal cues to a lesser degree than do 
normal subjects of the same MA. 

Although in certain cases these two Positions 
generate contradictory predictions, they share 
& common underlying assumption. Both 
views assume that although a normal and a 
feebleminded person may have the same MA, 
the defective possesses characteristics that 
somehow inhere in his feeblemindedness, 
Thus, both emphasize an immutable difference 
between normal and feebleminded individuals, 
even when matched on MA, 

A third group of investigators have taken 
exception to the view that rigidity is an 
inherent characteristic of the feebleminded, 
Stevenson and Zigler (1957) have advanced 
the alternative that such differences are 
related to motivational discrepancies between 
the groups. This hypothesis assumes that 
institutionalized defectives tend to be rela- 
tively deprived of adult contact and approval 
and, hence, to have a higher motivation to 
Secure such contact and approval through 


1 This paper is based upon a Master of Arts thesis 
done by the first author (1959) under the direction of 
the second. We wish to thank George A. Johns, Super- 
intendent of the Missouri State School, and Virginia 
Fisher, Ruth Ragsdale, and the staff and directors of 
the Home Economics Child Development Laboratory 
of the University of Missouri for their aid in making 
subjects available. Preparation of this paper was 
facilitated by Research Grant M-3945, National 
Institute of Mental Health, United States Public 
Health Service. 
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support are positively related to the amount. 
of social deprivation experienced (Zigler, 1958), 

A cosatiation type of task has often been 
employed as a measure of rigidity, The 
subject is instructed to perform a response and; 
is allowed to continue until he wishes to stop, 
He is then instructed to perform a highly’ 
Similar response until again satiated. The 
cosatiation score is a measure of the degree to 
which performance on the first task influences 
performance on the second. Both the theo- 
retical positions of Lewin (1936) and Kounin 
(1941, 1948) and of Stevenson and Zigler 
(1957) predict that feebleminded subjects 
Should continue the second task longer than 
normal subjects. However, none of the posi- 
tions outlined here appears capable of explain- 
ing the recurring finding (Kounin, 1941; 
Zigler, 1958; Zigler, Hodgden, & Stevenson, 
1958) that feebleminded subjects under 
certain conditions perform longer on the 
second than the first task. 

In an effort to explain this finding Zigler 
(1958) advanced the following hypothesis: 


Institutionalized feebleminded subjects begin Task 
One with a positive reaction tendency higher than that 
of normal subjects. This higher positive reaction 
tendency is due to the higher motivation of feeble- 
minded subjects to interact with an approving adult. 
At the same time, feebleminded subjects begin 
Task One with a negative reaction tendency higher 
than that of normal subjects. This higher negative 
reaction tendency is due to a wariness of adults 
Which stems from the more frequent negative encoun- 
ters that feebleminded subjects experience at the hands 
of adults. If Task One is given under a support condi- 
tion, subject's negative reaction tendency is reduced 
more during Task One than is his positive reaction 
tendency. 


The institutionalized child learns during 
Task 1 that the experimenter is not like the 
adults he encounters in the institution who 
often initiate painful experiences (physical 
examinations, shots, etc.) with supportive 
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comments, This reappraisal of the experi- 
mental situation results in a reduction of the 
negative reaction tendency. When the de- 
prived child is then switched to Task 2, he 
meets it with a positive reaction tendency 
that has been reduced less than has been his 
negativé tendency. The result is a longer 
performance on Task 2. 

The finding (Zigler et al., 1958) that normal 
children exhibit a decrease in length of per- 
formance during Task 2 as compared to Task 
1 follows if one assumes that normal children 
have a relatively low negative reaction tend- 
ency when they begin Task 1. When normal 
subjects are switched to Task 2, their positive 
reaction tendency has been reduced more 
through fatigue and satiation effects than 
any negative reaction tendency they may have 
had. The result is a briefer performance on 
Task 2. The hypothesis concerning the opera- 
tion of a negative reaction tendency in the 
institutionalized feebleminded gains some 
support from the clinical impression advanced 
by Sarason and Gladwin (1958) that one of 
the major effects of institutionalization is an 
“avoidance and fear of new problem solving 
situations." 

Thus, the cosatiation score may be a measure 
of motivational determinants rather than of 
"inherent" rigidity. The purpose of the 
present study was to provide an experimental 
test of this position. The two-part experimental 
task employed in this study to obtain cosati- 
ation scores was similar to one employed 
earlier (Zigler, 1958; Zigler et al., 1958). The 
present study differs from earlier ones in that 
three pre-experimental games—under two 
conditions of reinforcement—preceded the 
experimental task. In a positive reinforcement 
condition, all of the subject’s responses met 
with success, and he was further rewarded 
with verbal and nonverbal support from the 
experimenter. This condition was assumed to 
reduce the subject’s negative reaction tend- 
ency. In a negative reinforcement condition 
that presumably increased negative reaction 
tendency, all of the subject’s responses met 
with failure, and the experimenter further 
punished the subject by noting his lack of 
success. Both normal and  feebleminded 
subjects were used. 

The two conditions were introduced to 
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TABLE 1 
Mean MA, CA, AND IQ For ALL Groups 
Group ee SD Mean 
Normal, Positive 61) 51 119 
Normal, Negative 6.0| 5.2] 115 
Feebleminded, Positive 6.3 | 11.9 53 
Feebleminded, Negative 6.3 | 12.6 50 


provide a test of the following predictions, 
derived from the hypothesis that feebleminded 
subjects have both higher positive and negative 
reaction tendencies than do normal subjects: 

1. Subjects who play the pre-experimental 
games under the negative reinforcement 
condition evidence a greater increase in time 
spent on Task 2 over that spent on Task 1 
than do subjects who play the pre-experi- 
mental games under the positive reinforcement 
condition. 

2. Feebleminded subjects who play the 
pre-experimental games under the negative 
reinforcement condition evidence a greater 
increase in time spent on Task 2 over Task 1 
than do normal subjects who play the pre- 
experimental games under this condition. 

3. Subjects who play the pre-experimental 
games under the positive reinforcement 
condition spend more time on Task 1 than do 
those subjects who play the pre-experimental 
games under the negative reinforcement 
condition. 

4. Of the subjects who play the pre-experi- 
mental games under the positive reinforcement 
condition, the feebleminded spend more time 
on Task 1 than the normals. 


METHOD 


Subjects 


The subjects consisted of! a group of 20 feeble- 
minded and a group of 20 normal children, matched on 
the basis of MA and sex. There were 10 boys and 10 
girls in each group. The normal subjects were obtained 
from the Missouri University Nursery School and the 
feebleminded subjects from the Missouri State School 
at Marshall, Missouri. The feebleminded subjects were 
all of the familial type, and no individuals with gross 
motor or sensory disturbances were used. The 1937 
revision of the Stanford-Binet, Form L, was used to 
obtain MA ratings. Two reinforcement conditions were 
employed. Half of the normal and half of the feeble- 
minded subjects were assigned to one condition. The 
other half of the subjects was assigned to the other. 
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Assignment was made so that all groups would have 
comparable MAs and so that there would be an equal 
number of boys and girls in each group. The mean 
MA, CA, and IQ of each group are presented in Table 
1. 


Apparatus 


Three pre-experimental games were employed, the 
Card Game, the Nursery School Game, and the Nail 
Board Game. The experimental task used was Marble- 
in-the-Hole. 

The Card Game apparatus consisted of two sets of 
cards. The cards were constructed of cardboard, 2 X 3 
inches, with small rectangles of red or black paper 
mounted on them. There were 12 cards in each deck, 
one set containing only red rectangles, the other only 
black rectangles. 

The Nursery School Game apparatus entailed two 
green panels, each 17 inches square and each having 
six hooks. Twelve pictures of young children mounted 
in plastic covers with a hole at the top of each were 
also provided. A small red light was placed above each 
panel. 

The Nail Board Game apparatus consisted of a red 
board 18 inches square, inclined at a 50-degree angle 
with two small boxes at the bottom of the board, one 
painted yellow and the other green. Nails placed on the 
surface of the board formed several paths, all of which 
led to the yellow box. A round glass container was 
used to hold 12 marbles. 

The Marble-in-the-Hole apparatus consisted of a 
yellow box, 9 X 8 X 7 inches, with two small round 
holes in the lid 2 inches apart, 300 blue and 300 orange 
marbles, and a red open container, 5 X 6 X 7 inches, 
to hold the 600 marbles, 


Procedure 


All subjects were given the three pre-experimental 
games in a random order, followed by the experimental 
task. Half the normal and feebleminded subjects were 
given the pre-experimental games under a positive 
reinforcement condition and the other half of the sub- 
jects under a negative reinforcement condition, The 
type of subject, normal (N) and feebleminded (F), and 
the reinforcement condition, positive (p) and negative 
(n) were used to designate the four experimental groups: 
N-p, N-n, F-p, and F-n. The responses of the subjects 
in the N-p and F-p groups always met with success on 
the pre-experimental games, i.e., the games were pre- 
arranged so that the positive groups could make only 
correct responses. The N-n and F-n groups could make 
only incorrect responses due to the Prearranged nature 
of the games. 

Additional reinforcement was given to the N-p and 
F-p groups in the form of verbal and nonverbal support. 
The nonverbal support consisted of smiling and nods 
of approval. The verbal support consisted of four 
statements made in random order at predetermined 

times during the subject’s performance. The four sup- 
Port statements used were, “You really know how to 
play this game,” “That’s fine,” “Very good,” and “You 
really know how to play (name of the game), don’t 
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you?" One of these statements was given after ey 
ninth response the subject made. 

"The N-n and F-n groups received four statements, 
disapproval: “You're not doing very well,” “That's m 
very good,” “You aren’t getting any right,” and “ 
really don’t play (name of the game) very well, 
your” These comments were also made in randon 
order after every ninth Tesponse. Care was taken 
avoid nodding, smiling, or giving other types of no 
verbal support to the subjects in these groups. 


were employed in a random order, one after every tent 
marble was inserted. F. ollowing the completion of the 


had played the games. After completing the expel 
negative groups w 
allowed to play the pre-experimental games again, 
time he received positive reinforcement for his efforts, 
remaining negative feelings he mi 
have about his performance. 

Subjects were tested individually. Upon meeting the 
subject, the experimenter said, “We're going to play 
some games today. These are all fun games, and I thin 
that you will like them." The experimenter then er 
plained the instructions for the first pre-experimenti 


were wrong. 
the N-p and F-p groups all the cards were on 2 
exposed, the S 
and after each black card, the Experi 
menter said “wrong.” 
In the Nail Board Game, 
the apparatus and told that he was to start a marble 
at the top of the board. If it rolled into the right box 4 
he “won.” As the marble always went into the yellow 
box, the subjects in the N-p and F-p groups were told i 
that the yellow box was correct. The subjects in the” 
N-n and F-n groups were told that the green box was 
correct. H 
In the Nursery School Game, the subject was told j 
that the panels represented nursery schools and that 
half of the children in the pictures went to ont- 
nursery school and half to the other. If he placed the | 
picture of the child in the correct nursery school 
panel, the red light would go on. Since there was 10 | 
Principle by which the pictures could be separated, — 
reinforcement was given arbitrarily, depending upon i 
the group to which the subject belonged. All of the 
Tesponses of the subjects in the N-p and F-p groups © 
Were reinforced, whereas none of those in the N3 
and F-n groups were, 
After completing the three pre-experimental games, - 
the subject was introduced to Part I of the experimental 


1 
the subject was shown 


^ 
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task, Marblein-the-Hole. With the container filled 
with 600 marbles and the yellow box placed in front of 
the subject, the experimenter said: 

This is a game we call Marble-in-the-Hole. Ill 
tell you how to play it. You see these marbles. Some 
of them are blue and some of them are orange. They 
go in these holes. The blue ones go in the front hole 
and the orange ones go in the back hole. (The 
experimenter points to the appropriate holes.) Now 
show me a blue marble. Put it in the hole it goes in. 
Now show me an orange marble. Put it in the hole 
it goes in. You can put as many marbles in the holes 
as you want to. You tell mewhen you want to stop. 

The subject then played the game until he indicated 
that he wished to stop, either by telling the experi- 
menter he wanted to stop, or by not inserting a marble 
for 30 seconds. 

In Part II, after the subject indicated that he wished 
to stop, the experimenter emptied the marbles from 
the box back into the container and then said: 

Now I'll tell you how to play this game. This time 
we put the orange marbles in the front hole and the 
blue marbles in the back hole. Put an orange marble 
where it goes. Now put a blue marble where it goes. 
You can put as many marbles in the hole as you 
want to. You tell me when you want to stop. 

The subject was allowed to continue until he told the 
experimenter that he wanted to stop or did not insert a 
marble for 30 seconds. 


RESULTS 


The experimental task, Marble-in-the-Hole, 
provided satisfactory measures of performance. 
No subject in any group mentioned that Part 
II was similar to Part I. Each subject re- 
sponded to Part II as if it were a new game. 
The maximum number of responses permitted 
by the task was large enough so that only 
four subjects inserted all 600 marbles, two 
in the F-n Group and two in the F-p Group. 


Analysis of Predictions 


The results were analyzed initially in terms 
of the predictions made at the beginning of the 
study. One-tailed tests of significance were 
employed. Because of the nonhomogeneity of 
variance found for the groups, all estimates of 
standard errors were computed from „the 
sums of the separate variances, and the table 
of ¢ was entered with half the usual number of 
degrees of freedom. The performance of the 
four groups on the experimental task is 
presented in Figure 1. The cosatiation score 
for each subject was obtained by subtracting 
the time he spent on Part I from the time 
he spent on Part II. 

The first prediction was that both feeble- 
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Fic. 1. Mean time in seconds spent on Part I and 
Part II of the experimental task by each group of sub- 
jects. 


minded and normal subjects who received 
negative reinforcement on the pre-experi- 
mental games would evidence higher cosati- 
ation scores than those who received positive 
reinforcement, The results were analyzed 
separately for each type of subject. The 
difference in the average cosatiation scores of 
Groups F-n (295.7 seconds) and F-p (—136.4 
seconds) was 432.1 seconds, which is signifi- 
cant (f = 3.34; p < .005). The difference in 
the average cosatiation scores of Groups N-n 
(77.4 seconds) and N-p (—72.4 seconds) was 
149.8 seconds, which is also significant (£ = 
3.66; p < .005). The prediction was fulfilled, 
The second prediction, that feebleminded 
subjects who received negative reinforcement 
on the pre-experimental games would evidence 
higher cosatiation scores than normal subjects 
who received such negative reinforcement 
was also confirmed. The difference in the 
average cosatiation scores of Groups F-n 
(295.7 seconds) and N-n (77.4 seconds) was 
significant at exactly the .05 level (¢ = 1.83). 
The third prediction was that subjects who 
received positive reinforcement on the pre- 
experimental games would spend more time 
on Part I of the experimental task than 
subjects who received negative reinforcement. 
Again the results were analyzed separately 
for each type of subject. The difference in the 
average time spent on Part I by Groups F-p 
(500.3 seconds) and F-n (376.2 seconds) was 
124.1 seconds. While this difference was in the 
predicted direction, it was not significant 
(t < 1.00). The difference in the average time 
spent on Part I by Groups N-p (170.0 seconds) 


24 


and N-n (231.9 seconds) was 61.9 seconds. 
This difference was significant in the direction 
opposite to that predicted (¢ = 242; p < .05, 
two-tailed test). 

The fourth prediction was that feebleminded 
subjects who received positive reinforcement 
on the pre-experimental games would spend 
more time on Part I of the experimental 
task than normal subjects who had received 
such positive reinforcement. The difference in 
the average time spent on Part I by Groups 
F-p (500.3 seconds) and N-p (170.0 seconds) 
was 330.3 seconds, which is a significant 
difference (¢ = 2.64; $ < .02). 


Other Measures of Performance 


Tn earlier studies (Zigler, 1958; Zigler et al., 
1958), the total time spent on the experi- 
mental task was used as the measure of the 
subject’s motivation to secure contact and 
approval, i.e., his positive reaction tendency, 
Therefore, the total time Scores—time spent 
on Part I plus time Spent on Part II—of the 
four groups of subjects were analyzed for 
differences, Again the heterogeneity of the 
Variances necessitated the use of individual 
t tests. The average total times spent on the 
experimental task by Groups F-n, F-p, N-n, 
and N-p were 1048.1, 864.2, 541.2, and 267.6 
Seconds, respectively. Across reinforcement 
conditions, the difference of 551.8 seconds in 
the average total time Spent on the task by 
feebleminded (956.2 seconds) and normal 
subjects (404.4 Seconds) was found to be a 
Significant one (£ = 3.90; p < .001). The 
difference in the average total times of Groups 
F-n and F-p was 183.9 Seconds, which is not 
significant (t < 1.00). The difference in the 
average total times of Groups N-n and N-p 
was 273.6 seconds, which is significant (¢ = 
5.54; p < .001). Feebleminded subjects, thus, 
played the game longer than the normal 
subjects, and following the pre-experimental 
negative reinforcement condition, normal 
subjects played the game longer than they 
did when the game was preceded by the 
positive reinforcement condition. No evidence 
was found to indicate that the pre-experi- 
mental reinforcement conditions affected the 
amount of total time the feebleminded Subjects 

Spent on the game, 
In order to assess the comparability of the 
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cosatiation scores obtained in this study- 
difference in amount of time Spent on Parts 
and II—with Kounin's type of cosatiatig 
scores—difference in amount of time Spent or 
Parts I and II divided by time spent on Pan 
I—Kounin's formula was employed to comput 
a second cosatiation score for each subject 
A product-moment correlation was then ru 
between the two types of cosatiation scores 
The resulting correlation (r = .904) indicated 
that the two measures are highly related, 


Discussion 


The findings of the present study indicat@ 
that cosatiation effects do not mirror thé 
person's inherent rigidity. Instead, cosatiation 
Scores appear to be measures of the relati 
Strength of certain motivational variables or 
Positive and negative reaction tendencies 
These tendencies and their relative strengths 
seem to be the product of particular environ: 
mental events that the person has experienced 
and apparently are open to manipulation and 


offer further validation for the motivational 
hypothesis advanced in earlier studies (Steven: 
son & Zigler, 1957; Zigler, 1958; Zigler et aly 
1958), they also represent evidence favoring 
an extension of this hypothesis, Whereas the 
earlier studies emphasized an increased desire 
to interact with and receive the support of. 
an adult, the present study demonstrates the 
role of another motivational variable, negative” 
Teaction tendency. The experiencing of events 
of the sort that has been described as socially | 
depriving gives rise to both an increased 
desire to interact as well as a wariness to do 80. 
It remains for future research to isolate the 
specific events that give rise to each of these 
opposing motivational factors. 
The findings of an earlier study, in conjunc - 
ion with the results of the present one, allow 
for a tentative Suggestion as to the chronology ` 
of the growth of positive and negative tend- 
encies. Degree of preinstitutional social 
deprivation (Zigler, 1958) was found to be 
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directly related to positive motivation to 
perform on the two-part experimental task, 
but it was not found to be related to the 
cosatiation score. Cosatiation scores, however, 
were found to be related to length of institu- 
tionalization. Since the present study indicates 
that the cosatiation score is sensitive to the 
amount of negative reaction present, it may 
be that the negative reaction tendency is a 
product of institutional rather than prein- 
stitutional events. 

There is a distinct possibility that prein- 
stitutional and institutional experiences differ 
in kind. This follows if one conceptualizes 
preinstitutional social deprivation as char- 
acterized by the relative absence of positive 
social reinforcers, whereas institutional social 
deprivation not only lacks such reinforcers 
but includes punishing events as well. This 
attempt at a discrete division between such 
events raises certain questions. One may 
wonder if severe deprivation or extreme 
absence of social reinforcers does not itself 
constitute a punishment condition. Further- 
more, one may wonder whether if, when 
punishing events are present, positive social 
reinforcers are necessarily absent. 

One of the findings of the present study 
offers evidence that conceptualizing a particu- 
lar set of environmental events as producing 
only a negative reaction tendency results in 
an inaccurate prediction. Whereas three of 
the four predictions made at the outset of the 
study were confirmed, the prediction that 
Subjects who received positive reinforcement 
would spend more time on Part I of the 
experimental task than subjects who received 
negative reinforcement was not confirmed. 
In fact, normal subjects receiving negative 
pre-experimental reinforcement played Part I 
of the task significantly longer than did 
normal subjects who received positive rein- 
forcement. . 

The unconfirmed prediction was derived 
from the view that the negative condition 
builds up the negative tendency which would 
result in decreased performance on Part I. 
How then can one explain the lengthier 
performance by the negative normal group 
over that of the positive? Conceivably, the 
condition built up the positive reaction 
tendency as well as the negative tendency. 


This heightening of both reaction tendencies 
occurred because the negative condition not 
only consisted of mildly punishing events 
that promoted wariness and reluctance in the 
subjects, but it also represented an absence of 
positive social reinforcers. Thus, following 
the deprivation of positive reinforcement on 
the pre-experimental games, the subjects had 
an increased desire to secure such reinforce- 
ment. It then becomes understandable why 
the support experienced on Part I of the experi- 
mental task retains reinforcing value longer 
for the normal subjects who played the pre- 
experimental games under the negative condi- 
tion than for those who received prior positive 
reinforcement, 

Evidence that the negative reaction tend- 
ency also was built up during the negative 
pre-experimental condition can be found in the 
confirmation of the prediction that negative 
normal subjects show an increase in time 
spent on Part II over that spent on Part iE 
Further evidence that the negative condition 
builds up the positive reaction tendency is 
demonstrated in the finding that the total 
time spent on the experimental task is signifi- 
cantly greater for negative normal than for 
positive normal subjects. Coincidentally, the 
finding that the total time normal subjects 
spent on the experimental task is significantly 
related to the pre-experimental reinforcement 
condition contradicts Kounin's (1939) view 
that lengthy performance on this type of 
task is attributable to the "rigid state" of the 
individual and adds support to the notion that 
length of performance may be explained more 
parsimoniously in terms of motivational 
factors. 

In light of the suggestion that the negative 
condition builds up the positive as well as the 
negative reaction tendency, one may ask why 
negative feebleminded subjects did not spend 
more total time on the experimental task or on 
Part I of it than did the positive feebleminded 
subjects. The simplest explanation would 
seem to be that the institutionalized feeble- 
minded person has such a high motivation to 
interact with and gain the support of an adult 
that it is relatively difficult to increase his 
positive tendency above this already existing 
high level. 

One further suggestion is in order. Further 
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investigation of such positive and negative 
reaction tendencies, their interactions, and the 
specific events which give rise to them, may 
clarify issues much broader than the trouble- 
some finding that under certain conditions 
feebleminded individuals play a second part of 
a two-part cosatiation task longer than 
the first part. The current controversy over 
whether social deprivation leads to an increase 
in the interaction of the socially deprived 
individual or whether such social deprivation 
»leads to apathy and withdrawal (Cox, 1953; 
Goldfarb, 1953; Irvine, 1952; Wittenborn & 
Myers, 1957) is a case in point. 


SUMMARY 


Feebleminded and normal children played 
three pre-experimental games under two 
conditions of reinforcement, followed by a 
two-part experimental task given under a 
support condition, Half the subjects in each 
group received positive and half negative 
reinforcement during the games. Following 
satiation on Part I of the experimental task, 
subjects were switched to Part II and again 

~ allowed to play until satiated, Four predictions 
were advanced, derived from the hypothesis 
that the cosatiation score is a measure of 
motivational determinants rather than a 
measure of the inherent rigidity of individuals. 
The results in general supported this hy- 
pothesis, 
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INTERRELATIONSHIPS AMONG MEASURES OF FIELD DEPENDENCE, 
ABILITY, AND PERSONALITY TRAITS! 


ROGERS ELLIOTT? 
University of Illinois 


HE ambitious study of relationships 

between personality and perception by 

Witkin and his colleagues (Witkin, 
Lewis, Hertzman, Machover, Meissner, & 
Wapner, 1954) had at its core the construct 
“field dependence.” This concept was derived 
from individual differences on a large battery 
of intercorrelated tests whose common charac- 
teristic was that a subject must make a judg- 
ment of verticality on the basis of kinesthetic 
cues, resisting at the same time visual cues 
which, if used, led to error. Witkin’s colored 
version of the Gottschaldt-figures test was 
added to this battery of tests because of its 
high correlation with them; it does not require 
a verticality judgment, but does, like the other 
tests, demand resistance to an error-inducing 
visual field. Field dependence is indexed by 
poor performance on these tests. 

On the basis of interpretations of a number 
of projective tests and an interview, the Witkin 
(Witkin et al., 1954) group summarized the 
salient personality correlates of field depend- 
ence under two major headings: activity- 
passivity, and orientation toward inner life. 
The modal field dependent student showed 
very little initiative in challenging the struc- 
ture of the status quo: he? was conventional, 
submissive to authority, got into comfortable 
ruts. He was said to be blandly unaware of 
his inner feeling life, and to fear, deny, and 
have poor control over such impulses as sex 
and aggression. 

The modal field independent person, quite to 
the contrary, actively attempted to master and 


1 Adapted from a PhD thesis done at the University 
of Illinois under the direction of C. W. Eriksen. The 
author wishes to express his appreciation to C. W. 
Eriksen, and to the following members of the faculty: 
J. McV. Hunt, W. C. Becker, G. R. Grice, and L. A. 
Hellmer. 

? Now at the Allan Memorial Institute of Psy- 
chiatry, McGill University. 

* The background review, the study, and its dis- 
cussion are concerned with males only. It is clear from 
the results of Witkin, Lewis, Hertzman, Machover, 
Meissner, and Wapner (1954) and Bieri, Bradburn, 
and Galinski (1958), that the introduction of sex 
$a variable in perception-personality relationships 
leads to a large and bewildering number of interactions. 
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reorganize his environment: he strove for 
independence, leadership, special skills, and 
competencies. He was concerned with his 
inner life, with the motives of his own and 
others’ behavior, and could, for example, 
express his hostility with directness and 
control, He was not so readily likeable as his 
field dependent counterpart, his autonomy and 
resistiveness including doses of suspiciousness 
and extrapunitiveness. 

Since a great many subsequent investiga- 
tions of field dependence have used only the 
simplest of Witkin’s criterion tasks, one may 
evaluate most studies in terms of “frame 
dependence” (field dependence defined by the 
Rod and Frame test, or RFT) or “design 
dependence” (field dependence defined by the 
Embedded Figures test, or EFT). The tenta- 
tive conclusions of the Witkin group have 
received mixed support. Field dependence has 
been reported to relate as expected, positively 
with gregariousness and conventionality, 
negatively with rated interpersonal hostility 
(Crutchfield, Woodworth, & Albrecht, 1958), 
and negatively with the Rorschach M score, 
the Barron-Welsh Art Scale, and various 
other measures of creativity and originality 
(Bieri, Bradburn, & Galinski, 1958; Crutch- 
field et al., 1958). The proposed negative 
relationship between introspection or psycho- 
logical mindedness and field dependence has 
been tested once, without support (Bieri et al., 
1958). 

Most central in assessing the general result 
of the Witkin report is the study of the 
relationships between field dependence and 
the dependent personality. Among clinical 
groups, alcoholics are more frame dependent 
than any other diagnostic group except 
patients with diffuse brain damage (Bailey, 
Hustmyer, & Kristofferson, 1959; Witkin, 

, & Goodenough, 1959). We cannot 
know as yet whether this result is accounted 
for by the psychodynamic dependence said to 
be involved in the etiology of alcoholism, or 
by diffuse brain damage resulting from 
alcoholism. Linton (1955) reported that 
design dependence related positively with 
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increase in reported autokinetic movement in 
the presence of influence exerted by a con- 
‘federate. In a post-experimental interview, 
subjects were rated for degree of felt disruption 
in the situation before the advent of the 
confederate, and the degree of megativism, 
defined as a deliberate decision to avoid 
being influenced by him. Both of these vari- 
ables correlate to a significantly higher degree 
with the EFT than they do with the auto- 
kinetic change scores, and the relationship 
between design dependence and autokinetic 

. change vanishes when either interview variable 
is partialed out of it. It seems possible that 
field dependence was related more to some 
kind of disruptive emotional response to the 
strange experimental situation than it was to 
conforming behavior per se. Two other 
behavioral measures of conformity—namely, 
attitude change toward “authoritative” 
sources, and “wish-fulfilling” errors in a 
reasoning test—were weakly related or not 
related with design dependence. 

Gross (1959) found that she could increase 
the frame dependence of all of her subjects by 
introducing a bogus “distorting lens" between 
them and the apparatus, and she interpreted 
this result to be the effect of increased un- 
certainty in the use of external cues, She 
appears to have been guided in this interpreta- 
tion by the finding that in the normal rod 
and frame task, highly frame dependent 
subjects tend to check themselves as feeling 
“uncertain” significantly more often than 
frame independent subjects, who in turn 
check themselves as feeling “expectant.” 
Block (1957) presents data in indirect support 
of Gross’ notion. Groups which differed 
significantly in frame dependence‘ did not 
differ in degree of yielding in an Asch-like 
conformity task; but they were markedly 
different (p < .001) in the degree of self-rated 
confidence with which they made their 
judgments. 

These results from Linton (1955), Gross 
(1959), and Block (1957) indicate that the 
field dependent person does not always act 
dependently. It may be more accurate to 
state that he reacts with disruption and ineffec- 


4 The two groups were selected on the basis of their 
difference in GSR reactivity, the correlates of this 
variable being the focus of Block’s (1957) interest. 
The highly reactive group was also the field dependent 


group. 


tiveness in the face of strange or unstructured w 
stimulus configurations, such disruption lead. | 
ing to conforming behavior only when there is _ 
available something obvious to conform to, like 

a luminous frame, or a confident confederate, 

The relationship between measured intelli- ' 
gence and field dependence was not established 
by Witkin, but the trend of recent results’ 
(Bieri et al., 1958; Bound, 1957; Crutchfield © 
et al., 1958; Jackson, 1957) is about as follows, i 
When field dependence is measured by the | 
EFT, it tends to be significantly related with | 
any measure of ability, and to share more | 
common variance with quantitative-spatial $ 
tests (about 30%) than with verbal tests 
(about 10%). When field dependence is; 
measured by the RFT, it tends to have slight 
negative relationships with ability measures. 

Tf the two tests are not equivalent with regard 
to intelligence, what other limits are there to 
the assumption that they are equivalent | 
measures of field dependence? The answers 
(Bound, 1957; Gruen, 1955; Young, 1959), in | 
terms of the intercorrelation between the | 
two tests, are very similar: they have perhaps i 
15% of their variance in common, a very | 
considerable drop from the 55% first reported. 
by Witkin. In cases where some version of the 
EFT other than Witkin's is used (e.g., that of 
Thurstone, 1944) it is difficult to guess its | 
validity as a measure of field dependence. 3 

The effort of the present study is to replicate 
and extend the findings just reviewed, using 
the RFT and EFT as criterion measures, in | 
intercorrelation with various measures of 
intelligence, uncertainty, dependent behavior 
and attitude, intraception and introspection, 
self-awareness, and self-reported attitudes 
and traits. Specifically, the following relations 
are expected to hold: 

1. The intercorrelation between the RFT 
and the EFT is significant, but substantially 
lower than has been reported. 

2. Fast performance on the EFT is related 
positively with high ability, and more so with 
quantitative ability than with verbal ability; i 
ability scores are less likely to correlate k 
significantly with the RFT. " 

3. With statistical control on ability scores; » 
field dependence relates negatively with 
success in learning a finger maze under condi- 
tions of visual deprivation, because field 
dependence implies reliance on visual cues, 
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and because a maze test of this kind presents 
a highly unstructured situation. 

4, In a virtually impossible puzzle task, in 
which the subjects are allowed to request help, 
scores representing requests for direct help and 
for indirect emotional support are positively 

' related to measures of field dependence. 

5. Scores on a measure designed to assess 
psychological mindedness (intraception) relate 
negatively with measures of field dependence. 

6. A test designed to assess self-concept 
differentiation is negatively related to measures 
of field dependence. 

7. A measure of independence of judgment 
is negatively related to measures of field 
dependence. 


METHOD 
Subjects 


'The subjects were 128 male undergraduates from all 
classes and colleges at the University of Illinois. All 
but 4 were enrolled in the course in introductory 
psychology and were nonvolunteers. The other 4 were 
volunteer summer students from more advanced 
psychology courses. 


Aptitude Scores 


_ For 100 subjects, scores from either the School and 
College Ability Tests (SCAT) or its predecessor, the 
ACE, were on record at the University of Illinois 
Student Counselling Service. Decile scores for Quanti- 
tative (Q) and Linguistic (L) subscales were taken for 
these subjects. 


Perceptual Tests 


Rod and Frame test (RFT). The frame of this appa- 
tatus was 40” square, made of four 1" X 40" slats. The 
rod was a slat 1” X 38”. Both were mounted on the 
same center, but pivoted independently. Both were 
coated with luminous paint. The entire apparatus was 
placed 8’ in front of the subject’s chair, centered 
approximately at the subject’s eye level. Just in front of 
the rod and frame was a curtain which could be drawn 
80 as to conceal the entire apparatus. To rotate the rod, 
the subjects activated a selsyn by turning a knob that 
was mounted next to him at seat level. The test room 
was completely dark, except for the luminous rod and 
frame, during a judging trial. Between trials, the con- 
cealing curtain was drawn, and the light was switched 
on by the experimenter to hinder dark adaptation by 
the subject and to facilitate readings by the experi- 
menter, 

"The apparatus was concealed when the subject 
entered. He was seated, given instructions on the use 
of the rotator knob, and told that his job would be 
to rotate the rod to the true vertical, "straight up and 
down like'a plumb line." Tlie lights were turned out, 
the curtain opened, and the subject made his first 

- Setting, telling the experimenter when he was finished. 
The experimenter then closed the curtain, turned on 
the light, recording the reading, feset the apparatus, 


turned out the light, and opened the curtain for the 
next trial. There were 16 trials. The direction of tilt of 
the frame was alternated trial by trial, right or left. 
The degree of tilt of the frame from the vertical was 
systematically varied (32 or 26 degrees), as was the 
initial tilt of the rod, to prevent the use of movement 
cues. 

The basic score for this task, labeled RFT, was the 
sum over all judgments of deviation from true vertical 
(in degrees). Since the distribution of these scores 
showed marked positive skew, a common log trans- 
formation was imposed to force a better approximation 
to normality, and this score was labeled LRFT. For 
use in correlations with items, rod and frame scores 
were dichotomized; and in correlation with a seven- 
point personality scale, they were forced into an 
approximately normal distribution of eight ordered 
categories. 

Embedded Figures test (EFT), Two forms of this test 
were used. Thurstone’s (1944) is a timed, group ad-- 
ministered test in which the subject must select which 
of four complex figures contains a given simple figure. 
The score, rights minus wrongs, was reversed in sign so 
that high scores represent poor performance; it is 
labeled TEFT. Witkin’s (1950) form, labeled WEFT, 
is individually administered, and the complex figures 
are colored. His administration was followed, except 
for an item time limit of 3 rather than 5 minutes. Nine 
of his 24 items were used, so selected as to give the test 
a greater average difficulty than Witkin reported 
(hence a score distribution of reduced positive skew), 
and to maintain graded difficulty levels. These items 
are A-1, C-1, C-2, D-1, D-2, E-1, E-3, G-l, H-1. The 
score for this task was the sum of times for all items. 
EFT scores were both dichotomized and categorized in 
the same fashion as the RFT scores, for the same 
purposes. 


Performance Tests 


Block Puzzle test. This test was very similar to that 
used by Bernardin and Jessor (1957). It consisted of 
12 wooden pieces which were fitted together to form 
a cube. It is difficult under any circumstances, and the 
short time limit imposed in this study made it virtually 
impossible. The subject was told that he would be 
asked to solve the puzzle in 15 minutes, that it was very 
difficult, and that because of its unusual difficulty the 
experimenter would be willing to give him as much 
help as he felt he needed: i.e., whenever the subject 
felt “stuck” enough, he could ask for help if he wished, 
and the experimenter would put the next piece in for 
him. The subject was told that the experimenter 
would take some notes on his manner of attacking the 
problem, but the notes were actually a protocol of every 
statement made by the subject to the experimenter. 
The criteria for such statements were that the subject 
made a statement or question while looking at the ex- 
perimenter, or, if not looking at the experimenter, 
made an utterance so loud and clear that it could be 
considered more than “talking to oneself.” Four catego- 
ries were scored: direct requests for help; statements 
implying a request for support, when the subject asked 
how he was doing, whether he was doing all right, etc; 
requests for information concerning instructions, time 
limit, etc.; and other comments. 
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Forty-four subjects, including two who solved the 
puzzle (in 12 and 14 minutes) received no scores in any 
category. Because of the low frequency of scorable 
utterances in any given category, especially in the 
“help” and “support” categories, all block puzzle 
variables were scored dichotomously to represent the 
simple presence or absence of a scorable statement. 

Maze test, This test was also modeled on one used by 
Bernardin and Jessor (1957). It consisted of a 2’ X 2’ 
plywood board on which were glued 2” segments of 
copper wire in such manner as to form a maze of 20 
choice points. The correct path ran roughly counter- 
clockwise around the board. The subject was blind- 
folded, and instructed that he would have 15 minutes 
to learn the maze and that it was quite possible to do 
so. The experimenter guided the subject’s finger to the 
first choice point to give him some idea about what the 
maze felt like. The subject was further instructed on 
what constituted error (blind alley entrances and 
‘ retracing) and what constituted success (three con- 

secutive perfect trials). Scores were the number of 
trials to criterion, the number of errors from the second 
trial to the last, and the total time spent learning the 
maze. Since the total time scores were skewed because 
of failures, they were cast into seven ordered categories, 
from 7 (representing failure) to 1 (representing success 
in less than 9 minutes). An unexpected change of 
design and scoring method resulted in the elimination 
of the data on 58 subjects, leaving the data on 70 
subjects for analysis.* 

Uncertainty measure. The measure of uncertainty 
was the time in seconds spent by the subject on the first 
trial of the maze test, lack of external structure being 
maximal in this situation. This variable was referred 
to as “first trial time.” Since this measure was derived 
from the Maze test, data are available for only 70 
subjects, 


Pencil-and-Paper Tests 


Psychological Mindedness test. This is the modifica- 
tion in form and scoring made by Bieri et al. (1958) of 
the Kelly (1958) Role Construct Repertory Test. The 
subject must identify a specific person best fitting each 
of seven role descriptions of persons likely to be im- 
portant in his interpersonal world: the seven roles 
described are those of mother, father, brother, best 
friend, current girl friend, ex-girl friend, and the 
subject himself. Each of the 24 items in the test is a 
different combination of three of the above persons, and 
the subject must describe how two of the three persons 
in an item are alike in some important personal charac- 
teristic, and different from the third person in that 
respect. The characteristic and its opposite for each 
item form a bipolar construct. Each construct was 
judged as either representing or not representing 


5 The original design called for the experimenter to 
follow Bernardin and Jessor (1957) in being hostile to 
half of the subjects. The experimenter found it difficult 
to make hostile comments in a standard way, and almost 
impossible to do so with certain subjects. The attempt 
was abandoned after over 50 subjects had been tested, 
and later analysis demonstrated that the experi- 
menter had been selectively hostile with these sub- 
jects in terms of their EFT scores. 


psychological mindedness, roughly defined as an 
emphasis on the perception of enduring motivational 
and dispositional aspects of interpersonal relations, 
As an example, the construct “impulsive-not impulsive” 
contributed a unit toward the total score; the construct ' 
“likes to dance-does not enjoy dancing" did not. The 
score was the number of constructs judged as repre- 
senting psychological mindedness. The correlation of 
scores of two independent raters (the author and a 
colleague in clinical psychology) for 66 subjects was 
r = .93, and for these subjects, the average of the two 
ratings was used; for the remaining 62 subjects the 
author's score was used. 

Self-concept differentiation. This test consisted of 49 
pairs of opposed trait or descriptive phrases, the mem- 
bers of each pair being separated by a seven point 
scale—e.g., “sociable —:—:—:—:—:—:— aloof.” 
Many of the traits were selected on the basis of their | 
reported correlation with conformity (Barron, 1953; 
Crutchfield, 1955), with field dependence (Crutchfield — 
et al., 1958; Witkin et al., 1954), and with perceptual | 
closure factors (Pemberton, 1952). Many others were 
included simply to lengthen the measure, so that à 
subject would have maximum opportunity to demon 
strate his checking behavior. The chief score derived 
from this test was the information statistic (Ir) (Garnet 
& Hake, 1951) representing a subject’s relative use or 
disuse of all seven scale points available to him. If 
the subject, for example, used each of the seven scale 
points seven times, he was differentiating maximally 
among them. The assumption is that the degree of 
scale checking differentiation is related monotonically 
with degree of self-concept differentiation. In addition, - 
each item pair served as a variable by itself in correla- 
tion with RFT and EFT scores, as a check on previous - 
results concerning the trait correlates of field de 
pendence. 

Independence of Judgment scale. The scale contained 
all of the 22 items reported by Barron (1953) and 10 
of those reported by Crutchfield (1955) as differentiat- 
ing yielders from independents in an experiment on 
conformity like Asch’s. Five of the Crutchfield items 
were reversed or reworded slightly. These items were 
embedded in a 62-item scale constructed with a view to 
controlling the operation of an acquiescent set. The 
score was the number of items answered in the inde- 
pendent direction. In addition, phi correlations were 
computed between dichotomized RFT and EFT 
distributions and each item in the scale. The sample 
percentage of true responses for the 62-item scale was 
49%. 


Collection and Analysis of Data 


All subjects participated for three experimental 
hours. The first was a group session, including eight 
subjects on the average, during which they were given 
the Thurstone EFT, the Psychological Mindedness 
test, the Self-Concept Differentiation test, and the 
Independence of Judgment scale. The Block Puzzle and 
Maze tests were individually administered in the second 
hour, and the Witkin EFT and RFT in the third. 
Scores for all variables were intercorrelated by the 
local high speed computer (Iliac). The WEFT and 
RFT score distributions were also broken into high 
middle, and low groups of 20 each. The eta’s were com- - 


puted from these three arrays and compared with the 
corresponding Pearson 7's for the joint distributions of 
every variable with each of these two measures of field 
dependence. In no comparison was there any significant 
difference between the two coefficients. As noted, 
correlations involving Maze test data are based on 
70 subjects; correlations involving ability scores are 
"based on 100 subjects; and correlations between maze 
measures and ability measures are based on an N of 
55. Inspection of the means and variances of the sub- 
sample variables revealed no evidence that the sub- 
samples were unrepresentative of the full sample. All 
significance tests are two-tailed. 


RESULTS 


Table 1 provides the relevant information 
regarding the intercorrelations among measures 
of field dependence and ability. As expected, 
the relationships among perceptual tests are 
all highly significant, but as in other replica- 
_ tions employing the RFT and WEFT, the 
size of the relationship is only about .40, 
significantly lower than the .76 originally 
reported by Witkin. Only the WEFT is 
correlated significantly with both ability 
scores, and though its correlation with quanti- 
tative ability is greater than that with lin- 
guistic ability, the difference is not significant. 
& Rod and frame scores, raw and transformed, 

show only very slight, nonsignificant relations 

with ability scores. These data lend general 
: support to the expectations concerning inter- 
criterion and criterion ability relations in 
field dependence. For the RFT scores, the 
mean was 3.0; SD, 2.8. These statistics are 
much the same as those reported by Gross 

(1959) (mean, 2.9; SD, 2.0) for 71 under- 

graduates, but much lower than the figures 


: 
; 


TABLE 1 


INTERRELATIONS AMONG MEASURES OF FIELD 
_ DEPENDENCE AND ABILITY 


Variable WEFT | TEFT Q L 
Rod and Frame test, raw | .42*** | .30*** | —.09 —.19*. 
scores (RFT) 
Rod and Frame test, log | .38*** | .30*** | —.13 —.07 
scores (LRFT) 
Witkin Embedded Fig- distet Coras: bth | teat bag 


ures test (WEFT) 
urstone Embedded 
Figures test (TEFT) 
intitative score (Q) 
> Linguistic score (L) 


7 72:197 


.35e** 


Note.—Coefücients involving aptitude scores are based on N = 
100. For the rest, N = 128. 
* p <.10, two-tailed test. 
** p «.05, two-tailed test. 
*** p <.01, two-tailed test. " 
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TABLE 2 


CORRELATIONS OF Maze TEST MEASURES WITH 
MEASURES or FIELD DEPENDENCE AND ABILITY 


Field Dependence Ability 
igni (N = 70) WW = 58) 
Maze Variable 
RFT | LRFT | WEFT| TEFT| Q | L 
First trial time .48***| .30***| .20* 415 749 |—.15 
Trials —.22* |—.08 —.08 01 .25*| 101 
Errors «05 .09 $2A** |) 125875191520: 442 
"Total time .26** .26** 32429 | SUM LOA LE .02 


* p <.10, two-tailed test. 
** p «.05, two-tailed test. 
*** p <.01, two-tailed test. 


TABLE 3 * 
PARTIAL CORRELATIONS RELEVANT TO THE ANALYSIS 
or FIELD DEPENDENCE IN RELATION TO MAZE 
PERFORMANCE 


Field Dependence 
Maze Variable Maze Variable 


RFT | LRFT | WEFT | TEFT 


First trial AIPORT) 228 10 02 ‘Total time 
time 
Trials —.10 First trial 
time 
"Total time .05 16 «17 .26** | First trial 
time 
Total time 43 9 


Errors 


Note.—Cell entries are partial r's between the variables enter- 
ing from the top and left of the table. The variable entering the cell 
from the right is the one partialed out. N = 70. 

*p «.10. 
“p <.05. 
** p «01. 


reported by Witkin et al. (1954), (mean, 
7.4; SD, 5.5). 

The results concerning the expected relations 
between Block Puzzle test variables and 
field dependence measures are uniformly 
negative. The 16 intercorrelations between 
the four puzzle variables and the four per- 
ceptual measures range from —.06 to .08, 
with a mean of .005. There is thus no evidence 
that field dependent subjects tend more than 
field independent subjects to ask for help, 
moral support, or information, or to make 
more conversational comments in a difficult 
task where such requests are allowed. Inter- 
correlations among puzzle test variables were 
quite low, ranging from .08 to .22 and aver- 
aging .16. 

Table 2 presents the correlations of the 
Maze test variables with scores on field 
dependence and ability, and Table 3 includes a 
set of partial correlations which help to clarify 
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these relationships. The expectation that field 
dependence should be positively related to 
poor performance on the maze is borne out 
(see Table 2) in the significant correlations 
between the perceptual measures and the 
total time to criterion, the measure of success. 
But in the case of the rod and frame scores, 
the relationship with degree of success in 
maze performance is primarily a function of 
the time spent on the first trial; when the 
variance of the first trial times is partialed 
out, the correlations of RFT and LRFT with 
total time collapse, as does the correlation 
between RFT and number of trials (see Table 
3). The WEFT and TEFT correlations with 
total time appear to be less determined by 
scores for first trial time, and more determined 
by error production A conclusion which 
exaggerates small differences is that frame 
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dependence predicts a rather slow exploratory 
attack on the maze problem, whereas design 
dependence predicts slow learning throughout 
the problem. 

Since the poorer maze performance of the 
frame dependent subjects is primarily aci 
counted for by their relatively slow tal 
around the maze on the first trial, as jus’ 
noted, the expectation concerning field depend" 
ence and uncertainty in an unstructured 
situation is met, though it does not hold if 
either version of the EFT is the measure of | 
field dependence. It was possible, post hoc, to 
check this result with some data gathered 
after the study had begun. It was clear, after 
a fifth of the subjects had taken the RFT, 
that they varied considerably in the time they 
spent completing the 16 judgments, The 
remaining 101 subjects were timed on the 


TABLE 4 


CORRELATIONS OF MEASURES OF FIELD DEPENDENCE AND ABILITY WITH PSYCHOLOGICAL MINDEDNESS, 
INDEPENDENCE OF JUDGMENT, AND SELF-CONCEPT DIFFERENTIATION. s 


Field D. idi Abili 
S N P (N bi o) 
Measure 
RFT LRFT WEFT TEFT Q L 
l 
Psychological mindedness —.07 —.10 —.03 —.09 203v a225 
Independence of judgment —.15* —.11 —.12 —.19** .14 238" 3 
Self-concept differentiation —.10 —.06 —.07 —.08 —.13 z405 
p< a0 r 
p< 05, 
TABLE 5 ) 
CORRELATIONS BETWEEN SELF-RATED TRAITS AND MEASURES OF FIELD DEPENDENCE 
(The trait correlating positively with field dependence is stated first) 
Trait RET WEFT 
P. 
16. Like being with others, sociable vs. Prefer solitary activities .25 E! 
17. Patient and reasonable with others vs. Impatient, demanding of others .19 -21 
38. Practical, commonsensical vs. Theoretical, intellectual 19 419 
4. Conventional, proper vs. Unconventional, eccentric 220 45 
40. Dependable, trustworthy vs. Irresponsible, forgetful 16 16 
6. Quick, energetic vs. Slow in speech and movement 220 i 
23. Ambitious vs. > "20 
Hi EH El vs. Moody, temperamental 119 
. Seldom lose poise and composure in a social . Easi - i 
rassed except with Class BERE (GO Scone UE 19 
7. Inclined to blame self when things go wrong vs. Inclined to blame others when j 
things go wrong i 17 
9, Thorough, persevering vs. Undisciplined, distractible in work 116 
44, Analytical, focused on points vs. Global, attend to wholes 16 
13. Accept authority easily vs. Resistant to authority 45 
14. Thoughtful, reflective, pensive vs. Active, matter-of-fact, unphilosophical Er 
12. Friendly, attentive to others vs. Cool, aloof toward others $ 
19. Bold, adventurous vs. Cautious, timid 


5. Considerate, polite vs. Rude, insolent 


Note.—For N = 128, the .01, .05, and .10 levels for r are .228, .174, and -146, respectively, for the two-tailed test. 
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À TABLE 6 
CORRELATIONS BETWEEN ITEMS AND MEASURES OF FIELD DEPENDENCE 


Item 


RFT WEFT 


t 


have been substantially affected by my parents. 


of authority. 


unimportant matters. 
25. I become easily bored and impatient with people, 
slow and uninteresting. 


hoped he might get by with it. 
47. I like to read about science. 
58. I am not sensitive to the decorations in a room. 


8. When one is a subordinate in some activity, one can either quietly accept —.20 
authority, or criticize the authority. While a. good case can be made for 
either approach, I have always been rather strongly and openly critical 


and I find many people =:18 


35. In the important decisions of my life, I have collaborated closely with, and .16 .26 


1:1, 


26. I tend to see the advantages and disadvantages in any course of action to —.16 .20 
a degree where I have a hard time making up my mind, even in relatively 


—.16 


38. I like to get suggestions from others, and frequently ask people for advice. .20 
2. Some of my friends think that my ideas are impractical, if not a bit wild. 
, 29. At times I have been so entertained by the cleverness of a crook that I have 


43. I consider myself sociable and gregarious and have a wide acquaintanceship. .24 


; 24. In my conduct I tend to let custom be my guide and to avoid the uncon- +22 


ventional. 


v 


without suffering great regrets. 


asthma, walking while asleep. 
16. The unfinished a 
the completed and polished. 
than they really do. 


and seldom change it. 


. I could cut my moorings—quit my home, my family, and my friends— 


19. Science should have as much to say about moral values as religion does. 
31. One or more of the following has been true of me: having hay fever, having 


and the imperfect often have greater appeal for me than 


|. 50. I think that, in general, people pretend to care more about one another 


20) 


—.18 
—.18 


—4 
—.16 


- 30. Once my mind is made up on a matter, I usually put aside previous doubts 1S 


task, with the expectation that the more a 
J subject is field dependent, the more he should 
J be disrupted by trying to fit conflicting cues 
into his category of verticality—leading to 
slower as well as poorer judgments. These 
latency scores correlate .16 with the RFT 
scores (not significant) and .31 with their log 
transformation (p < .01); correlations with the 
WEFT and TEFT are —.01 and —.04. Again 
there is the suggestion that frame dependence, 
but not design dependence, is related to the 
subject's performance latencies in unstructured 
situations. z 

The results appearing in Table 4 make it 
clear that there is no support for the expecta- 
.. tions that field dependence is associated with 
little differentiation in the self-concept, little 
‘chological mindedness, and much con- 
y amity. Only the measure of independence 
-of judgment approaches being significantly 
related with any measures of field dependence, 
_ and even this relationship shrinks to insignifi- 
cance when the variance of linguistic aptitude 
Scores is partialed out of it. 


. Note—For N = 128, the .01, .05, and .10 levels for phi are .228, .174, and .146, respectively, for the two-tailed test. 


Table 5 and Table 6 summarize the self- 
reported trait pairs and items which correlate 
at better than the 10% level with RFT and/or 
WEFT scores. (Trait and item correlates with 
the TEFT are omitted solely for economy of 
presentation; they present a very similar 
personality pattern.) To summarize these 
trait and item patterns by caricature, the 
field dependent person rates himself as a kind 
of incarnation of social “desirability, a boy 
scout reciting his oath; while the field inde- 
pendent person rates himself as a touchy, 
brooding, intellectual malcontent, some soli- 
tary mixture of prophet, beatnik, and angry 
young man. There are a couple of incon- 
sistencies—i.e., Item 26, and Trait Pair 
14—but all in all these self-report correlates 
are in good congruence with those described 
by Witkin and his colleagues (1954) and 
reported by Crutchfield et al. (1958). 


Discussion 


Perhaps the clearest results of this paper 
are those on the relationships among criterion 
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tests of field dependence and between such 
tests and ability measures. Both the RFT and 
EFT distract the subject with misleading 
visual cues, and this similarity is attested by 
the significant, although moderate variance the 
tests share. They have a marked degree of 
statistical alienation, however, and a good 
deal of it is probably a function of obvious 
differences in item content and administration. 
The EFT contains items very similar to those 
found in aptitude tests and, like such tests, it 
is timed: hence there are abundant cues for 
the arousal of concern over evaluation and 
achievement motivation. The RFT has a 
minimum of such cues: its items (trials) are 
not timed and are not overlapping with items 
in aptitude tests. These considerations may 
account for the fact that the EFT is consist- 
ently more strongly related to measures of 
aptitude and of learning than is the RFT. 
It would seem, therefore, that the RFT and 
other tests involving judgment of verticality 
are “purer,” less multifactorial measures of 
field dependence than any EFT. 

The question of major interest, however, is 
whether field dependence has anything to do 
with dependence in personality and behavior. 
A conception foreshadowed in the introductory 
discussion of the researches of Linton (1955), 
Gross (1959), and Block (1957) may serve as 
a guide in the evaluation of these results: 
that field dependence is the tendency to react 
with affective and intellectual disruption when 
the subject's situation is marked by unusual- 
ness, incongruity, confusion, or lack of struc- 
ture in general. (All these terms are descriptive 
of the RFT situation.) Such disruption lasts 
until some system or order is imposed upon 
the situation, either by the subject or by 
some external agency. A field dependent 
subject is relatively unable to impose such 
structure autonomously: hence if an order is 
introduced, he accepts it, and we can observe 
dependent behavior; if an order is not intro- 
duced, he remains disrupted (a condition 
that could best be observed, perhaps, with 
psychophysiological measures). 

Consider the Maze test situation. For any 
blindfolded subject it is highly unstructured, 
and remains so at least throughout the first 
trial until he can develop some picture of the 
maze pattern. It is on just this first trial 
that the frame dependent subject takes so 


much time, hesitating to commit himself, 
and making slow, painful progress. After this 
trial, he performs normally. | 
Now consider the Block Puzzle test, meas- | 
ures of which were uncorrelated with all 
other variables. In terms of the presen 
conception, the task proves nothing. A subject l 
could not allay his presumed uncertainty and | 
confusion by depending on an imposed struc- | 
ture or order, because none was ever offered, 
Task instructions left it very much open | 
whether the subject should ask for help, or | 
even whether the puzzle could or should be | 
solved. The assessment of degrees of disruption | 
is therefore crucial It seems, however be- | 
latedly, that the scores representing requests | 
for help and for support were not satisfactory | 
in this respect, since to ask how oneself is | 
doing, or even to ask for direct help, may be as 
much a sign of an initiating, assertive attempt | 
to solve a hard problem as it is a sign of 
dependence induced by uncertainty. The 
experimenter did not record his clinical 
impression of a subject's apparent ease and | 
effectiveness vs. nervousness and tenseness, || 
there having been no way to assess the reli- | 
ability of such impressions, It is just such & 1 
measure, though, which might usefully have f 


tested the present conception. 
We have seen that the items and adjectives Í 
correlating with the RFT and WEFT tend to” 
fall on a dimension of social desirability. E 
Conventional values are certainly a prominent | 
external social structure, and the field depend- | 
ent subject may well be sensitive to it | 
This sensitivity to what is “out there" has | 
been demonstrated before, as when frame | 
dependent subjects show a high accuracy | 
(relative to their independent counterparts) in 
the recognition of tachistoscopically presented 
photographs of their peers (Crutchfield et al, — 
1958). The low order of relationships reported 
in this study, however, suggests that, in 
general conventionality of values probably - 
relates to field dependence only insofar as it i$ i 
correlated with “other-direction.” Some field 
independent students cleave strongly 10) 
well internalized mores; some field depend 
students claim academically popular, but” 
generally unconventional values which they ' 
recognize as locally fashionable. These sources |: 
of “error” attenuate correlations between 
field dependence and conventionality. ji 


———— 


FIELD DEPENDENCE, ABILITY, AND PERSONALITY 35 


The failure of the RFT and EFT to relate 
with the tests of psychological mindedness and 
self-concept differentiation is difficult to 
account for, since almost any conception would 
assume field dependence to be associated with 
lack of introspection and of intraception. It is 
quite possible, though, that any such con- 
ception is simply wrong. Further research 
will tell, but it is necessary to note the faults 
in the two tests in question. The score for 
self-concept differentiation is a measure of 


` response information unrefined by any sub- 


tractive term representing variability of 
response to a single stimulus. The score for 
this test thus may represent expansiveness in 
scale checking and nothing more. In the test 
of psychological mindedness, items must 
suffer from being scored in simple dichotomy: 
it is reasonable to suppose that there are 
more than two reliable degrees of psychological 
mindedness, though it might require consider- 
able effort to establish them. 

The search for correlates of field dependence 
within the large and poorly mapped domain 
of personality is thus an enterprise of uncertain 
outcome, The ineffective performance of 
frame dependent subjects in situations de- 
scribed as lacking structure, predictability, 
instructiveness, and so forth, and the extreme 
frame dependence of children and brain 
damaged patients, suggest that we might 
with more profit and economy change the 
focus of investigation from personality to 
intellectual function, This approach would 
see frame dependence primarily as a kind of 
intellectual deficit, and secondarily as a 
correlate of personality attributes. The nature 
of the intellectual processes in which deficit 
Occurs is at present almost completely a matter 
of speculation, but it seems a possibility that 
they are of the kind central to some theories 
of intelligence now current: the decentering 
ability conceptualized in the developmental 
theories of Werner (1957) and Piaget (1950), 
the concrete-abstract dimension of Gold- 
Stein's (1943) theory, or the systems of trans- 
formations proposed by Pribram (1960), All 
of these concepts converge on a notion of higher 
brain function marked by a high degree of 
autonomy, of internalized frames of reference 
relatively independent of outside stimulus 
Control Whatever the nature of these func- 
tions, they are not measured or are only 


slightly measured by common ability tests, 
including the ones used in this study. 

The relative deficit of such higher autono- 
mous processes in frame dependent persons 
may indeed have its source partly in develop- 
mental failures of an experiential nature. 
Parental overprotection and parental incon- 
sistency, for example, may result in learned 
tendencies to avoid important areas of stimu- 
lation, hence to inhibit the growth to autonomy 
of important schemata, leaving them relatively 
dependent on immediate sensory control, 
For a beginning, though, it would seem most 
efficient to examine such matters as the 
trainability of adults on rod and frame 
performance under various conditions, the 
effect of certain drugs and of sensory depriva- 
tion on frame dependence, electrophysiological 
correlates of differences on the RFT, and so on. 


SUMMARY 


The present study examined the construct 
“field dependence” in terms of certain expecta- 
tions derived from the literature, especially 
those concerning intercriteria and criterion- 
ability relationships, and the relations between 
various tests of field dependence and variously 
measured personality traits. 

The results were as follows: the criterion 
tests of field dependence were related, but 
must be considered far from equivalent; 
EFT performance was related with ability 
scores, but RFT performance was not; in other 
performance tests, measures of behavioral 
dependence were unrelated to either criterion 
test, but measures of uncertainty in unstruc- 
tured situations had significant relations with 
RFT performance; objective test measures 
of self-concept differentiation, psychological 
mindedness, and independence of judgment 
were unrelated with any measure of field 
dependence; self-reported traits and attitudes 
related much as they had previously with 
measures of field dependence, the field de- 
pendent person giving a picture of himself 
characterized by high social desirability. 

The results were discussed in terms of a 
conception of field dependence that considers 
the primary reference of the construct to be a 
tendency to react with affective disruption 

5 For an example of the conjunction of high IQ with 
low “abstract” ability, see the report by Nichols and 
Hunt (1940) on a case of frontal lobe damage. 
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and intellectual deficit to an external situation 
that lacks clear structure and instruction. 
Field dependence, in the present view, is a 
more direct index of internalization and 
autonomy of complex cognitive processes than 
of personality traits and motives. 
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systematic points of view on the nature 

of personality, the concept of the self has 
often played an important central role. Once 
the self-concept is formed and stabilized, it is 
said to influence later behavior strongly. 
Special attention has been directed towards 
the influence that self-attitudes have on 
attitudes towards others. Adler (1926), 
Fromm (1947), Horney (1939), and Murphy 
(1947) have all suggested that the self is an 
anchorage point, influencing perceptions of and 
attitudes towards others. This notion of the 
self as a reference point affecting interpersonal 
relations is most evident in Sullivanian and 
Rogerian theory. Sullivan comments that 
“one can find in others only that which is in 
the self...if the self dynamism... be 
chiefly derogatory (then) it will facilitate 
hostile, disparaging appraisals of other people” 
(1947, p. 10). Further: “When one respects 
oneself . . . so one can respect others... as you 
judge yourself, so shall you judge others” 
(1940, p. 87). Similarly, Rogers (1951) asserts 
that “when an individual . . . accepts (himself) 
...then he is necessarily... more under- 
Standing...and accepting of others as 
separate individuals” (p. 520). 

Much research has been conducted to 
determine whether such a theoretically 
proposed relationship between self-acceptance 
and acceptance of others can be demonstrated 
empirically. Berger (1952), Omwake (1954), 
Phillips (1951), Sheerer (1949), and Stock 
(1949) have all obtained significant results on 
samples ranging from patients in treatment 
to prisoners, high school students, college 
Students, and adult education enrollees. 


E many theorists who have proposed 


1 An abridged version of this paper was presented at 
the 1960 Western Psychological Association Conven- 
tion. The study was conducted as a doctoral disserta- 
tion for Stanford University. The author is indebted 
to C. L. Winder, Quinn McNemar, and Lucy Rau for 
their rigorous guidance and help. 

2 Now at Whitman College. 
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Correlations ranged from .36 to .74. Zelen 
(1954), testing sixth grade children, found 
that self-acceptance was significantly corre- 
lated with acceptance by peers, but not with 
acceptance of others. This contrary finding 
might be explainable on the basis of age 
differences between the subjects of Zelen's 
study and other researches. Inspection indi- 
cates that, in general, the lower the age group, 
the lower the correlations found. ; 

Despite these numerous studies, little 
attempt has been made to specify the variables 
leading to such correlations. Levy (1956) 
indicates that self-acceptance can be demon- 
strated to influence even acceptance of home 
town. He therefore questions the adequacy of 
the experimental design and proferred inter- 
pretations of previous researches. He further 
appears to be asking for two departures from 
previous studies: Offer the subjects a series of 
specific objects to describe rather than the 
usual, nondefinitive "average-other person,” 
and offer a theoretical formulation that will 
predict the nature of these new correlations. 

The present work extends previous studies 
in the directions suggested by Levy (1956). 
Thus, subjects were asked to describe them- 
selves and two specific people, their fathers 
and their teachers. Predictions regarding the 
nature of the correlation between self- and 
other-appraisal and definitive statements of 
the variables influencing such relationship were 
derived from a learning theory analysis of the 
self-concept. 

Following a learning theory analysis, 
several stimulus objects were defined along 
with certain responses. The self, the father, 
and the teacher were considered stimulus 
objects. Test answers were considered re- 
sponses elicited by these stimulus objects, In 
research of this type, the responses are desig- 
nated as reflecting acceptance or nonaccept- 
ance. Studies of stimulus response associations 
indicate that responses originally elicited by a 
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specific stimulus should also tend to be elicited 
by other stimuli that are similar to the original. 
The greater the similarity between the new 
stimuli and the original stimulus, the greater is 
the tendency towards generalization. It seemed 
logical to consider the correlation between 
self-acceptance and acceptance of others as an 
example of stimulus generalization, Thus, 
self-acceptance responses originally elicited by 
the self-as-object are generalized to others- 
as-objects. It was therefore predicted that 
there would be a significant positive correlation 
between self-acceptance and acceptance of 
others (Prediction 1). Furthermore, since 
degree of generalization is a function of degree 
of similarity, it was predicted that the greater 
the perceived similarity between the self and 
the other, the greater would be the generaliza- 
tion of self-responses towards the other 
stimulus object (Prediction 2). 

Two assumptions were implied by these 
predictions. First, that the self is the central 
or reference stimulus, not the other stimulus 
object (i.e. the father or the teacher). It was 
therefore assumed that acceptance responses 
are first associated with the self and then 
are generalized towards others. The second 
assumption was that the self-concept for an 
adolescent subject is stable and distinct 
enough to be considered a reference stimulus. 
Tf the self-concept is not a distinct stimulus, 
then it would be difficult for generalization 
to occur. If the original stimulus’ charac- 
teristics are vague, changing, or varying, then 
comparison with the second stimulus may be 
difficult or impossible. 

The third and fourth predictions were 
derived from findings which indicate that 
anxiety increases the degree of stimulus 
generalization (Hilgard, 1951; Mednick, 1957; 
Rosenbaum, 1953; Wenar, 1954). Prediction 
3 was as follows: Self-acceptance generalizes 
towards acceptance of others as a function of 
degree of self-acceptance, the greater the 


3 The restriction of the term “stimulus” to mean 
only “external energy changes impinging on a receptor” 
is not implied here. It is felt that this strict definition 
cannot explain adequately the influence of such cogni- 
tive and affective variables as thoughts, memories, 
attitudes, or the self-concept. Consequently, stimulus 
is considered here as conceptually similar to Dollard 
and Miller’s (1950) terms “cue-producing responses” 
(p. 98) and “verbal mediation cues” (pp. 101, 310). 


self-dissatisfaction (lack of acceptance), the 
greater the generalization. Similarly, Predic- 
tion 4 was stated in this way: Self-acceptance 
generalizes towards acceptance of others as a 
function of degree of involvement with the 
other person; the greater the involvement, the 
greater the generalization, Subjects have a need 
to relate to and be accepted by others. This 
need begins during the long dependency 
period of infancy and childhood and is mani- 
fested in adulthood as need Affiliation (Mur- 
ray 1938). Further, Zelen’s (1954) and 


- Smith's (1958) data suggest that a person 


who does not accept himself is also rejected 
by others. Therefore, the greater a subject’s 
self-dissatisfaction, the more likely it is that 
his Affiliation need will be frustrated. This 
actual or expected. frustration in turn may be 
hypothesized to lead to an increase in anxiety. 
And anxiety increases level of stimulus general- 
ization. Hence the prediction that gener- 
alization increases with increased self- 
dissatisfaction. 

Involvement was also hypothesized as a 
significant variable because subject’s need to 
relate to and be accepted by a person increases 
with greater involvement with him. With this 
increase in strength of need, frustration of the 
need should lead to greater anxiety. Thus, it 
was predicted that there would be greater 
generalization of self-responses towards fathers 
than teachers, assuming that the subjects 
were more involved with parents than with 
teachers. 


METHOD 
Subjects 


The subjects were 82 male high school seniors, all 
of whom were in contact with a male teacher. They 
were selected from the senior class on the basis of 
availability of time to participate. 


Instrument 


Four Q sort decks were developed for use. Each 
deck was composed of 20 adjectives selected from the 
300-item Adjective Check List (Gough, 1955a) in an 
earlier standardization study. Criteria for selection 
were as follows: 

1. All adjectives were equated for social desir- 
ability. Only those adjectives checked by the stand- 
ardization subjects as highly desirable or desirable 
and which were judged as favorable in Gough’s 
(1955b, pp. 33-34) own study of his Check List 
were used. 
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2. One deck, called the Common Adjective Sort 
(CAS), included only those adjectives checked by 
standardization subjects as applicable for describing 
themselves, their fathers, and their male teachers. 
This deck was developed for use as an independent 
measure of perceived similarity between these 
stimulus objects. 

3. The remaining three decks, called the Unique 
Adjective Self-Sort (UASS), the Unique Adjective 
Father Sort (UAFS), and the Unique Adjective 
Teacher Sort (UATS), included only those adjec- 
tives checked as relevant primarily for description 
of the self (UASS), the father (UAFS), or the male 
teacher (UATS). The UASS was used by each 
experimental subject to describe only his perceived 
self and his ideal self. The UAFS was used for similar 
descriptions of the subject’s father, and the UATS 
was used only for description of the subject’s male 
teacher. 

The purposes of establishing these criteria were to 
control for social desirability influences while still using 
adjectives appropriate for description of the named 
social objects. Kenny (1956; personal communication, 
1959) and Kogan and Quinn (1957) report that social 
desirability factors tend distinctly to influence Q sort 
behavior. Among various approaches offered by Kenny 
(personal communication, 1959) as a means of holding 
constant the effects of social desirability, the method 
of selecting only desirable traits was used. This proce- 
dure is similar to the forced-choice technique used in 
some personality and interest inventories where the 
subject is forced to choose from items which are equally 
desirable. It eliminates the spurious correlation between 
perceived and ideal sorts which occur because of a 
specific response set, i.e., that of placing all desirable 
traits in the “most like (me)” category and all the 
undesirable traits in the “least like (me)” pile. 


Measurement of Acceptance and Similarity 


Each subject’s perceived self-sort was correlated 
with his ideal self-sort, giving a Self-Acceptance (SA) 
score for every subject. Only UASS decks were used. 
Similarly, each subject’s perceived father sort was 
correlated with his ideal father sort, giving a Father 
Acceptance (FA) score for every subject. Only UAFS 
decks were used. Finally, each subject’s perceived 
teacher sort was correlated with his ideal teacher sort, 
giving a Teacher Acceptance (TA) score for every 
subject. Only UATS decks were used. 

„Each subject’s perceived self-sort was correlated 
with his perceived father sort, giving a Self-Father 
Similarity (S-F sim.) score for every subject. Only 
Sorts from the CAS deck were used. In the same 
fashion, each subject’s perceived self-sort was corre- 
lated with his perceived teacher sort, giving a Self- 
Teacher Similarity (S-T sim.) score for every subject. 
Only sorts from the CAS deck were used. 


Procedure 


Experimental subjects, in groups of 10, were asked 
to describe several designated stimulus objects using 
the decks provided. Sorting was on a five-point forced 


rectangular distribution. Subjects sorted in the follow- 
ing order: description of a male teacher with UATS, 
then with CAS; description of father with UAFS, 
then with CAS; description of self with UASS, then 
with CAS; description of ideal teacher with UATS, 
ideal father with UAFS, and, finally, ideal self with 
UASS. All decks were collected immediately after each 
sorting was completed and before directions for the 
next sorting were given. 


RESULTS 


Prediction 1. Subjects should generalize 
self-acceptance responses towards fathers and 
male teachers. Pearson product-moment corre- 
lations between the SA scores and FA scores 
were .32 (p < .005) and .25 (p < .02) for the 
SA scores and TA scores. ' 

Prediction 2. Degree of perceived similarity 
between the self and the other stimulus affects - 
the size of the generalization effect. This 
prediction was restated as: As the perceived 
similarity between self and other increases, 
the discrepancy between SA scores and Other- 
Acceptance scores (FA or TA scores) decreases. 
Discrepancy scores were obtained by taking 
the absolute difference between each subject's 
SA score and his Other-Acceptance score 
(FA or TA). These discrepancy scores were 
then correlated with perceived similarity 
scores (S-F sim. or S-T sim.). According to 
the restated prediction, a significant negative 
correlation was expected. 

The correlation based on the SA-FA discrep- 
ancy scores and the S-F sim. scores was —.24 
(p = .05). The correlation based on the SA-TA 
discrepancy scores and the S-T sim. scores was 
—.34 (p = .002). 

Prediction 3. The greater the self-dissatis- 
faction, the greater the generalization, The 
variance of the FA scores for the 15 subjects 
with the lowest SA scores was compared with 
the variance for the 15 subjects with the 
highest scores. Similarly, the variance of the 
TA scores for the 15 subjects with the lowest 
SA scores was compared with the variance for 
the 15 subjects with the highest SA scores, 
The F ratios, thus, computed were not signifi- 
cant. 

Prediction 4. Degree of involvement influ- 
ences generalization; the greater the involve- 
ment, the greater the generalization. It was 
assumed that all subjects were more involved 
with their fathers than with their teachers. 
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It was therefore expected that the correlation 
between the SA scores and the FA scores would 
be greater than the correlation between the 
SA scores and the TA scores. A / test of the 
differences between these two correlations 
was not statistically significant. 


Discussion 
General Implications for Research 


The first implication involves experimental 
design. There is little doubt now that self- 
acceptance is correlated with acceptance of 
others. Future research must direct itself 
towards contributing more detailed informa- 
tion concerning such a relationship. For this 
reason, it is believed that the most meaningful 
experimental approach lies in greater specific- 
ity. This study has shown that specifically 
designated stimulus objects can be examined 
instead of the formerly used, vaguely defined, 
"average-other person." This new technique 
does away with the ambiguity of interpreting 
findings involving the average-other. Further- 
more, it allows a more straightforward conclu- 
Sion: that self-acceptance influences acceptance 
of others—specifically, of fathers and even 
teachers. 

A second implication applied to theory. 
Given that the relationship exists, the full 
value of such a finding is attained only as the 
variables influencing the relationship are better 
understood. However, as Levy (1956) pointed 
out, theoretical formulations, thus far, have 
not provided hypotheses as to what variables 
are important. In this study, a learning 
theory orientation was found to be a fruitful 
Source for deriving such hypotheses. It was 
shown that the self could be assumed to be a 
stimulus object and that self-acceptance 
could be viewed as responses elicited by the 
self, Further, it was found quite feasible and 
informative to apply knowledge about stimulus 
generalization to study of self-acceptance and 
acceptance of others. 

A third implication concerns the inter- 
pretation of the finding that one variable, 
perceived similarity, significantly influences 
the generalization of self-acceptance responses, 
The interpretation maintained in this report 
is that perceived similarity is an independent 
variable influencing the dependent variable, 


generalization. Another way of interpreting 
the data is to consider acceptance as one 
factor used by a person when he rates another 
on similarity. For example, a subject may 
judge his father as similar to himself on the 
basis of whether he accepts his father as much 
as he (the subject) accepts himself. The 
reasoning runs like this: I accept myself; I 
also accept my father; therefore, my father 
and I must be similar. Q sort techniques tend 
to minimize this type of problem inasmuch as 
the subject is asked simply to describe several 
people and is unaware of the purpose of such 
description. However, research is needed to 
clarify the issue further. 

Finally, several further predictions may be 
derived from the results of this study. As a 
first example, it is feasible to attempt a 
prediction of the degree of generalization of 
self-acceptance to still other people. Thus, it 
might be predicted that a wife would express 
dissatisfaction with her husband or children if 
she were already dissatisfied with herself. 
The degree of this generalization of self- 
dissatisfaction would depend upon the degree 
to which the wife perceived her family as 
similar to herself. This type of behavior 
might be called “scapegoating” in social 
psychology, “projection” in abnormal psy- 
chology, or, as suggested here, generalization 
of self-attitudes toward other similar social 
objects. Such generalization is quite possibly 
the explanation for the Sears, Maccoby, and 
Levin (1957) finding of a significant correlation 
between a mother's "acceptance for the self 
(and her acceptance of) the husband, the 
current situation, the pregnancy, the infant 
and the child" (p. 59). 

Another prediction involves the possibility 
of adjustive behavior being facilitated by 
generalization. It is suggested that the variable 
of perceived similarity be explicitly added to 
Rogerian and Sullivanian expositions of the 
way acceptance of self facilitates realistic 
acceptance of others. A self-accepting person 
is presumably better able to see others as 
separate social objects from himself (Rogers, 
1951, p. 520); he may then also be better able 
to determine realistically how similar the self 
is to others; the greater the similarity, then, 
the greater the acceptance of the other person. 
Thus, the self-concept can serve as a mediating 
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stimulus, aiding in eliciting adjustive responses. 
An example of a prediction from this reasoning 
is that empathic behavior is facilitated where 
there is some perception of similarity of the 
other to the self. Nimitz (1956) reflected this 
idea in the comment, “An individual may 
exaggerate similarities between himself and 
others... (and this) need be in no way 
pathological (Le., it can occur without de- 
fensive needs being served)" (p. 279). A 
therapist may therefore more readily 
empathize and accept his client by such a 
process. 


SuMMARY 


This study was an attempt to provide a 
theoretical orientation from which the vari- 
ables influencing the relationship between 
self-acceptance and acceptance of others 
could be derived. A learning theory approach 
was used, whereby the self was considered as 
a stimulus object and self-acceptance state- 
ments were considered responses. The general 
hypothesis was that these responses asso- 
ciated with the self would be subject to the 
effects of stimulus generalization. 

Eighty-two male high school seniors were 
asked to describe themselves and two other 
designated stimulus objects, their fathers and 
their male teachers. Special Q sort decks, de- 
veloped in a standardization study, were used 
for these descriptions. It was predicted that 
self-acceptance responses would generalize to- 
wards the father and teacher as a function 
of the following variables: degree of perceived 
similarity between the self and the other stim- 
ulus object, degree of self-dissatisfaction, and 
degree of involvement with the other stimulus 
object. Results indicate that self-acceptance is 
significantly correlated with acceptance of fa- 
ther and with acceptance of teacher and per- 
ceived similarity is a significant variable influ- 
encing this generalization of self-acceptance. 
There is no support for the proposed influence 
of degree of self-dissatisfaction or of involve- 
ment for adolescent subjects. 

The contributions of this study towards 
better understanding, experimental design, 
and prediction in studies of this type were dis- 
discussed. 
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behavior may be conditioned by arrang- 

ing a contingency between a verbal 
response, which is a unit of verbal behavior, 
and a “generalized conditioned reinforcer.” 
The verbal responses most frequently con- 
ditioned have been uttered specific words, 
numbers, or categories of these such as plural 
nouns (Greenspoon, 1955), and three-digit 
numbers or verbs (Sarason, 1957). In a few 
studies, the verbalizations to be reinforced 
have been “statements describing or evaluating 
the state of the patient by himself,” or state- 
ments of opinion used as a measure of a par- 
ticular, relatively simple, attitude (Ekman, 
1958). 

This study concerns itself with the question 
of whether the responses of subjects to a 
complex, multifactorial attitude scale (the F 
Scale) can be altered by means of verbal 
reinforcement and whether such change, if 
obtained, generalizes to a related attitude 
scale (the E Scale). A further point of interest 
is whether generalization takes place if the 
source of original reinforcement is no longer 
present. Since the prestige of the experimenter 
is an important variable in the effectiveness of 
conditioning (Verplanck, 1955), one may 
speculate that his presence or absence should 
be important in obtaining or failing to obtain 
generalization. 

The study has practical relevance to the 
general area of behavior change. Psycho- 
therapists may transmit their own values, 
beliefs, and theoretical orientations to the 
patient by means of verbal and gestural 
conditioning. For both theoretical and prac- 


C= (1953) has suggested that verbal 


1 This paper is based on a dissertation submitted to 
the Graduate School of the University of Pennsylvania 
in partial fulfillment of the requirement for the degree 
of Doctor of Philosophy. 

? The writer is deeply indebted to Seymour Feshbach 
under whose supervision the study was conducted and 
to Francis W. Irwin whose suggestions and criticisms 
Proved to be an invaluable aid. Special thanks go to 
Harold A. Rashkis of Eastern Pennsylvania Psychiatric 
Institute who made the time available to do this study 
while the author was in his employ. 
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tical reasons, it therefore seems important to 
know the practicability of verbally con- 
ditioning complex attitudes, the nature and 
extent of the generalization of such condi- 
tioning, and whether these phenomena 
represent the effects of simple automatic rein- 
forcement or whether they are more complex 
examples of social influence processes. 


METHOD 


Materials 


The California F Scale was chosen as a complex, 
multifactorial attitude scale individual items of which 
bear only an indirect relationship to each other. The 
30-item California scale was combined with the 30- 
item reversed Christie scale (1958) to exclude the 
influence of acquiescence set. Combining these two 
sources, a 60-item scale was constructed with items 
arranged in random order. The California E Scale of 
30 items, which according to Adorno, Frenkel-Bruns- 
wik, Levinson, and Sanford (1950) is ideologically and 
historically related to the F Scale, was chosen as the 
measure of generalization, 


Subjects 


Twenty-four pairs of female subjects (48 subjects 
in all) were chosen from a sample of 250 subjects who 
completed the combined F Scale and the E Scale, 
and who were taking the introductory psychology 
course at the University of Pennsylvania. To be chosen 
a subject had to have at least 30, but no more than 45 
F Scale items answered in the prodemocratic direction, 
and between 12 and 24 in the prodemocratic direction 
on the E Scale. This insured that all subjects were 
already relatively prodemocratic. Subjects were 
matched on the basis of their scores on both scales 
to form the 24 pairs. 


Experimenter 

The experimenter was a 28-year-old male who was 
presented to the subjects as an experimental psycholo- 
gist from a research institute conducting some studies 
at the university (which was in fact true). 


Procedure 


'The experiment was conducted 3 months after the 
initial F and E Scales were filled out. Half of the 
subjects (1 from each of the 24 pairs) were randomly 
assigned to the experimental procedure and the other 
half (their matched pairs) to the control procedure. 
After the learning session, one-half of the 24 pairs 
responded to the E Scale (generalization procedure) 
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TABLE 1 


TOTAL AND MEAN NUMBER OF PRODEMOCRATIC 
RESPONSES FOR EACH SET OF 
TRIALS 


LEARNING 
1st 2nd 3rd All 60 
20 Trials | 20 Trials 20 Trials |Trials 
Total! M |Total| M |Total| af |Total 
Experimental 306 |12.75| 318 | 13.25| 379 | 15.79| 1003 
(N = 24) 
Control 304 | 12.66 299 | 12.45} 293 | 12.20] 896 
(N = 24) 
TABLE 2 


i TESTS FoR MATCHED PARS FOR EXPERIMENTAL 
VERSUS CONTROL GROUP OVERALL AND FOR EACH 


OF THREE Pairs or TWENTY TRIALS 
Dana Varance| — ¢ » 
Overall (60 trials) 1.49 .33 |2.81 <.01 
1st 20 trials .08 ES .016 | >.05 
2nd 20 trials 19 1 | 1.54 | >.05 
3rd 20 trials 3.6 «6 6.00 | <.01 


with the experimenter in the room and one-half with 
him absent. 

In the experimental group the experimenter read 
all 60 items of the combined F Scale aloud and asked 
the subjects to state whether she "agreed" or **dis- 
agreed" with each item. Every time the subjects gave 
an answer in the prodemocratic direction the experi- 
menter said either "good" or "right" (*good" and 
"right" were alternated in an irregular order but with 
about equal frequency). For each control subject, the 
experimenter merely read the items aloud and re- 
corded the subject’s answers without comment. 

For one-half of the pairs, after completing the F 
Scale procedure, the experimenter went on to read 
aloud all 30 items of the E Scale and recorded the 
subject’s. "agree" or "disagree" answers without 
Comment (experimenter present generalization condi- 
tion). For the other half of the subjects, the experi- 
menter excused himself from the room after the condi- 
tioning period and had the subjects fill out the E 
Scale by themselves (experimenter absent generalization 
condition). This procedure was identical for the control 
group. 

The experimental paradigm may be characterized 
thus: 


Generalization 

12 subjects experimenter 
Present 

12 subjects experimenter 
absent 

12 subjects experimenter 
Present 

12 subjects experimenter 
absent 


Experi- 24 subjects 
mental 
Matched 


Control 24 subjects) 


Finally, all experimental subjects were questioned by 
the experimenter (experimenter absent subjects were 
spoken to in the hall after leaving) as to their degree of 
awareness of the experimental contingency between 
prodemocratic responses and “good” and “right.” 


RESULTS 


The results of the 60-trial 
period were divided for purposes of analysis 
into three blocks of 20 trials for both the 
experimental and control subjects. The data 
may be treated by / tests. Although the data 
lend themselves to a trend analysis based on 
analysis of variance, such a procedure does 
not take into consideration the matching of 
subjects (but in this case it yields results 
identical to those found by ¢ tests). 

Table 1 shows the total and mean number 
of prodemocratic responses for the experi- 
mental and control groups. 

Table 2 deals with the question of the signifi- 
cance of the obtained differences between 
the experimental and control groups. It shows 
that learning, that is, a shift in a more demo- 
cratic direction, took place in the experimental 
group. 

The ¢ tests clearly indicate that there is an 
overall learning effect, although the difference 
is statistically significant only for the last block 


conditioning 


EXPERIMENTAL GROUP mmm. 
CONTROL GROUP 


RESPONSES 


TRIALS 


FOR EACH SET OF FIVE 


MEAN NUMBER OF PRO DEMOCRATIC 


0 5 l0 15 20 25 30 35 40 4S SO 55 60 


NUMBER OF TRIALS 
(number of items on F scale) 


Fic. 1. Learning curves by groupings of five trials. 
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TABLE 3 
E Scare Scores IN TERMS or MEANS 
(Mean Number of Anti-Ethnocentric Responses) 


Means pes 
1st 10 trials 7.9 7.1 
2nd 10 trials 7.0 6.1 
3rd 10 trials 6.8 6.1 
Total 7.2 6.4 


Control Present| Control Absent 


1st 10 trials 5.5 6.1 

2nd 10 trials 6.1 6.1 

3rd 10 trials 6.1 6.4 
Total 5: 6.2 

TABLE 4 
1 TESTS FOR GENERALIZATION COMPARISONS 
Mean |Var-| , 
Difference] ance ? 

1, Experimenter present] |... 4.6 — | 1.23 |3.73|<.01 
vs. Control present 

2. Experimenter absent! bwah .66 |1 -66|>.05 
vs. Control absent Pi 

3. Experimenter absent| 1st 10 1 
vs. Control absent j trials Mi — Ma") .41 |2.43|<.05 

4. Experimenter present | 1st 10 IS .201/3.92| «.01 
vs. Experimenter absentj trials |Mi — M24) 


? Based on nonmatched subjects. 


of 20 trials. Figure 1 illustrates the course of 
learning as shown by blocks of 5 trials (learning 
appears to occur rapidly somewhere after the 
30th trial). 

In regard to generalization, Table 3 pre- 
sents the data for the Generalization Condi- 
tions, showing both the total means and means 
obtained by dividing the 30 trials of the E 
Scale into three blocks of 10 trials. Table 4 
represents the statistical analysis of these 
data. 

The first / test indicates that the subjects do 
Show an overall generalization effect in terms 
of more prodemocratic responses when the 
experimenter remains in the room. The second 
t test demonstrates that if the experimenter 
leaves the room it is mot possible to show 
Significant overall generalization. The third 
i test suggests that even though no overall 
generalization effect can be shown when the 


experimenter leaves the room, there is some 
generalization over the first of 10 trials even 
under this condition. Such generalization is 
greater when the experimenter is in the room 
than when he is absent. Figure 2 shows the 
course of the generalization by groups of 5 
trials. 

Certain other data warrant brief mention. 
'The initial correlation before conditioning 
between the combined F Scale and the E Scale 
for the 48 subjects was quite low (r = +.23, 
not significant). It is noteworthy that general- 
ization from the F to the E Scale did take place 
in spite of the low common variance between 
the two scales. However, z between the F 
Scale and the E Scale for the 12 subjects of 
the Experimenter Present condition, after 
conditioning, increased to +.60 (p < .05). 
Despite the originally low pre-experimental 
correlation (+.22 for these 12 subjects) these 
subjects not only gave more democratic 
responses on the E scale, after repeated rein- 
forcement on the F Scale for prodemocratic 
responses, but the shared variance of the two 
scales markedly increased. A conceptual 
change may have taken place in which the 
subjects learned the hypothesis of Adorno 
et al. (1950) that these two scales ought to be 
related! 

There was a tendency short of significance 
for the more aware subjects to show a greater 


B RIVENTAL GROUP 
99 39 Experimenter Present = 
$a Experimenter Absent = = 
Re 38 CONTROL GROUP 
kd ;L Experimenter Present »«»« 

uo Expenmenter Absent OO 
BE NE. 
tu eu 
5e 35 ^a 
9o * 
as sal. ` 
m LI 
ag 33 ^ 
e E X 
zB 32 XD x 
Sw 31 x “ah 25 7% 
d ETT x xy to ELA "e. a 
BS soL x* * 4o OTSA 
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o 
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NUMBER OF TRIALS 
(number of items on E scole ) 


Fic. 2. Generalization data by groupings of five trials. 


46 


change in the prodemocratic direction. Gen- 
erally, however, the subjects had a low level 
of awareness and the differences were small. 


Discussion 


This study demonstrates that at least one 
complex attitude scale is amenable to in- 
fluence by means of verbal reinforcement. 
Since the responses studied involve stated 
opinion rather than action, it is problematic 
whether such changes would affect the sub- 
jects’ overt behavior. It was also shown that 
change on one attitude scale can produce 
„change on another attitude scale. Indeed the 
results suggest that the verbal reinforcement 
procedure may be capable of bringing about 
conceptual changes that alter pre-existing 
relationships between components of the 
subject's cognitive structure. 

There is an indication that changes brought 
about by verbal reinforcement depend on the 
experimenter's physical presence. At least in 
the initial phases of generalization, the sub- 
jects’ altered verbal behavior is dependent on 
the presence of the experimenter. As in 
Rogers’ (1960) study, there is the implication 
that changes may be Specific to the inter- 
personal situation in which they take place. 

A strictly Skinnerian (1953) explanation 
of the findings seems to run into difficulty in 
accounting for discrimination of the similarity 
between scale items when they bear such little 
physical resemblance to each other. Osgood’s 
(1957) theory, in terms of “mediated generali- 
zation,” seems to be more powerful in explain- 
ing the basis for the subject’s discriminations, 
but the explanation rests on speculation about 
past experiences which are far from easy to 
demonstrate. At present it seems simpler and 
safer, particularly in light of the findings about 
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the Experimenter Absent condition, to view the 
results as reflecting the dependence of learning 
(whatever the nature of the learning mecha- 
nism) on social influence phenomena. 


SUMMARY 


This investigation found that verbal rein- 
forcement was effective in shifting subjects toa 
more democratic position on a combination of 
the California and Christie F Scales, Further, 
the change generalized to the E Scale under 
conditions in which the experimenter was 
present, even though he was no longer rein- 
forcing the subject. The experimenter evi- 
dently has social influence value. 
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EFFECT OF PERSONALITY AND STATUS EXPERIENCE UPON 
THE VALUATION OF UNOBTAINED STATUSES! 


PETER GUMPERT? JOHN W. THIBAUT, Aw» EMIR H. SHUFORD 
University of North Carolina 


UBJECTIVE status may be defined 
(Thibaut & Kelley, 1959) as the result 
of an evaluation by a group member of 

his outcomes (the rewards he receives and the 
costs he must incur), in relation to those he 
perceives other members to be attaining. An 
individual in a group has high subjective 
status if the level of his outcomes is above that 
attributed to his fellow members; he has low 
subjective status if he is below the level of 
outcomes he attributes to the other members 
of the group. Given that differences in out- 
comes among the members of a group exist, 
say, as perceived by an omniscient observer, 
various conditions may be stated that should 
tend to increase the likelihood of comparisons 
of outcomes by the group members, and of 
subjective status evaluations. First, member- 
ship in the group must be salient for the mem- 
bers. For each person, the outcomes of the 
other members must also be salient and known; 
the incurrence of costs and consumption of 
rewards by each group member must be public 
to all other members, or at least readily 
inferrable. Further, status comparisons are 
not likely to occur when the group members 
differ too greatly in their power vis-à-vis 
outside agents that may affect the outcomes 
of the various group members. Hence, power 
differences within the group must not be too 
great nor should the total range of outcomes 
of the group members be so large as to suggest 
great differences in power within the group. 
Finally, the activities of the group members 
must not be so different as to make compari- 
Sons of outcomes difficult or meaningless. 
"These conditions interact to make the out- 
Comes of his fellows highly salient for the 


1 This study was conducted in the program of the 
Organization Research Group of the Institute for 
Research in Social Science, University of North Caro- 
lina. The activities of the Organization Research 
Group are supported by the Office of Naval Research 
—Nonr-855 (04). 

*Now at Stanford University. The author held a 
National Science Foundation Undergraduate Fellowship 
during the period in which the research was conducted. 
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group member, and thus to make subjective 
status evaluations maximally likely. 

As suggested by Thibaut and Kelley (1959), 
the outcomes included in the various status 
levels in a group frequently approximate a 
Guttman scale. That is, the activities and 
outcomes of the middle and upper status 
positions include samples of the outcomes of 
status positions below them, and include also 
outcomes that have not been sampled by 
incumbents of the lower status positions. Thus 
a person in a middle status position is likely 
to have attained, or at least sampled, the 
outcomes of persons in positions lower than 
his own, and is likely to perceive that persons 
occupying higher status positions are experi- 
encing outcomes that he has not attained. The 
point to be emphasized is that for a person in 
such a status system the only unattained out- 
comes are those at positions above him in the 
hierarchy. 

Thibaut and Kelley (1959) suggest that in 
evaluating the utility or desirability of un- 
attained outcomes, persons differ in the weights 
they attach to the reward and the cost com- 
ponents of such outcomes. Individuals may 
be located on a continuum at one end of which 
rewards are relatively heavily weighted, and 
on the other end of which costs are relatively 
heavily weighted. On the basis of a number of 
studies (Atkinson, 1953; Klein, 1954), persons 
who heavily weight the reward component of 
unattained outcomes may be described as 
"confident, outgoing, and oriented toward 
success," and persons who heavily weight the 
cost component as “unconfident, constricted, 
and fearful of failure.” In a status system of 
the type we have described, the former person 
should value status positions above him more 
highly (and, hence, show greater upward 
mobility aspirations) than should the latter 
sort of person. The present study was designed 
to test this notion. 

In devising a laboratory setting for the 
study, it was necessary to find a task and a 
population of subjects that meet all the 


!] 
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conditions just described. Outcomes in the 
situation had to be salient for all subjects, 
with minimal opportunities for privacy. The 
range of outcomes had to be not too great, 
and all subjects had to have approximately 
the same realm of power within the setting. 
The artificial status system devised for the 
experiment had to be arranged in the manner 
of a Guttman scale, and the outcomes involved 
had to be novel to the subjects. Finally, sub- 
jects had to be selected from opposite ends of 
the "outgoing-confident-constricted-unconfi- 
dent" continuum, and a method for assessing 
their subjective status had to be found. 


METHOD 


Experimental Setting 


The setting consisted of a series of five cosmetic 
booths. All booths contained such make-up essentials 
as a mirror, cleansing cream, tissues, and cotton 
squares. Booth A contained lipstick and powder, each 
in two shades. Booth B contained lipstick and powder 
in additional colors, plus mascara. Similarly, Booths 
C, D, and E each contained all the cosmetic items of the 
lower booths, with the addition in each booth of one or 
more new items, so that the booths ranged in easily 
discernible steps from very simple (A) to very complex 
and well-equipped (E). The booths were arranged in a 
row in the same room, visible to all subjects. 


Subjects 


Subjects were 62 junior high school girls, ranging in 
age from 12 to 15. Girls of this age level were chosen to 
serve as subjects because they should have had limited 
prior experience with cosmetics, but relatively great 
interest in their use. The subjects were recruited from 
the Chapel Hill Junior High School on the basis of very 
high or very low scores on the Welsh A and R (Anxiety 
and Repression) scales of the MMPI (Welsh, 1956). 
Behavioral validation studies of the Welsh A and R 
scales (Dahlstrom & Welsh, 1960, pp. 212-214) have 
strongly suggested that this dimension reflects the de- 
sired continuum of differential weighting of rewards and 
costs. On the basis of these studies, high scores on both 
the A and the R scales are taken to denote the disposi- 
tion to weight heavily the cost component of unattained 
outcomes. Low scores are interpreted as reflecting the 
tendency to weight heavily the reward component of 
unattained outcomes. In the sample, raw A and R 
scores ranged from 2 to 35 for A and 10 to 24 for R. 
Scores were dichotomized at 14 for A, and at 16 for R. 


Measure of Utility of Outcomes 


All subjects were asked, during the experiment, to 
evaluate the “attractiveness or desirability” of all the 
cosmetic booths in the setting, using a lottery procedure 
for measuring utility which has recently been developed 
(Shuford, Jones, & Bock, 1960). This method (the 


“method of contingent paired comparisons”) involves 
the choice by each subject of one of two “boxes.” 
Each box contains nine tickets" (squares on which are 
printed one of two of the letters, A, B, C, D, or E, 
representing the booths). The proportion of tickets on 
which are printed a given letter corresponds to the 
probability of “winning” that booth, as in a lottery. 
Each booth is thus compared with every other booth 
with different probabilities of “winning.” An interval 
scale for the utility of each booth (relative to Booth A) 
is then obtained for each subject. In order to determine, 
for any subject, the utility of unattained outcomes, the 
utility of her “own” booth is subtracted from the 
utility of the next higher booth. A measure of the utility 
of attained outcomes may be obtained by subtracting 
from the utility of the subject’s “own” booth the 
utility of the next lower booth. 


Procedure 


The 62 subjects were scheduled in groups of three? 
Immediately prior to entering the experimental room, 
subjects were individually photographed with a Polaroid 
Land camera. They were then taken to the experimental 
room, and were shown through all the booths by a 
female assistant, who was introduced as a “cosmetic 
consultant.” The consultant explained each booth in 
detail to the group of subjects, and indicated the func- 
tion of each item of make-up contained in each booth. 
Then one of three standard sets of instructions was 
read to the subjects, and the experiment proceeded 
according to three variations: 

“Judges” condition: All outcomes attained. The 
subjects were told that their task in the experiment 
was to use all of the five cosmetic booths in applying 
make-up in the most creative manner possible. They 
were also told that they were competing with all 
other subjects for a valuable prize, which was to be 
awarded to the subjects who did the best make-up 
job, based on “before” and “after” photographs 
which impartial cosmetic experts would judge. Each 
subject was allowed 4 minutes in each of the five 
booths, in a predetermined random order. Subjects 
were instructed to use any of the cosmetic items they 
wished to use, in any order or in any way, and they 
were offered the assistance of the cosmetic consultant 
if they wanted it. Following this 20-minute make-up 
session, the subjects were asked to evaluate the “‘at- 
tractiveness or desirability” of each of the five booths, 
in the manner described above. A measure of the 
amount of experience each subject had previously 
had with cosmetics was also obtained at this time. 
The subjects were then allowed to return for an 
additional 10-minute period to the last booth which 
they had used in the previous session. Finally, à 
second photograph of each subject was then taken, 
and the group was cautioned not to reveal the 
nature of the experiment to future subjects. 

“Nonmobility” condition: Unattained outcomes. 


3 Occasionally only two subjects arrived at the 
laboratory. In these instances an experimental confeder- 
ate from the Junior High School was employed to 
pretend to be the third subject. 
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This procedure was identical to that of the Judges 
condition, except that here each subject was assigned 
at random to only one booth of the three middle 
booths (B, C, or D). The subjects were told that 
the assignments had been made strictly by chance, 
and that they would use only the one booth through- 
out the two make-up sessions in the experiment. 
Each subject worked in her assignment booth for 20 
minutes, after which all subjects were asked to 
evaluate the utility of al] of the booths, as in the 
Judges condition. The rest of the procedure was 
identical to that followed in the Judges condition. 
“Mobility” condition: Unattained outcomes. This 
procedure was the same as that in the Nonmobility 
condition, except that subjects were told that an 
evaluation of their performance during the first of 
the two make-up sessions would be used to assign 
them to a booth for the second session. If they did 
quite well during the first session, they were told, 
they would be assigned to a higher booth during the 
second session, while if they did not do well, they 
would remain at their first session booth or be as- 
signed to a lower booth during the second session. 
After the first 20-minute make-up session, the sub- 
jects evaluated the utility of all the booths, and were 
assigned to higher booths during the second (10- 
minute) session. All other details of the procedure 
were identical with the other conditions. 
Summarizing the procedure, subjects in the Judges 
condition sampled the outcomes available in all of the 
five booths, while each subject in the other two condi- 
tions was essentially in a “middle” status position, 
having experienced the outcome of positions below her, 
and having been made aware of unattained outcomes 
in status positions above her own. In the Nonmobility 
condition, status was ascribed and permanent, whereas 
in the Mobility condition higher status could be 
achieved by performing well during the first make-up 
session. Evaluations of the utility of the booths were 
made at the same point (after a 20-minute make-up 
session) in all three variations. 


HYPOTHESES 


The major hypothesis of the study is that 
the presence of unattained outcomes in the 
Status system (in the Mobility and Non- 
mobility conditions) differentiates clearly 
between the high and low scorers on the Welsh 
A and R scales. More specifically, subjects 
scoring low on these scales (those heavily 
weighting the reward component) should tend 
to overvalue the utility of unattained out- 
comes, as measured by the difference between 
the utility of the cosmetic booth immediately 
higher than their own and the utility of their 
own booth, while those subjects who scored 
high on the A and R scales (those heavily 
weighting the cost component) should tend 


to undervalue the utility of unattained out- 
comes, as compared with the low scorers. 

Second, it is predicted that in the Judges 
condition in which there are essentially no 
unattained outcomes for the subjects, low 
and high score on the A and R scales would 
not show differences in the utility values they 
assigned to the various booths. 

A third prediction concerns differences 
between the Mobility and Nonmobility 
conditions. Since persons who are "confident, 
outgoing, and oriented toward success" 
should tend to overvalue unattained out- 
comes, and persons who are “unconfident, 
constricted, and fearful of failure" should 
undervalue unattained outcomes, this effect 
should be most evident when there exists 
a possibility of status mobility within the 
group. Hence, it was predicted that differences 
between the evaluation of the utility of un- 
attained outcomes by the two personality 
groups would be maximal in the Mobility 
condition. 


RESULTS 


Data gathered on the pre-experimental 
cosmetic experience of the subjects confirmed 
our assumption that for the most part the 
subjects had no previous experience with the 
major portion of the cosmetics in the experi- 
mental task. 

The first main result to report is that no 
differences in utility values for the various 
booths were found between the Mobility and 
Nonmobility conditions. Thus no support is 
given to our third hypothesis. We will return 
shortly to a discussion of this finding. In light 
of their close similarity, the data from the 
Mobility and Nonmobility condition were 
pooled into one “experimental” group, in 
order to increase the number of cases for 
subsequent analyses. 

The first hypothesis predicts a negative 
relationship between A and R score and the 
utility values for unattained outcomes. To test 
this hypothesis a variance analysis was per- 
formed for the experimental group on the 
utility difference between the subjects! own 
booth and the next higher booth (Table 1). 
This analysis indicates that low scorers on the 
A and R scales gave a higher evaluation to 
the utility of unattained outcomes than did 
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those subjects who scored high on the A and 
R scales. It will be noted that no effect is 
attributable to position (booth occupied) nor 
to the interaction between booth occupied and 
A and R score. 

The same results can be seen from inspec- 
tion of Figure 1. Here again the low A and R 
scorers at each of the middle booths show a 
sharper rise than do high scorers in utility 
evaluation for the booth immediately above 
own booth. Thus the first hypothesis appears 
to be confirmed. 

According to the second hypothesis, the 
utility values for the various booths in the 
Judges condition should be very similar for 
high and low scorers on the A and R scales. 
To test this hypothesis the average difference 
for each subject between the utility of each 
booth and the next higher booth, in the Judges 
condition, was subjected to an analysis of 
variance. From the obtained F of 0.44 we 
conclude that high and low scorers on the A 
and R scales do not differ in the utility values 

*they assign to booths above them when all 
outcomes have been attained. 

Another kind of analysis permits the first 
and second hypotheses to be tested from the 
same set of data. It is possible to compare the 
effects of 4 and scores on evaluations of the 
utility of both attained and unattained out- 
comes within the experimental group. To 
assess the effect of A and R score on the evalua- 
tion of unattained outcomes, the utility of 
Booth C minus the utility of Booth B (for 
subjects occupying Booth B) and the utility 
of Booth D minus the utility of Booth C (for 
subjects occupying Booth C) were combined 
in an analysis of variance. This analysis shows 


TABLE 1 
ANALYSIS OF VARIANCE OF DIFFERENCE IN "EXPERI- 
MENTAL” GROUP BETWEEN UrrLiTY OF Own Booru 
AND Unitity or Nexr HIGHER Boots 


Source of Variation df SS MS F 

High vs. Low A and R 1] .115 | .115 | 5.502* 

score 
Position (Booth) 2 | .012 | .006 | 0.287 
Interaction 2| .009 | .004 | 0.215 
Within 39 | .817 | .021 

"Total 44 | .953 

* p< 025. 
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Fic. 1. Utility values of the various booths in the 
"experimental" treatment for high and low scorers on 
the A and R scales, plotted separately for subjects 
assigned to each of the three middle booths. (Vertical 
arrows indicate own booth. No utility value can be 
computed for the A booth since all other values are 
scaled relative to it.) 


that (consistent with the first hypothesis) low 
A and R scorers evaluated the utility of un- 
attained outcomes higher than did high A and 
R scores. The obtained F of 4.99 is significant 
at the .05 level. 

To assess the effect of A and R scores on the 
evaluation of attained outcomes within the 
experimental group, the utility of Booth C 
minus the utility of Booth B (for subjects 
occupying Booth C) and the utility of Booth 
D minus the utility of Booth C (for subjects 
occupying Booth D) were combined in a 
variance analysis. The results of this analysis 
give additional support to the second hypoth- 
esis in showing no difference in the evaluation 
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of attained outcomes between high and low 
scorers on the A and R scales. The obtained 
F is 0.077. 


Discussion 


The results of this experiment are interpreted 
as being in general conformity with the 
formulation proposed by Thibaut and Kelley 
(1959). In a status hierarchy that approxi- 
mates to a Guttman scale, samples of all 
outcomes are attained except for those con- 
nected with positions located above the person. 
If persons differ in the weights they assign to 
the reward and cost components of unattained 
outcomes, those who heavily weight the 
reward component should value more highly 
the outcomes in positions above them than 
do persons who heavily weight the cost com- 
ponent of such unattained outcomes. An 
immediate implication from such differential 
weighting of unattained outcomes is that the 
former persons should show greater aspiration 
to upward status mobility than the latter. 

If we are correct in interpreting the Welsh 
A and R scales as measuring the differential 
weighting of rewards and costs for unattained 
outcomes, low scorers on these scales should 
more highly value outcomes for positions 
located above them than do high scorers. For 
attained outcomes, however, no differences 
in valuation of outcomes should be detected 
when high and low scorers are compared. 

With respect to the “mobility” variation, 
however, the results depart from expectations. 
It was supposed that when upward mobility 
became a distinct possibility, the unattained 
outcomes of positions above the subject would 
take on a greater salience, would instigate him 
more powerfully, and, hence, would throw 
into heightened relief the differences between 
subjects who differentially weighted rewards 
and costs. That the data do not support this 
hypothesis is clear: there were no differences 
between high and low scorers on the A and R 
scales when the Mobility and Nonmobility 
conditions were compared. 

In speculating about the sources of this 
negative finding, the possibility immediately 
comes to mind that the manipulation of 
mobility was not sufficiently powerful to 
produce the expected effects. The experi- 
mental manipulation of mobility has proved 
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difficult in the past (Kelley, 1951). In the 
present experiment the possibility may be 
especially plausible that the mobility ma- 
nipulation was not sufficiently powerful and 
sustained to override competing processes 
that might attenuate it. Specifically, the 
Mobility condition subject’s perception of 
her own performance may have strongly 
determined her valuation of unattained out- 
comes, those perceiving themselves as per- 
forming well being confident enough to value 
highly such outcomes. If such perceptions of 
own performance are not positively correlated 
with the A and R scores, this process might 
conceivably reduce any tendency for A and 
R scores to show greater effects on the Mobility 
than in the Nonmobility conditions. j 

To check this possibility, four judges were 
asked to evaluate the performance of the 22 
subjects in the Mobility condition by inspec- 
tion of their before and after photographs. 
On the basis of the judges’ evaluation, subjects 
were dichotomized into high and low per- 
formers. These dichotomized scores were 
exactly zero-correlated with the subjects’ A 
and R scores. Tabulation of the difference in 
utility between own booth and the next higher 
booth showed that, while low A and R scorers 
valued these unattained outcomes higher than 
did high scorers, high performers also valued 
such outcomes higher than did low perform- 
ers. Variance analysis of these data yielded an 
F significant at the .10 level for rated perform- 
ance, with a smaller F for A and R scores 
and no interaction effect. From this we con- 
clude that it is possible that perception of one’s 
own performance on the task may have con- 
tributed to the failure of the mobility ma- 
nipulation. 


SUMMARY 


An experiment was designed to test the 
effects of attained and unattained outcomes 
on the perception of subjective status by 
members of small groups. In groups of three, 
62 female junior high school students were 
asked to evaluate the attractiveness or de- 
sirability of five cosmetic booths, which were 
arranged in a Guttman-scale status system. 
Six groups of subjects experienced all the 
outcomes in the system (used all booths in 
the make-up task), and the subjects in the 
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remaining groups experienced the outcomes 
in only one of the three middle status posi- 
tions. These latter subjects have thus experi- 
enced the outcomes in their own positions 
and in positions below them, and are aware of 
unattained outcomes in status positions above 
them. Three variables were investigated: the 
presence of unattained outcomes for the 
subjects in the status System, perception by 
the subjects of the possibility of status mobil- 
ity, and a personality variable measure by the 
Welsh A and R scales. It was found that the 
personality variable was instrumental in the 
differential perception by subjects of attained 
and unattained outcomes in the status system. 
As predicted, the low scorers on the 4 and R 
scales valued the utility of unattained outcomes 
higher than did the high scoring subjects, while 
no differences attributable to the personality 
measure were found in the valuation of 
attained outcomes. 
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MODELS OF GROUP PROBLEM SOLVING! 


EDWIN J. THOMAS anp CLINTON F. FINK? 
University of Michigan 


Eviews of the research on group 
R problem solving (Kelley & Thibaut, 
1954; Lorge, Fox, Davitz, & Brenner, 
1958) indicate that sweeping conclusions about 
the superiority of groups over individuals are 
not warranted. However, studies in this area 
show that it may be fruitful to examine the 
conditions under which groups should be 
superior to individuals. This paper examines 
conceptually the requirements of the problem 
solving task and selected group variables that 
affect the quality of a group’s solution, and 
suggests three models for group problem 
solving. Following the presentation of an 
experiment that tests some of the assumptions 
of the models and sheds further light on proc- 
esses affecting the quality of group solutions, 
the paper terminates with a discussion of 
other models of group problem solving and of 
extensions of the models to preference 
phenomena. 

Each model specifies theoretical probabilities 
for the outcome of group problem solving, and 
the antecedent conditions that should produce 
the expected outcomes. A significant restriction 
on the application of the models is that the 
group’s product must be expressed as the 
distribution of individual answers. Distribu- 
tions of individual responses as group products 
are found in legislative votes, committee votes, 
and in court decisions, although these are not 
always answers to problems. A less imposing 
restriction is that the individual answers 
must be expressed as one of three or more 
discrete possibilities, such as a correct answer 
versus two or more incorrect answers; this 
restriction will be relaxed at a later point in 
the paper. 

The independence model states the theoreti- 


1 This article reports portions of a research project 
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Michigan. 

* The authors wish to express their gratitude for the 
help provided by Rosemary Conzemius in the early 
phases of this study. 
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cal probabilities for different distributional 
possibilities assuming that the group members’ 
answers will not differ from the responses of 
individuals working alone, when these are 
grouped into the same distributional categories. 
The probabilities for the distributional cate- 
gories are derived from selected expansion of 
the multinomial theorem, (f: + f» ++ +++ 
p+)", where p, equals the probability of success 
of an individual working alone, f» through pr 
are the probabilities, respectively, of different 
wrong answers, and n is the number of in- 
dividuals in the group. Table 1 presents the 
distributional categories and theoretical prob- 
abilities. If p, is the probability of success of 
an individual working on the problem alone, 
and Pe is the probability that all of the group 
members will be correct, then Pc = px". The 
probability that there will be some members 
correct and some incorrect, Po, ;, is the re- 
mainder obtained by subtracting from 1 the 
probability that everyone will be correct, 
x", and the probability that no one will be 
correct, (1 — pı)”. Pr, v, the probability that 
the group members will be unanimously, 
incorrect is the sum of the probabilities that 
all members will have a given incorrect 
answer, (5j +...+ pr”). The probability 
that all members wil have mixed incorrect 
answers, Pr, y, is the remainder of subtracting 
from the probability that no one will have a 
correct answer, (1 — pı)”, the probability of 
all members being unanimously incorrect, 
(br d. bee. 

The outcome stipulated for the independence 
model will obtain when the individuals have 
essentially no influence upon one another, such 
as when there is no interaction, communica- 
tion, or interdependence among the group 
members. 

"The rational model states that when at least 
one member of the group obtains the correct 
answer to the problem all other members will 
adopt the answer. In other words Pc equals 
1 — (1 —p:)", the probability that at least 
one member of the group will have the right 
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TABLE 1 
THEORETICAL PROBABILITIES OF OBTAINING GIVEN DISTRIBUTIONS oF GROUP MEMBERS’ ANSWERS FOR 
DIFFERENT MODELS 


(Multialternative Cases) 


Type of Distribution 
ee All Members In 
Ne All Members Correct | Same Members Correct | An Members Incorrect, Mixed ge ae Ticor 
S Pow ) TU 
Independence i 1-[RG-»| G-»"-(QP«eEgS] (QU 
Rational 1- 0 p" o U= He Ha | O H + pe) 
Consensus >p” and <1 — (1 — p)” <1 — [p +1 p)" | <U p)” GP Hp”) | >OP H5) 


2 f = the probability of success of an individual working alone on the problem; s + --- + f, = the probabilities, respectively, of 
incorrect answers for individuals working alone on the problem; s = number in the group; and p; > 0 and <1 where i = 1,-++, r and 


(it Pi B) = 1 


answer? As may be seen in Table 1, Pc; 
equals 0, for if one person has the correct 
answer, the model assumes that all others will 
adopt it. The probabilities for P; a and Py, v 
are the same for the independence and rational 
models. The extent to which groups perform 
better than individuals is given by the size 
of the group, n, with Pc being an increasing 
function of n. The superiority of the group 
over the individual will be a. positively de- 
celerating function of m with larger values 
of p leading to a more rapid deceleration. 

This model has been termed the rational 
model because it assumes that as soon as the 
correct answer is heard it will be adopted by 
all others in the group. Group members would 
be expected to be “rational” in this sense 
under three conditions. First, the correct 
solution must be offered in the discussion by 
Some member. Second, the correctness of the 
solution must be persuasively demonstrable 
either because the person who offered the 
solution was judged to be trustworthy and 
expert or because of the verifiability of the 
answer. A verifiable solution is one which can 
be judged correct or incorrect by formal 
criteria, such as logic or mathematics, or by 
physical standards, such as weight or length. 
Third, in order for the rational model to apply 
there must be no potentially inhibiting group 
factors, such as strong pressures toward uni- 
formity or a rigid authority structure. 


* Essentially this term was suggested by both Lorge 
and Solomon (1955) and Faust (1959) as the expression 
for the likelihood that the group will arrive at the cor- 
rect answer, but these writers have limited their 
formulations to the two-alternative case. 


The consensus model assumes pressures 
toward uniformity such that the expected 
Proportion of groups with all members being 
correct or all having a given incorrect answer 
is greater than that predicted by the in- 
dependence model. Both of these increases 
occur at the expense of the proportion of 
groups with mixed answers. As Table 1 shows, 
the theoretical probability for Pc is a value 
falling between the probabilities for the in- 
dependence and rational models, the value of 
Pr, v is greater than the quantity for the other 
two models, and the values for the other two 
terms, Po, r and P; x, are less than those given 
for the same categories by the independence 
model. The consensus model is one of many 
"process" models in which the theoretical 
probabilities for the various distributional 
categories differ from those of the independ- 
ence model because of group processes affect- 
ing members. 

The condition leading to the outcomes 
stipulated for the consensus model is pressures 
toward uniformity, which induces members to 
adopt one answer and not different answers. 
Festinger (1950) has described two conditions 
conducive to pressures toward uniformity: 
dependence upon others for the validation of 
the “correctness” of one's opinions (low physi- 
cal reality); and uniformity desired or neces- 
Sary for the group to move toward a goal 
(group locomotion pressure), Another con- 
dition affecting pressures toward uniformity 
is the absence of formal rules, such as a logical 
system, by which a solution could be found to 
be correct or incorrect (ie. low “formal” 
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reality). These three conditions create de- 
pendence on the opinions of others and con- 
stitute nonobjective bases for changing 
opinions. 

Assuming that the consensus model were 
operating, which would be superior: groups 
or individuals? An important variable con- 
ditioning the effect of pressures toward uni- 
formity is the magnitude of f; relative to the 
largest of the probabilities for wrong answers 
to the problem. If pı is greater than any of the 
probabilities for the incorrect answers, then 
the group's performance will tend to be 
superior to that of the individuals—assuming 
that a majority in the group would generally 
achieve dominance over a minority. Anal- 
ogously, if any wrong answer is greater than 
f: then that answer will tend to be adopted 
and, to the extent that the wrong answer is 
adopted, the performance of the group will of 
course be inferior to that of the individuals 
working alone. 

The experiment reported below was con- 
ceived to test the rational model, although 
the results obtained bear on the assumptions 
of all three models, Two related predictions 
from the rational model were tested: that the 
performance of groups will be superior to that 
of individuals and that the group superiority 
increases with the size of the group. These 
predictions derive from the assumptions of 
the rational model that all members of a group 
will adopt the correct answer when it appears 
and that the probability of having at least 
one member correct increases with group size. 


METHOD 
Subjects 


Fifty male and female undergraduates at the 
University of Michigan worked on the problem indi- 
vidually. Their responses provided the basis for the 
various p values used in computing theoretical prob- 
abilities. Another 143 undergraduates participated as 
members of 44 groups, ranging in size from 2 to 5 
members. There were 16 two-person groups, 10 three- 
person groups, 9 four-person groups, and 9 five-person 
groups; half of the groups had only male members, the 
other half only female members. Even though Sizes 4 
and 5 have an unequal number of male and female 
Eroups, the possible sex differences essentially cancel 
out and the design was treated as if there were an equal 
number of males and females in each group size. 


Procedure 


E Following a general introduction to the experiment, 
individual answer sheets were distributed to the sub- 


jects and the experimenter stated Maier’s horse trading 
problem, repeating it for clarity. “A man bought a 
horse for $60 and sold it for $70. Then he bought it 
back for $80 and sold it again for $90. How much money 
did he make in the horse business?" (Maier & Solem, 
1952, p. 280). At the end of one minute, the experimen- 
ter collected the answer sheets. The procedure thus 
far was the same for individuals and groups. 

Next (designated Time 2) for those who worked on 
the problem individually, the experimenter instructed 
them to think over the problem (giving them 2 min- 
utes) after which they would have another opportunity 
to indicate their answers. At the end of 2 minutes, 
subjects filled out a final questionnaire, 

In the group condition at Time 2, members were 
seated around a circular table, and instructed to discuss 
the problem in order to arrive at a group decision as to 
the correct answer. No procedures for discussion were 
given and no one was designated as the leader. The 
answers of the group members were to be written on a 
report form that was likened to a committee report in 
which the answer of each individual is represented. In 
order to minimize pressures toward uniformity, subjects 
were explicitly told that the report might be either 
unanimous or divided so that it would reflect the 
opinion of each member. The experimenter handed the 
report form to a group member, randomly preselected, 
who was instructed to record the name and the answer 
of each person when the group reached a final decision, 
Before discussion began, the group was informed that 
there was a time limit (unspecified) and that if they 
had not reached a decision within this limit, a warning 
signal would be given and they would have to complete 
the report without further discussion. Each group was 
given a maximum of 2 minutes per member to discuss 
the problem. Prorating of time by size was used because 
a fixed time might have placed the members of the 
larger groups at a disadvantage in solving the problem. 

During the discussion, the experimenter observed 
the participation of each member, marking a single 
tally on an observation sheet each time a person gave a 
response in one of the following categories: states 
correct answer, rejects incorrect answer, gives rationale 
for correct answer, and restates problem correctly 
(facilitative acts); states incorrect answer, rejects 
correct answer, gives rationale for incorrect answer, 
restates problem incorrectly (hindering acts). 

After the group report had been collected each mem- 
ber completed a questionnaire that inquired into his 
reactions to the group and the discussion. 


Task 

Maier’s horse trading problem may be solved by 
arithmetic, an example of what we have called formal 
reality, rather than through resorting to any physical 
operations. The problem seems simple, for fundamen- 
tally it involves the addition of $10 to $10. However, it 
is a story problem and is moderately difficult; only 45% 
of the subjects in the experiment of Maier and Solem 
(1952) gave the correct answer ($20). 

In that experiment the investigators found that 
when a leader allowed a minority to speak up the 
number of persons later giving the correct answer was 
greater than for the condition in which a minority was 
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not allowed to speak up. The authors (Maier & Solem, 
1952) say: “The reason seems to be that a minority 
can effectively compete with the social pressure of the 
majority only when the minority has reality on its 
side. Thus it was only when the minority held the 
correct answer that members of the majority changed 
their views" (p. 286). It appeared at the outset of 
experimentation that the problem had at least a 
moderate amount of verifiability and was therefore 
suitable for testing the rational model. 


RESULTS 
Rational versus Consensus Model 


The rational model predicts that the prob- 
lem solving of groups will be superior to that 
of individuals. The results for groups, however, 
were found to be no better than for individuals; 
48% of the group members gave the correct 
answer at Time 2; this was also the percentage 
of individuals working alone who gave the 
correct answer at Time 2. The percentage of 
unanimously correct groups was 34. 

Table 2 shows the percentages of groups 
which fell in the various distributional cate- 
gories and the proportions predicted from the 
independence model. The theoretical prob- 
abilities for the independence model are given 
in the table because predictions from both the 
rational and consensus models can be stated 
in terms of the proportions for the independ- 
ence model. The probabilities for the rational 
model of obtaining unanimously correct groups 
are found by summing the values in Columns 
land 3 for each size; the theoretical probability 
of having some members correct and incorrect 
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is zero, and the predicted proportions for the 
other two distributional categories are the 
same as those for the independence model. 

Inspection of the obtained proportions of 
groups in Table 2 clearly indicates no support 
for the rational model. Expected proportions 
in Column 1 plus Column 3 range from 73% 
to 96%; but the obtained proportions in 
Column 2, although elevated with respect 
to those predicted by the independence model, 
are appreciably lower than those predicted by 
the rational model, ranging from 20% to 
55%. Furthermore, the proportions in Column 
4, while lower than those predicted by the 
independence model, are still considerably 
higher than zero, and in fact are not notably 
different from the proportions obtained in 
Column 2. The outcomes predicted for the 
consensus model, moreover, are evident in 
Table 2. With one exception, the obtained 
proportions in Columns 2 and 8 are con- 
siderably larger than those predicted by the 
independence model, and the obtained propor- 
tions for both Columns 4 and 6, with two ex- 
ceptions, are less than the theoretical prob- 
abilities of the independence model. Testing 
against the theoretical proportions of the 
independence model, the Kolmogorov-Smirnov 
one-sample test indicates a significantly ele- 
vated proportion of unanimous groups for Sizes 
2, 3, and 5; for Size 4 the p value is larger than 
05 but less than .10. Noteworthy also is the 
absence of a size effect. 

In addition to showing that the distribution 


TABLE 2 
‘THEORETICAL PROBABILITIES FOR THE INDEPENDENCE MODEL AND OBTAINED PERCENTAGES OF HAVING ALL, 
Some, on No Group MEMBERS Correct at TIME 2, By Group SIZE 


All Members une Members)" an Mesbers Tni All 

Correct* CERE rect, Mixed | “rect Dems 
Size p value? 

(1) & G) 4) G) (6) (2) (8) 

Theoret- b- Th = E $ 4 

ical | tained | "Scat | enea | THQ | OR, | Theoret | Ob. 
Two (16 groups) 23.0 | 37.5 | 49.9 | 25.0 | 16.24 | 6.30] 10 <.05 
Three (10 groups) 11.0 | 20.0: | 74.8 | 20.0 | 11.34 | 0:00 230 60.00 <.01 
Four (9 groups) 5.3:51892:2 87.4 | 44.5 6.63 | 11.10 .66 | 22.20] <.10 
Five (9 groups) 2.5.1 55.5 | 93.7... |-33:2 3.62 | 11.10 18 0.00| <.01 


* Column (1) computed from pı” where pi = .48, the likelihood that an individual wo 


uld give the correct answer ($20.00) the second 


time while working alone on the problem; Column (3) computed from 1 — [pi^ + (1 — $1]; Column (7) OR ea ene 


pı”), where each value, respectively, equals .28, .14, and -10, the likelihood that an indi 


ividual would give, respectively, the wrong an- 


swers of $10.00, $0.00, and $30.00 at Time 2 while working alone on the problem; Column (5) computed from [(1 — po” — Cpr" + 2° 


Ton. 


b By Kolmogorov-Smirnov one-sample test comparing the theoretical with the obtained for the categories of unanimous and non- 


unanimous for a given size. 
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of Time 2 answers corresponds to proportions 
predicted by the consensus model, it is neces- 
sary to demonstrate that the unanimity at 
Time 2 was not due to the distribution of Time 
1 answers. Table 3 shows that at Time 1, 6 
of the 44 "groups" were unanimous whereas 
at Time 2, 28 of the groups were unanimous—a 
large convergence toward uniformity (x? — 
20.00 by McNemar's test, p <.005). The 
unanimity is gained clearly at the expense of 
diverse answers, and does not favor the correct 
answer. Again no size effect is apparent. 

It is not easy to demonstrate the degree of 
verifiability of the horse trading problem 
because only indirect indicators of verifiability 
were available. If the problem were highly 
verifiable, one would expect individuals with 
the correct answer to be certain of their answer 
and those with an incorrect answer to be 
uncertain. Of the 19 individuals who worked 
alone and were correct at Time 1, 16 were 
highly certain of their answer; of the 28 who 
got the answer incorrect who worked alone, 15 
were highly certain of their answer (x? = 4.71, 
p <.05); all subjects not highly certain were 
at least moderately certain of their solutions. 

If verifiability of this problem were high we 
would also expect others to adopt the correct 
answer once they heard it. If a subject were 
correct in a lone minority and mentioned his 
answer to the group, the others should thus 
adopt it. In each of 18 groups, only one member 
had the correct answer prior to the group 
discussion; in 6 of these groups the correct 


TABLE 3 
NUMBERS or Groups HAVING VARIOUS DISTRIBUTIONS 
OF MEMBER ANSWERS AT TIME 1 AND TIME 2 


Some Mem- 
[All Members |bers Correct |All Members AIT Members 
: ELBE) al ixed ' | Unanimous 
Size 
1) L2 LG | 0 |. | (9 | (7) |. 
Tine tine Time | Time | Tiare | Tare | Tie Tre 
3e Spee) r a | eval a eT 
Two 0 6 9 4 5 1 2 5 
Three peat eit DERI S e wl EATT 6 
Four 0 2 7 4 1 1 1 23 
Five 0 5 8 3 1|1]|0 0 
Total Do e A 452728 3s erts |n Ld ss 
^ Time 1 “groups” were the answers of individuals 


working aloneat Time 1 who later were actually grouped together 
to discuss the problem at Time 2. 


subjects managed to get all others in their 
group to adopt the correct answer on the 
group report, and all 6 of these subjects talked 
the most during the group discussion; but in 
11 of the 12 groups that did not unanimously 
adopt the correct answer at Time 2 the correct 
subjects did not talk the most in their respec- 
tive groups (p < .001 by Fisher's exact test). 
Thus whether others change to the correct 
answer depends upon how much talking the 
lone, correct subject does. 

Independent evidence of pressures toward 
uniformity was obtained from responses on 
the post-experimental questionnaire. Over 
half (56%) of the subjects endorsed strongly 
or very strongly the question: “Did you want 
the group to arrive at a unanimous decision?" 
Only 23% of the subjects were indifferent 
or did not want the group to arrive at unani- 
mous decision. The subjects were also asked 
about how much pressure they felt from others 
to arrive at a unanimous answer; 81% of the 
subjects indicated that they felt pressure either 
moderately, strongly, or very strongly—the 
remainder feeling either slight pressure or 
none. The subjects were also asked about the 
extent to which they felt free to disagree with 
others in their group; 71% indicated that they 
did not feel free to disagree, the remainder 
indicating that they felt free to some extent. 
By absolute standards pressures toward 
uniformity appear to have been high in these 
experimental groups. 

The members’ reactions are also pertinent 
to the question of whether pressures toward 
uniformity were operating. Table 4 reports 
various member reactions by whether the 
group was unanimously correct, unanimously 
incorrect, or nonunanimous. The figures in the 
table are mean sums of ranks of individual 
scores, given to provide some rough estimate 
of central tendencies of the three groups. The 
various member reactions support the con- 
clusion that a goal for most of the groups was 
to attain unanimity and that failure to attain 
unanimity was associated with lack of unity 
of the group, lack of satisfaction, perception 
that there was not enough time and that the 
members had relatively low influence over 
one another. In the last row of the table there 
is again evidence from the question concerning 
felt pressure toward uniformity that there was 
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TABLE 4 
Meran Sums or RANKS OF INDIVIDUAL SCORES ON 
VARIOUS MEMBER REACTIONS, BY UNANIMITY OF 
OPINION IN THE Group, Tre 2 


j.| Unani- | Sprit, 
Member Reactions | mous, | mous, [Correct |, | ju 
Correct | “DCor- jand In- 
rect |correct 
Perceived influence in| 74.615 81.33 | 61.25 | 23.26]<.001 
group 
Judged adequacy of time] 59.11 57.25 | 85.90 | 10.39| .01 
Satisfaction with group} 93.90 | 89.18 | 36.80 | 62.86|«.001 
report 
Perceived unity of group) 78.26 | 76.45 | 57.83 | 7.67|«.05 
Felt pressure to una- 76.61 78.11 58.11 7.27|«.05 
nimity 


* Kruskal-Wallis one-way analysis of variance. 

> Maximum scale values for the questions used to measure 
member reactions were 3, 3, 9, 4, and 6, respectively, for those 
member reactions listed in the left-hand column of the table; the 
higher the mean the higher the value on each variable, except for 
the judged adequacy of time where higher means indicate less time, 


stronger pressure felt in both the unanimously 
correct and the unanimously incorrect groups. 
The amount of pressure toward uniformity 
that subjects felt was not related to group size. 


Factors Affecting Quality 


Size. The results thus far indicate no size 
effect. A suggestive, unexpected finding was 
noted, however: groups most likely to have 
members change to the correct answer when 
no one was correct at Time 1 tend to be groups 
of Size 2. At the outset of the group discussion 
there were 11 groups in which all members 
had an incorrect answer; 4 groups had subjects 
who got the correct answer at Time 2, and 3 
of these were two-person groups; the other 7 
groups had subjects who did not get the answer 
at Time 2, and 2 of these were two-person 
groups. The sample of groups is too small for 
the trends favoring two-person groups to be 
significant. 

Unanimity-Diversity. The distribution of 
member answers at Time 1 affects the distribu- 
tion of answers at Time 2. All 6 groups that 
were unanimous at the outset of the group 
discussion were unanimous at the end when 
they gave the Time 2 report. In contrast, of 
the 38 groups that were nonunanimous at the 
beginning of the discussion, 32 were groups 
that had members who changed their answers 
at Time 2 (x? = 13.51, p < .001). Further- 
more, as might be expected, the unanimous 


groups devoted much less time to discussion 
than did the nonunanimous groups (p < .05). 
The typical pattern was for the subjects in 
unanimous groups to record their answers on 
the group report and hand it immediately to 
the experimenter. 

The above findings suggest that the greater 
the variety of answers at Time 1, the more 
changes of opinion there would be at Time 2, 
Table 5 indicates that for each size, as the 
number of different answers increased, a larger 
percentage of subjects changed at Time 2. 
Although a statistical test would be inappro- 
priate for these results because the individuals 
are not independent as statistical units, the 
trends are most consistent. Analysis of the 
subjects who changed revealed, moreover, 
that high heterogeneity was no more likely to 
lead to change in a positive direction than low 
heterogeneity. The mere mention of the correct 
answer in the group, however, did increase 
slightly the likelihood that members, when 
changing, would give correct rather than 
incorrect answers at Time 2; there was slightly 
more change toward the correct answer when 
both correct and incorrect answers occurred 
at Time 1 than when there were only mixed 
incorrect answers at Time 1. 

Group Processes. While we know that pres- 


TABLE 5 


NUMBER or DIFFERENT ANSWERS AT TIME 1 AND 
AMOUNT OF CHANGE AT TME 2, By Group SIZE 


Change at Time 2 
Number of P 
i ji Wh 
resp AER P Subjects Wh 3 pid. Net eene 
N % N % 
Two 2 13 | 48.20 | 14 | 51.80 
1 0| 0.00} 4| 100.00 
Three 3 3] 100.00] 0| 0.00 
2 7| 37.80] 11 | 62.80 
1 0| 0.00| 9| 100.00 
Four 4 5| 62.50| 3| 37.50 
3 2| 25.00| 6| 75.00 
2 3| 8.70| 13] 91.30 
1 0| 0.00] 4| 100.00 
Five 5 0 ol. 0 = 
4 3| 60.00| 2| 40.00 
3 12| 47.00 | 13 | 53.00 
2 5| 33.33 | 10| 66.67 
1 0 p5 0 wE 


x 


j 
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sures toward uniformity were operating in the 
experimental groups, it is not clear how the 
pressures were put into effect to achieve uni- 
formity from diversity. Three possible explana- 
tions are examined below. 

A theory of “majority rule” would say that 
the answer adopted by the group would be the 
one favored by most of the members prior to 
entering the group discussion. This explanation 
is entirely correct with respect to the groups 
that were unanimous at Time 1, as has been 
noted. However, when there was diversity at 
Time 1 the groups did not as emphatically 
adopt the most highly endorsed answer of the 
individuals. Of the 15 groups that had diverse 
answers at Time 1 and a most highly endorsed 
answer, 9 adopted solutions involving changes 
toward the most highly endorsed alternative 
and 6 did not. 

A second explanation might stress the 
“strong leader.” We have already noted that 
the lone, correct subject is likely to have others 
adopt unanimously the correct answer at 
Time 2, provided that he talks the most. The 
outcome in problem solving for all groups was 
analyzed where there was one person who 
talked more than the others. For the groups 
adopting a unanimous answer at Time 2 the 
evidence clearly supports the strong leader 
interpretation; of the 10 groups in which the 
person who talked most had the correct answer 
at Time 1, 9 were unanimously correct at 
Time 2; of the 11 groups in which the person 
who talked most had an incorrect answer at 
Time 1, 8 groups adopted unanimously the 
Same incorrect answer (p < .01). For the other 
groups there was a nonunanimous outcome 
and there was no evidence that the outcomes 
that were adopted reflected the previous 
answer of the person who talked the most. 

A third explanation might be termed the 
"complete process" theory, for it includes the 
features just mentioned and what occurs at 
each of the steps as the group solves the 
Problem. Three elements may be identified: 
resources, or the valued properties instrument- 
ally relevant for the group to solve its problem; 
pul, or the actualization of these resources 
in the solving of the problem; and facilitative 
Processing, or the way in which the input is 
Processed by the group so as maximally to 
lead to the desired outcome. The complete 


process explanation would account for correct 
solutions at Time 2 as follows: the resources 
of the members are actualized as input and, 
in turn, are facilitatively processed. A poor 
group solution could occur because resources 
were not available, or if they were available 
they were not realized in input, or if realized 
in input were improperly processed. 

This approach was examined by making 
operational each of these elements: considering 
each group as a unit, resources were deter- 
mined by the number of correct Time 1 
answers, input by the number of correct 
answers mentioned in the discussion divided 
by the number of incorrect answers, and the 
degree of facilitative processing by adding the 
number of arguments for the correct answer 
to the number of rejections of an incorrect 
answer and then dividing by the sum of the 
total number of instances a rationale was 
given and the times the correct answer was 
rejected; ouput was the number of correct 
answers at Time 2. Table 6 presents the cor- 
relations for these elements and shows that 
for all sizes input is a function of resources, 
that facilitative processing is a function of 
input and, to a lesser extent, of resources; 
and that output is a function of facilitative 
processing, of input, and (except for Size 2) of 
resources. It is noteworthy that the only size 
effect in the table is the depressed correlation 
(.22) between resources and output for groups 


TABLE 6 


INTERCORRELATIONS (rho’s) or RESOURCES, INPUT, 
PROCESSING, AND OUTPUT, BY GROUP SIZE 


Resources Input Processing: 

Size 2 

Output .22 .94** .Bie* 

Resources .46* .48* 

Input .90** 
Size 3 

Output .66* .56 192. 

Resources 78** +34 

Input .61* 
Size 4 

Output .67* VAT .80* 

Resources .86** AON 

Input .98** 
P Output .62* .86** .86** 

Resources .70* d 

Input ,85 

* 9 < 05 

** p «01 
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of Size 2. This correlation is another indication 
that it is possible in a two-person group for the 
members to arrive at the correct answer at 
Time 2 when they were both incorrect at 
Time 1. 


Discussion 


One reason why the outcomes predicted for 
the consensus model rather than those for the 
rational model were found was because pres- 
sures toward uniformity were high. The pres- 
sures were manifested as (a) the weight of 
the answer most highly endorsed prior to group 
discussion, (6) the members’ disposition to 
change, and (c) the attempts of interacting 
members to change the answers of others. 
Consensus was also favored by the fact that 
most members could not verify an answer to 
determine whether it was correct or incorrect. 
In retrospect, it is clear why Maier’s horse 
trading problem is not as highly verifiable as 
originally presumed. The problem is arithmeti- 
cally simple, yet approximately half of college 
students fail to solve it correctly. It is in fact 
a trick problem in which the subjects are often 
misled by the apparent loss when the horse 
trader buys the horse again, and thus apply 
the operations of arithmetic from an incorrect 
starting point. There would be very high 
accuracy in solving the problem if subjects 
began with the same premises concerning the 
transaction, or if all subjects employed an 
agreed upon set of criteria for solving such 
problems. 

The findings indicated that the two-person 
group was most likely to come up with the 
correct answer when none of the individuals 
had the correct answer at the outset of discus- 
sion. While analysis revealed no clue as to 
why the two-person group was unique, two- 
person groups differ from larger ones in several 
respects when viewed theoretically. First, in 
the two-person group there is opportunity for 
each individual to communicate his opinion 
whereas in larger groups it is possible for one 
or more persons to be “squeezed out.” Second, 
it is not possible for individuals in a two-person 
group to be supported in a minority. In a 
three-person group one can be aligned with 
another in a majority, and in the case of groups 
four and larger one can be aligned with another 
in a minority or an equal sized faction. The 


existence of social support in a minority in- 
creases its strength greatly to oppose a 


majority (Asch, 1953) and provides social . 


validation for one’s position. These features of 
two-person groups may account for their more 
searching and successful exploration of the 
problem when neither member began with the 
correct answer. 

Tf supported by future experiments, the 


models presented here provide a provisional - 
answer to the old question of whether, or . 


under what conditions, groups are superior to 
individuals. The models predict groups to be 
superior to individuals in solving problems 
when (a) verifiability of the problem’s answer 


is high rather than low, or (b) pressures toward ` 


uniformity are high amd the probability of 
Success in solving the problem is appreciably 
higher than the highest probability of having 
an incorrect answer. How much larger the 
value of pı should be than the value for the 
most common incorrect answer, when pres- 
Sures toward uniformity are high, must be 
determined empirically. The models also shed 
light on the relationship between group size 
and the quality of group problem solving. The 
quality of group problem solving will increase 
with , to the extent that verifiability is high. 
Whether the positive deceleration of quality 
with » is rapid or slow depends upon the 
value of pı, problem difficulty. 


Related. Problem Solving Models 


The models proposed here are but three of 
many that may be proposed for problems 
having discrete, multialternative answers and 
in which the group's product isa distribution 
of individual answers. Some additional out- 
come possibilities and model names are given 
in Table 7. The types expressing diversity in 
the distributional outcomes (Process I, II, 
and III) could plausibly arise from norms and 
values sanctioning diversity rather than unani- 
mity of opinion. The types expressing homo- 
geneity of outcome (Process IV, V, and VI) 
could plausibly grow out of pressures toward 
uniformity in which the socially defined cate- 
gories of sameness do not correspond to a 
single solution to the problem. For example, 
for Process IV only combinations of correct 
and incorrect answers would be perceived by 
members as different, Perceptual qualities of 


TS 


TON 
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TABLE 7 t 
THEORETICAL PROBABILITIES OF OBTAINING GIVEN DISTRIBUTIONS OF GROUP MEMBERS’ ANSWERS FOR PROCESS 
Mopzrs 
t (Multialternative Case)* 
Type of Distribution 
Model Names 
AII Members Some Members Correct r All Members In 
rect and Some Incorrect — | Al Members Incorrect, Mixed | AT Mepbers Incor- 
Po Pot DH 1227 

Diversity types 

Process I «p 2i-[W"-ü-»n" | > = pi)" = Ge o> + oe) «Te D) 

Process II <p" >1— [p^ + (1 — )") «ü = n" — (hr te) | «GP em) 

Process IIT <p" «1— [p^ + (1 — 2)"] >(= pi)" = (ba + o HA) | «Gre bae? 
Homogeneity Types 

Process IV 2p" «i-[m Fü-»"] | »0—-2"—- (be + + + de") | Dres em 

Process V >p” 21- [p^ 3 (1 — 5)" «ü — n (AP + ve + he) | > (da + vee + pe) 

Process VI <p" <1 — [p^ + (1 — $)"] «ü — pr)” — (m + o> + he) | De +b py) 

Process VII 1= (1 — fi)” 0 0 (1 = pi)" 
Strong leader 

Process VIII ph 0 0 1-9 


2 For meaning of symbols, see Table 1. 


b The term fj is the probability of the leader solving the problem while working in the group. 


different answers or social norms which serve 
to define sameness differently from the answers 
as objectively determined would be conducive 
to such outcomes. Process VII gives outcomes 
that would be predicted from high verifiability 
of answers combined with pressures toward 
uniformity. Process VIII proposes outcomes 
for groups having a very strong leader whose 
power stems from complete autocratic control 
or from compelling charismatic qualities. If 
the leader gets the answer then all others will 
adopt it; if he gets the wrong answer then 
everyone adopts the wrong answer. The 
models given in Table 7 need to have the 
possible antecedent conditions carefully speci- 
fied so that they too may be studied experi- 
mentally. There are of course other distribu- 
tional possibilities than those in Table 7 that 
might also be candidates for models. 

Many problems call for two-alternative 
solutions, such as a correct or incorrect answer, 
rather than multialternative solutions. Prob- 
lems with two-alternatives can be handled by 
expanding appropriate parts of the binomial 
(b + q)", where p equals the probability of 
Success of an individual working alone on the 
problem, g = 1 — p, and n is the number in 
the group, For the independence model, 
Po = f^, Per = 1 — (p^ + q") and Pry = 
q”. The expressions for the other two models 
are analogous to those given for these models 
in Table 1. 


A restriction on the present models is that 
they apply to distributions of member answers 
and not to unitary, undifferentiated group 
products, such as a single decision or answer. 
1f single group products composed of uniform 
elements and arrived at by majority rule or 
unanimous agreement are found to bear an 
explicit relationship to the distribution of 
member answers prior to formulating the group 
outcome, then we may learn how the models 
may be extended to apply to such outcomes. 
However, these models can not be applied to 
complex, differentiated group products com- 
prised of an integration of diverse individual 
contributions. 


Extension of Preferences 


The theoretical probabilites for the various 
distributional outcomes apply to preferences, 
where the term p takes the designation of the 
probability of endorsing yes, agree, or some 
other form of preference. In this extension, 
legislative or committee votes and other group 
decisions based upon the aggregation of mem- 
ber preferences may be treated. The outcomes 
may be for two or for three or more preference 
alternatives. The probabilities for the out- 
comes of the independence model remain, as 
before, as base lines against which to determine 
the effects of the group or social situation. 

However, antecedent conditions for the 
rational model must necessarily be different, 
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even though the outcomes predicted remain 
the same. No objective criteria are available 
to validate preferences, and the norms and 
values of the group (social reality) are resorted 
to to justify preferences, depending upon the 
relevance of the preferences to the norms and 
values in question. In offering criteria for 
validating individual preferences, such norms 
and values may be said to provide normative 
justifiability. Preferences will converge toward 
& norm so as to produce a final distribution of 
preferences like that predicted for the rational 
model when: (a) the norm in question is 
internalized, punishment for deviation is 
moderate to Strong, or reward for conformity 
to it is moderate to high; (b) the norm guides 
behavior most effectively if aroused and made 
salient; and (c) an expressed preference closest 
to the norm arouses and makes salient the 
norm for the other group members. Under 
these conditions, if one Person adopts the 
normatively preferred alternative so will all 
others. It follows also that Pe will be less than 
1 — (1 — 5) to the extent that any of the 
above conditions is not met and Po will tend 
toward unity and the values for the other 
distributional alternatives will tend toward 
zero to the extent that the norms are interna- 
lized by members and there are Strong sanc- 
tions for conformity and deviation, and the 
norms are ubiquitously salient, 

As for the consensus model, the antecedent 
condition of pressures toward uniformity 
applies to preferences as well as to answers, 
and the'mathematical outcomes are the same 
as for answers to problems, However, unlike 
the normative model, there must be no norms 
or standards to steer consistently the prefer- 
ences one way or the other, or otherwise the 
convergence would not be indiscriminate, as 
predicted. 


SUMMARY 


Three models of group problem solving were 
Proposed which stipulate the antecedent condi- 
tions and theoretical outcomes applicable to 
group products consisting of distributions of 
individual answers as expressed in three or 
more discrete categories. The independence 
model states the theoretical Probabilities for 
different distributional Possibilties assuming 
that the group members’ answers will not 


differ from the responses of individuals working 
alone. The rational model states that when a 
least one member of the group obtains the 
Correct answer to the problem all other mem: 
bers will adopt the answer; a major require 
ment for the rational model to operate is 
verifiability of the answer to the problem, 
The consensus model assumes Pressures toward 
uniformity such that the expected Proportion 
of groups with all members i i 


tions the Superiority of groups over individuals 
in problem solving increases with the number 


These predictions and assumptions underly- 
ing them were tested in an experiment in 
which 193 subjects solved Maier’s horse 
trading problem either as individuals working 
alone or in groups ranging in size from two 
to five. 

The outcomes did not correspond to those 
Predicted for the rational model and, instead, 
matched the theoretical probabilities of the 
consensus model. F: urthermore, the antecedent 
conditions for the consensus model of high 
Pressures toward uniformity and low verif | 
ability were demonstrated to exist. Pressures 
toward uniformity were found to be repre- - 
sented concretely as a disposition of members : 
to change and as active attempts of interacting 
group members to Change the answers of 
others. Factors Positively influencing quality 
were (a) the diversity of the member's initial 
answers, (6) the number of members with 
ability to solve the problem prior to entering 
the group discussion, and (c) the frequency 
With which correct answers and their ration- 
ales were endorsed during discussion. The 
elements of resources, input, and the processing 
ofinputinthe group were thus to varying degrees 
determinants of the final outcomein problem 
solving. Groups of Size 2 were the only ones 
in which members could arrive at final correct 
answers when neither member could initially 
solve the problem, 

Eight other multialternative models for 
problem Solving were Suggested and an exten- 
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sion of the models to preferences was discussed. 

The two-alternative problem or preference 

was noted as a special case to which the models 

applied. 
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cate that a person does not respond to 

the space about him as if it were an 
homogenous medium. Instead, he manifests 
anisotropic or directional biases that reflect 
the assignment of special valences to various 
spatial sectors. Illustratively, it has been 
demonstrated that both the nearness and 
intensity of a stimulus may vary in terms of 
whether it appears on the right or left (Adair 
& Bartley, 1958; Dallenbach, 1923). Speed of 
movement may be evaluated differently 
depending upon whether it proceeds to the 
right or left (Takala, 1951). Constant errors 
either to the right or left may occur in judg- 
ment of verticality (Sandstrom, 1954). The 
degree of a person's anisotropic response can 
be influenced by controlling the nature of the 
stimulus condition. Thus, it can usually be 
enhanced either by making the situation less 
Structured (Takala, 1951) or by inserting 
special cues with directionality connotations 
(Kaden, Wapner, & Werner, 1955). Beyond 
the influence of the stimulus conditions them- 
selves, however, it has been observed that a 
person may bring to the situation preferences 
for certain kinds of asymmetries. That is, he 
may manifest biases in his perceptions which, 
for example, favor either the right as compared 
to the left or up in contrast to down (Takala, 
1951; Thurstone, 1952). Little or nothing is 
known concerning the variables that underlie 
such individual asymmetry patterns. A few 
studies (Hastings, 1952; Talmadge, 1958; 
Thurstone, 1952) and also some clinical obser- 
vations (Goody & Rheinhold, 1953) suggest 
that an individual’s anisotropies may reflect 
significant features of his personality. Stern 
(1938) particularly affirmed that for each 
person there is a “personal space” which is 
closely bound up with the organization of his 
personality. 


Se findings from previous studies indi- 


1 This study was mainly supported by United States 
Public Health Career Investigator Grant M-1578. 

2Now at State University of New York Medical 
School, Syracuse, New York. 


The present project was intended to explore 

a phase of the “personal space” problem, 
concerns the relationship between achi 
ment orientation and right vs. left directio 
ality in autokinetic movement. First will 
described the results of an exploratory study 
and second the outcome of two cross-validation 
efforts. 1 
Stupy I L] 


There have been various conjectures as to 
the psychological significance of right v& 
left space. It has been suggested that right is 
associated with masculinity and left with 
femininity (Fenichel, 1945). Some exploratory 
studies have pointed to the possibility that 
right vs. left is linked with traits of sociability 
vs. reflectiveness (Talmadge, 1958; Thurstone, 
1952). The position has also been taken 
(Lewinson, 1940; Sonnemann, 1950; Wolff, 
1943) that right vs. left constitutes a con- 
tinuum with drive or achievement connota- 
tions. It is this last speculation which the 
present study focused upon. Achievement 
orientation was chosen as the personality | 
variable to be correlated with right-left ani- 1 
sotropy. The direction of autokinetic move 
ment was selected as a measure of right-left | 
anisotropy because the autokinetic situation 
is so completely unstructured and therefore 
maximizes the possibilities for a person to 
impose his own biases upon the stimulus. Since | 
dependable data regarding the personality | 
correlates of anisotropy are lacking, it was - 
decided in this initial phase of the project not 
to make any specific predictions regarding 
the relationship between need for achieve- | 
ment and right vs. left asymmetry. 


Method 


Need for achievement was measured by means of 
five TAT pictures (1, 2, 6BM, 7BM, 7GF) which were , 
administered to subjects on an individual basis. The ; 
protocols were scored in terms of the scheme described | 
by McClelland (McClelland, Atkinson, Clark, & | 
Lowell, 1953). All scoring was done without any . 
knowledge of subjects’ autokinetic asymmetry patterns: | 


m 
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Autokinetic directionality was evaluated by bringing 
each subject into a dark, lightproof room with his eyes 
closed. He was seated at a table on the surface of which 
was a sheet of paper (14 X 17 inches) with a dot in the 
middle of it. The position of the subject was such that 
his vertical midline was lined up with a pinpoint of light 
(13 feet distant and 2 feet from the floor) which consti- 
tuted the autokinetic stimulus. He was told that a small 
light would be shown to him in the dark and that when 
it began to move he was to draw a picture on a sheet of 
paper of its exact path. He was also instructed to indi- 
cate the moment that he noted the first movement of 
the light. Just before ion was to begin and while 
the room was illuminated, the subject was allowed to 
open his eyes long enough to take a pencil and place it, 
on the dot in the center of the sheet before him which 
was defined as his starting point. When the room was 
totally. dark, the subject opened his eyes and began to 
draw the autokinetic path as soon as he perceived move- 
ment. At the instant that the subject announced the 
first perception of movement a timer was started that 
rang a bell after 15 seconds. This signaled to the subject 
to stop drawing and to close his eyes. The same pro- 
cedure was followed for a total of four trials. On the 
occasion of each trial, the subject was supplied with a 
new recording sheet for his autokinetic drawing. 

Scoring of the drawings for directionality was re- 
stricted entirely to the initial direction taken from the 
starting point. All shifts thereafter were ignored, This 
was done because it has been shown elsewhere (Kline, 
1950) that with continued exposure to an autokinetic 
stimulus satiation effects are produced that run counter 
to initial tendencies towards right or left. Actually, the 
only reason a period as long as 15 seconds was employed 
for each trial was because any shorter time interval 
made the subjects feel hurried and under pressure. Each 
drawing was scored as right-directional, left-directional, 
or without definite direction. Definite direction was 
ascribed to a drawing only if the initial path deviated 
more than }¢th of an inch from the vertical at some 
point in its course. 

Subjects. There were 91 subjects, of whom 55 were 
male and 36 were female. Their median age was 20 years 
and their median educational level was 14 years. They 
were all members of fraternities and sororities who were 
paid a fee for participating in the experiment. In order 
to control for the possible influence of laterality upon 
autokinetic directionality, only right-handed subjects 
were used, Subjects were classified as right-handed only 
if they considered the right hand to be the strongest and 
also used it to write, to hold a fork when eating, and to 
hold a hammer when pounding a nail. These subjects 
had originally been recruited for another project to 
es their participation in the present study was inci- 

ntal. 


Results 


The median achievement score was +1. 
For the autokinetic measure the right-direc- 
tional score was 1, the median left-directional 
score 0, and the median nondirectional score 
2. As defined by a chi square analysis, there 


TABLE 1 
Cnr SQUARE ANALYSIS OF RELATIONSHIPS BETWEEN 
MCCLELLAND ACHIEVEMENT SCORES AND AUTO- 
KINETIC DIRECTIONALITY IN EXPLORATORY SAMPLE 


Hi ignifi- 
Autokinetic Direction Aare: Pregl x pe 
ment ment Level 
High right* 20 28 | 4.1] .05-.02 
Low right 27 16 
High left 24 25 | nt -— 
Low left 23 19 
High no direction 34 19 |6.6| .01 
Low no direction 14 24 


® In all of the chi square analyses the cutting point has been 
the median. Values above the median are high and all values at or 
below the median are low. 

b Not significant. 


were no significant sex differences for either 
the achievement score or the autokinetic 
measure. 

As indicated by the chi square values in 
Table 1, subjects with high achievement 
scores were significantly (.05-.02 level) less 
often right-directional than subjects with low 
achievement scores. There was no relationship 
between achievement scores and left-direc- 
tionality. A very significant trend (x: = 6.6) 
appeared for high achievement subjects to 
produce more nondirectional autokinetic move- 
ment than low achievement subjects. When 
the achievement score was related within each 
sex group separately to the three autokinetic 
directionality scores, none of the relationships 
was significant. 

Tt should be incidentally noted that subjects 
manifesting high right-directional scores 
obtained significantly (x? = 21.1; .001 level) 
lower nondirectional scores than subjects low 
in right-directionality. But the relationship 
between left-directionality and nondirection- 
ality was not significant. 


Stupy II 


Method 


The apparent relationship found between achieve- 
ment fantasy and autokinetic directionality was suffi- 
ciently encouraging so that two cross-validation proj- 
ects were undertaken. These studies were very similar in 
design to the first except for certain changes intended to 
increase the range of scores obtained from each measure. 
Thus, the TAT series was altered to consist of 1, 2, 
9BM, 14, and a specially drawn picture depicting a fam- 
ily scene in which a mother and father were observing a 
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TABLE 2 


Cui SQUARE ANALYSIS OF RELATIONSHIPS BETWEEN MCCLELLAND ACHIEVEMENT SCORES AND AUTOKINETIC 


DIRECTIONALITY Scores IN Two Cross-VALIDATION SAMPLES 


Cross-Validation 1 Cross-Validation 2 d 
Autokinetic Direction e zu dl Huh ma sisi y 
hcec t areae | 8 reci Acier aent lAearant |) Tevel Sai 
High right 5 17 a 7 13 E. 
Yos right 16 13 4.2» | .02-.01% 17 9 3.0 | .05-.02 
High left m T m 9 M. 
Ee it 8 20€ 11/16: ^| penne 2 15%) | "5 
High no direction 8 19° 13 10 M 
Lae D ect 13 GERE Hees 10 13 " 


® Yates’ correction has been applied whenever a theoretical frequency of less than 10 occurs. 
b One-tailed tests have been used for the specific predictions involving the relationship of achievement scores to right and left direc 


tionality, 


© The N is 49 instead of 51 because two cases could not be completely scored for left-directionality and nondirectionality. 


daughter and a crippled son. The pictures substituted 
for those in the previous series were thought to have 
potentially greater pull for achievement themes. The 
autokinetic series was altered by increasing it to five 
trials instead of four. Stories were obtained in group 
sessions in which from three to six subjects participated 
at any one time. 

Subjects. The subjects in the first cross-validation ef- 
fort consisted of 51 right-handed women who were re- 
cruited from sorority and social groups. They were paid 
a fee for participating. Their median age was 22 years 
and their median educational level 13 years. 

The subjects in the second cross-validation group 
were comprised of 46 men who were also obtained from 
social and fraternity groups and paid a fee for their time. 
Median age was 20 years and median educational level 
14 years. 


Results 


The median achievement score in the female 
cross-validation group was +6. The median 
number of right-directional autokinetic percep- 
tions was 2. For left-directionality the median 
was 1 and for nondirectionality 0. 

In the male cross-validation group the 
median achievement score was +2. The 
median right-directional score was 2, the 
median left-directional score 1, and for non- 
directionality the median was 0. 

It can be seen in Table 2 that in both the 
male and female groups there was a significant 
tendency (.02 and .05 levels, respectively) for 
subjects with high achievement scores to 
manifest right-directional autokinetic move- 
ment less frequently than low achievement 
subjects. For left-directionality the results 
were less consistent. In the female group high 


achievement subjects produced 
tional movement significantly (x? = 6.9) more 
often than did low achievement subjects, 
However, in the male group this relationship 
was of a chance order. High achievement sub- | 
jects in the female group manifested non- 
directional movement significantly less often 
than did low achievement subjects; but no such 
relationship appeared in the male group. In 
both of the cross-validation groups there was 
only a chance relationship between right- 


| 


left-direc- 


directionality and nondirectionality and be- | 


tween left-directionality and nondirectionality. 


DISCUSSION 


The data from the exploratory and two 
cross-validation projects clearly support the 
proposition that there is a significant li 
between achievement orientation and right- 


directional autokinetic movement. In all three | 


instances subjects with low achievement 
scores manifest more right-directional move 
ment than do high achievement subjects. The 
relationship between achievement themes and 
left-directionality tends to be nonsignificant 
and might perhaps be characterized as indeter- 
minate. Thus, in the exploratory study and in 
the male cross-validation group the relation 
ship was nonsignificant; but in the female 
group high achievement subjects were signifi 
cantly more left-directional than low achieve 
ment subjects at the .01 level. The reason fot 


-— 


this variation in results from group to group — 


is not readily apparent. The pattern of relation” 
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ship between achievement scores and non- 
directional movement fluctuated erratically 
and nonmeaningfully from group to group. 

Overall, the results indicate with assurance 
only that persons with a low achievement 
orientation impart right-directionality to their 
autokinetic perceptions to a greater degree 
than do high achievement persons. These 
findings suggest that at least within the limits 
of the techniques employed in the present 
study anisotropy may be influenced by a per- 
sonality variable. They support the concept 
of a “personal space" with configurations that 
are biased by personality. 

How might achievement orientation influ- 
ence autokinetic directionality? Differences in 
right-left autokinetic directionality have been 
found in some instances to be related to differ- 
ences in the tonus of the right and left sides 
of the body. Wishner and Shipley (1954), for 
example, found that right-handed subjects 
were more likely than left-handed subjects to 
manifest right-directionality in autokinetic 
response. Blane (1957) and Kline (1950) have 
similarly made observations that imply that 
both short-term and long-term asymmetries 
of muscle tonus result in autokinetic bias in 
the direction of the most activated side. If 
this be so, it is possible that need for achieve- 
ment is translated into autokinetic direction- 
ality via patterns of muscular tonus. 

A speculative picture of this process will 
now be offered. To begin with, in the average 
right-handed person there tends to be more 
tonus on the right side than on the left that 
accompanies a set to respond preferentially 
with the right hand (Davis, 1956; Miller, 1957). 
It may be conjectured on the basis of observa- 
tions made by Davis and Miller that the more 
the subject is set for immediate response with 
the right hand the greater will be the differ- 
ential tonus level favoring the right side. At 
this point it may be further considered that 
the subject with a high need for achievement 
1s usually one who has learned to forgo immedi- 
ate gratification for future reward. He is 
Unusually oriented toward the future (Green 
& Knapp, 1959; Teahan, 1958). Therefore, he 
is less oriented to the kind of immediate readi- 
ness for response here conjectured to underlie 
the maintenance of high tonus in the right 
hand. His tonus level on the right side might, 


for these reasons, be lower than that of a person 
characterized by low achievement orientation 
(less future oriented) and by a lesser set for 
delayed reaction. The difference in autokinetic 
directionality between groups high and low 
on the achievement measure would, then, be 
reflection of this fact. This picture is conjec- 
tural, but it does lend itself to objective study. 
SUMMARY 

In three samples of college students, it was 
found that subjects with a low score on achieve- 
ment orientation (from a brief TAT) indicated 
more often initial autokinetic movement to the 
right than subjects with high need for achieve- 
ment. A tentative explanation was offered 
concerning the manner in which differences in 
achievement motivation might be translated 
into differences in the autokinetic measure. 
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viewed and evaluated the myriad 

criticisms which have been leveled 
against the conventional system of psychiatric 
diagnosis (e.g, Harrower, 1950; Hoch & 
Zubin, 1953; Menninger, 1955; Noyes, 1953; 
Roe, 1949; Rogers, 1951; Rotter, 1954). Two 
frequently recurring arguments have been 
that this classificatory schema lacks reliability 
(Ash, 1949; Boisen, 1938; Eysenck, 1952; 
Mehlman, 1952; Roe, 1949; Rotter, 1954; 
Scott, 1958) and that many of its categories 
encompass individuals who are heterogeneous 
with respect to various criteria (King, 1954; 
Rotter, 1954; Wittenborn, 1952; Wittenborn 
& Bailey, 1952; Wittenborn & Weiss, 1952). 

Several investigators have noted the con- 
ceptual and methodological weaknesses of 
those studies which have emphasized the 
unreliability of psychiatric diagnosis and have 
offered evidence that the present system is 
reliable (Foulds, 1955; Hunt, Wittson, & 
Hunt, 1953; Schmidt & Fonda, 1956; Seeman, 
1953). Although the studies concerned with 
the reliability of psychiatric diagnosis suggest 
that so long as diagnosis is confined to broad 
categories it is reasonably reliable, with relia- 
bility diminishing as one proceeds from broad 
inclusive class categories to narrower, more 
Specific ones (Zigler & Phillips, 1961b). 

The present status of the heterogeneity 
Criticism is much less clear. The confusion 
Surrounding this issue appears to stem from 
the inherent ambiguity of the heterogeneity 
parameter (Zigler & Phillips, 1961b) as well as 
from the failure of certain investigators (Rot- 
ter, 1954) to differentiate homogeneity from 
reliability. As King (1954) has noted, relia- 
bility refers to the agreement in assigning in- 
dividuals to diagnostic categories, whereas 
homogeneity refers to the uniformity of 
behavior subsumed within categories. The 


7 and Phillips (1961b) recently re- 
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failure to differentiate adequately between 
these concepts can in part be ascribed to their 
inherent overlap. 

The demarcation of this area of commonality 
can be sharpened by noting that what defines 
any diagnostic category is a particular symp- 
tom configuration. As the reliability of the 
categories decreases, their heterogeneity must 
increase. This can best be exemplified by the 
extreme case in which individuals are indis- 
criminately assigned to the various classifica- 
tions. In this instance, each category would 
include every possible symptom pattern. But 
although a decrease in the reliability of a 
category is invariably accompanied by an 
increase in its heterogeneity, categories which 
subsume quite heterogeneous phenomena are 
not necessarily unreliable. There are many 
categories throughout science which, though 
reliable, subsume quite diverse phenomena— 
e.g., organic vs. inorganic matter. Thus, the 
reliability of psychiatric diagnosis is not deter- 
mined by the number or diversity of symp- 
toms which any diagnostic category encom- 
passes, but is determined rather by the extent 
to which the same symptoms appear in more 
than one category of the system. 

On the basis of the studies noted, it appears 
reasonable to assume that each of the broad 
diagnostic categories in common psychiatric 
use is both heterogeneous (subsumes a large 
number of diverse symptoms) and reasonably 
reliable (any particular symptom will tend to 
occur in only one category). One of the major 
purposes of this study is to evaluate empirically 
the validity of this position. Although the 
defense of the present diagnostic system tends 
to rest almost exclusively on the merits of its 
descriptive and nondynamic aspects (Caveny, 
Wittson, Hunt, & Herman, 1955; Eysenck, 
1952; Hunt et al., 1953; Jellinek, 1939), sur- 
prisingly little evidence is available that 
membership in a diagnostic category conveys 
accurate information about the gross symp- 
tomatology of the patient. The findings of 
studies that have examined this issue (Freuden- 
berg & Robertson, 1956; Wittenborn, Holz- 
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berg, & Simon, 1953) suggest that although 
the occurrence of certain symptoms tend to 
be associated with particular diagnostic cate- 
gories, many symptoms are related to more 
than one category. These findings also indicate 
that even in those cases where a statistically 
significant relationship exists between the 
occurrence of a symptom and a diagnostic 
category, the symptom also occurs in a remark- 
ably high proportion of individuals assigned to 
other diagnostic categories. 

That further investigations of this type are 
needed can be emphasized by pointing out 
certain ambiguities of the two studies just 
noted. One of these (Wittenborn et al., 1953) 
employed narrow diagnostic categories rather 
than broad, inclusive ones. As pointed out 
earlier, this reduces the reliability of the cate- 
gories and affects both the number of symp- 
toms appearing in a category and the overlap 
of symptoms from category to category. 
Neither study employed a sample of patients 
in which all of the major functional diagnostic 
groups were represented. Nor did either study 
make use of the spontaneously noted symp- 
toms of the patient. Instead, each patient in 
both studies was rated on each of the 55 symp- 
toms which compose the Wittenborn symptom 
scales. Thus, the upper limit of the number of 
symptoms which appear in any category is 
arbitrarily established. 

Use of the Wittenborn scales raises the 
issue of the pathognomonic vs. nonpathog- 
nomonic symptom, a distinction advanced by 
Thorne (1953). In view of the manner in 
which symptoms were chosen for inclusion in 
the Wittenborn (1950) scales, it is quite 
possible that many of these symptoms are 
not those characteristically employed to define 
membership in a particular diagnostic class, 
but are nonpathognomonic symptoms, char- 
acteristic of virtually any hospitalized indiv- 
ual. If such were the case, use of the 
Wittenborn scales would artificially increase 
the number of symptoms occurring in any 
diagnostic category while also making it more 
likely that a symptom would appear in more 
than one category. 

In addition to a further clarification of the 
homogeneity and reliability issues, the second 
major objective of this study is a comparison 
of the present diagnostic system with the 


system of psychiatric classification rei 
advanced by Phillips and Rabinovitch (1' 
These investigators, after empirically ident 
fying three symptom clusters, conceptualize 
one of the three groups of symptoms as beir 
indicative of “self-deprivation and turnin 
against the self,” another as “self-indulgen 
and turning against others," and a third ¢ 
indicative of “avoidance of others.” In 
second study (Zigler & Phillips, 1960), 
was found that these three symptom ¢ 
gories were related to the level of premori 
social effectiveness attained by the patien 
After a cursory examination of the sympto 
in each of the three categories, it was suggest 
that these three symptom categories Co 
respond to the manic-depressive, charac 
disorder, and schizophrenic syndromes, T 
spectively. In view of this suggestion and t 
findings of Hollingshead and Redlich (19 
that conventional psychiatric categories: 
also related to premorbid social adequacy; 
of interest to investigate the degree of C0 
respondence or overlap between the Phill 
and Rabinovitch symptom categories 
conventional psychiatric categories. 


METHOD 


Subjects. The study was based on an examinatio 
the case histories of 793 patients admitted to Wore 
State Hospital during a 12-year period (1945-1! 
The particular case histories used were those of pati 
who were referred to the psychology departm 
psychological appraisal and who were even 
diagnosed as suffering from a functional disorder. 
this sample includes a wide variety of cases, extrel 
deteriorated or very agitated patients are sel 
referred for psychological evaluation and are not 
quately represented here. These patients differed 
those used in earlier studies (Freudenberg & Robel 
1956; Wittenborn et al., 1953) in that they int 
individuals who varied widely in their degree of 
nostic difficulty, rather than presenting little 0! 
diagnostic problem. 

Diagnosis. The diagnosis ascribed to each 
in the study was that psychiatric classification ag 
upon at a diagnostic staff conference and noted 0D. 
patient's hospital record as his primary diagnosis. 
diagnostic classification of mental illness used 
Worcester State Hospital during this period was b 
on the Standard Classified Nomenclature of Di 
(1945-1952) and Diagnostic and Statistical Ma 
Menial Disorder (1953-1957). 

j The patients in the present study were cates 
into four major diagnostic groups: manic-dep! 
schizophrenic, psychoneurotic, and character dis 
The composition of each of these groups was as foll 
1. Manic-depressive here was defined to 
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the diagnoses of involutional psychotic reaction; 

manic-depressive, manic; manic-depressive, de- 

pressive; manic-depressive, other; and psychotic 

depressive reaction (V = 75). 

2. The schizophrenic group included those with 
diagnoses of the following types of schizophrenic 
reactions: simple, hebephrenic, catatonic, paranoid, 
undifferentiated, and schizo-affective (IV = 287). 

3. Psychoneurotic patients carried diagnoses of 
the following types of psychoneurotic reactions: 
anxiety, dissociative, conversion, phobic, obsessive- 
compulsive, depressive, and other similar reactions 
(N = 152). 

4. Character disorder was defined to include 
diagnoses of personality pattern disturbance— 
inadequate, schizoid, cyclothymic, or paranoid type; 
personality trait disturbance—emotionally unstable, 
passive-aggressive, compulsive personality, or other 
type; and, such sociopathic disturbances as anti- 
social, dyssocial, sexual deviation, alcoholism, and 
drug addiction (V = 279). 

Symptoms, A symptom in the present study, as in 
early studies (Phillips & Rabinovitch, 1958; Zigler 
& Phillips, 1960), refers to the description of a pa- 
tient's behavior by a psychiatrist at the time of 
initial institutional contact or the description of be- 
havior presented by referring physicians as the primary. 
reason for hospitalization. The symptoms include 
specific actions (assault), general behavior patterns 
(irresponsible behavior), thoughts (sexual preoccupa- 
tions), somatic reactions (headaches), and general 
affect states (being tense). Examination of the 793 
case records resulted in the tabulation of 48 discrete 
presenting symptoms. Symptoms which did not occur 
in at least 5% of the individuals included in any one 
of the four diagnostic categories were eliminated from 
further consideration; their low frequency of occurrence 
made them unsuitable for the type of statistical treat- 
ment employed. Employment of this criterion resulted 
in the following 13 symptoms being excluded from 
Consideration; addictions other than drinking, delirium 
tremens, fearful, memory impairment, amnesia, black- 
outs, convulsions, eats a lot, charged with murder, fire 
setting, feels going crazy, feeling of sexual inadequacy, 
and compulsions, 

Of the 35 symptoms employed in the present study, 
31 had previously (Phillips & Rabinovitch, 1958; 
Zigler & Phillips, 1960) been placed in the category of 
self-deprivation and turning against the self, self- 
indulgence and turning against others, or avoidance of 
others. Prior to the analyses conducted in this study, 
the four remaining symptoms were classified in the 
following manner: Psychosomatic disorders, phobias, 
and obsessions were placed in the self-deprivation and 
turning against the self category, and depersonalization 
was placed into the avoidance of others category. This 
Categorization of the four symptoms appears to be most 
ìn accord with the conceptual framework underlying 
the symptom categories. 


RESULTS 


In order to assess the number of symptoms 
appearing in each of the four diagnostic 


categories and the frequency with which the 
various symptoms appeared in each of these, 
the percentage of individuals in each category 
who manifested each symptom was computed. 
These percentages, as well as the percentage 
of the total sample manifesting each symptom, 
are presented in Table 1. Symptoms in this 
table are arranged in the order of their fre- 
quency of occurrence in the total sample. 

As can be seen in Table 1, of the 35 symp- 
toms, 30 appear in the manic-depressive, 34 
in the character disorder, and all 35 in both 
the psychoneurotic and schizophrenic cate- 
gories. 


TABLE 1 
PERCENTAGE OF INDIVIDUALS IN TOTAL SAMPLE 
AND IN EAcH DIAGNOSTIC CATEGORY 
MANIFESTING EACH SYMPTOM 


Total | me Pay- | Chari lg ia 
Hos | De- | nei: | Dis- EE 
Symptom ital | pres- | rotic | order 

N =| sive rz (N = LU 

793) |A | 152) | 279) | gS 
Depressed 38 | 64 | 58 | 31 | 28 
Tense 37 | 32 | 46 | 33 | 36 
Suspiciousness 35-255] 36, AOS 
Drinking 19 | 17 | 14 | 32 8 
Hallucinations 19 | 11 4.12] 35 
Suicidal attempt 16 | 24 | 19 | 15 | 12 
Suicidal ideas 1571297] 23^] 15 8 
Bodily complaints 155152131 21. 5:11:19 
Emotional outburst 14 | 17 | 12 | 18 9 
Withdrawn 14 4 | 12 7| 25 
Perplexed 14 9 9 8 | 24 
Assaultive 12 5 6 | 18 5 
Self-depreciatory 12 | 16 | 16 8 | 13 
Threatens assault 10 4| 11} 14 7 
Sexual preoccupation 10 9 9 6 | 14 
Maniacal outburst 9 | 11 6 7| 12 
Bizarre ideas 9/7 13 1 2| 20 
Robbery 8 0 3 | 18 3 
Apathetic 8 8 8 4 | 11 
Irresponsible behavior 7 3 7 9 7 
Headaches 6 7|10| 4 5 
Perversions (except 5 0 5 | 10 2 

homosexuality) 
Euphoria SOLI, 2 2 5 
Fears own hostile im- 5 4 9 5 2 
'ulses 
Mood swings 5. 9 5 4 4 
Insomnia edhe 1|. 29 [79 
Psychosomatic dis- 4 7 6 3 5 
orders 

Does not eat 4 9 4 2 4 
Lying 3 0 1 7 0 
Homosexuality 3 3 3 8 2 
Rape 3 0 3 8 1 
Obsessions 3 8 3 1 4 
e a cd eg sc aen 
Feels ert 
Phobias 2 4 5 0 2 
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TABLE 2 
SYMPTOMS TENDING TO JOINTLY Occur or Nor TO Occur WITH EACH OF THE DIAGNOSTIC CATEGORIES 
Manic-Depressive Psychoneurotic Character Disorder Schizophrenic 
Symptoms Suicidal attempt | Suicidal ideas Perversions (except Suspiciousness 
which tend | Suicidal ideas Bodily complaints homosexuality) Perplexed 
to occur Euphoria Tense Drinking Bizarre ideas 
Depressed Headaches Robbery Hallucinations 
Insomnia Depressed Assaultive Sexual preoccupation 
Obsessions Fears own hostileim- | Threatens assault Apathetic 
Does not eat pulses Emotional outburst Withdrawn 
Phobias Lying Depersonalization 
Homosexuality Feels perverted 
Rape Maniacal outbursts 
Bodily complaints 
Symptoms Withdrawn Suspiciousness Suspiciousness Perversions (except 
which tend Robbery Perplexed Perplexed homosexuality) 
not to occur Bizarre ideas Bizarre ideas Drinking 
Hallucinations Hallucinations Threatens assault 
Robber: Sexual preoccupation Emotional outbursts 
Assaultive Apathetic Lying 
ithdrawn Rape 
Depersonalization Suicidal attempt 
Feels perverted Suicidal ideas 
Euphoria Depressed 3 
Bodily complaints Fears own hostile im- 
Selí-depreciatory pulses 
Depressed 
Phobias 
Obsessions 


The chi square test of independence was 
employed to evaluate the tendency of each of 
the 35 symptoms either to occur or not to occur 
in each of the four diagnostic categories. 
Thus, 140 comparisons were made. The fre- 
quencies entered into the cells of each of these 
2 X 2 contingency tables were the number of 
individuals in a particular diagnostic category 
who exhibited or did not exhibit a particular 
symptom vs. the number of individuals who 
did or did not exhibit that symptom in the 
other three diagnostic categories combined. 
Since the .05 level of significance was selected, 
7 statistically significant relationships would 
be expected due to chance alone. Acutally, 
67 such relationships were found. These find- 
ings are summarized in Table 2, which lists 
those symptoms whose frequency of occurrence 
in a particular diagnostic group differed sig- 
nificantly from the frequency of occurrence 
of the symptoms in the other three diagnostic 
groups combined. 

While this last analysis discloses which 
symptoms are significantly related to the 
particular diagnostic categories, it does not 
reveal the extent of the relationships. Further- 


more, this analysis tends to be misleading in 
that in many instances where a significant 
relationship exists between a symptom and a 
diagnostic category, that symptom appeals 
with surprisingly high frequency in the other 
diagnostic categories. In order to assess the 
degree of relationship between the symptoms 
and the diagnostic categories, contingency 
coefficients were computed for each of those 
contingency tables which resulted in a 9& 
nificant chi square? The results of this analysis 


"The contingency coefficient was chosen in prefer- 
ence to the phi coefficient because of the extremely 
uneven split which prevailed between occurrence ani 
nonoccurrence for almost all of the symptoms. Since # 
contingency coefficient based on 2 X 2 tables has à 
maximum value of approximately .70, the coefficients 
reported in this study represent a conservative estimate 
of the relationships obtaining between symptoms 4” 
diagnostic categories. It should further be noted that 
the contingency coefficient is significant at the same 
level reached by the chi square from which it w% 
computed. Thus, all contingency coefficients report 
are significant at least at the .05 level. The contingeno 
coefficient can only be positive in value. Nevertheless 
they are reported here as negative in the instance? 
where the relationship between symptom and diagnos 
is such that the symptom tends not to occur wit 
that diagnosis, 


A. 
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are presented in Table 3, in which symptoms 
are grouped according to the three symptom 
categories discussed earlier. 

In addition to indicating the degree of re- 
lationship which obtains between each symp- 
tom and each diagnostic category, Table 3 
shows the nature and extent of the relation- 
ships between the symptom categories and 
the major diagnostic groups. In those in- 


TABLE 3 


MAGNITUDE OF THE RELATIONSHIP BETWEEN 
SvwProMs AND DIAGNOSES 


ü Char- 
Manic-  psycho.| acte i 
Symptom Depres-l neurotic] Die lputeate 
order 
Category 1 
Self-deprivation and 
turning against the 
self: 
Suicidal attempt «07 —.07 
Suicidal ideas .13| — .10 —.16 
Euphoria 19) —.08 
Bodily complaints .09| —.20| 10 
Tense -09) 
Headaches -09) 
Self-depreciatory —.09 
Depressed A7  .19| —.11| —.16 
Fears own hostile im- +10) — .08 
pulses 
Mood swings 
Insomnia - 08} 
Psychosomatic dis- 
order 
Phobias .10| — .09 
Obsessions .09 —.10) 
Does not eat .09 
Category 2 
Self-indulgence and 
turning against 
others: 
Maniacal outbursts .08 
Perversions (except +16) —.11 
homosexuality) 
Drinking .24| —.20 
Robbery —.08| —.09| 27) 
Irresponsible behavior 
Assaultive —.09 .12 
Threatens assault .11| —.08 
Emotional outburst +10) —.10 
Lying 19) —.11 
Homosexuality ` .08) 
Rape 18} —.09 
Category 3 
Avoidance of others: 
uspiciousness —.19} —.27  .42 
Perplexed —.07| —.13 21 
Bizarre ideas —.13| —.18| .27 
Hallucinations —.18| —.13| .30 
Sexual preoccupation —.09) = .11 
A Bede —.09 .09 
ithdrawn —.09 —.15 .24 
Depersonalization -.di| ^ 15 
Feels perverted —.08} .10 


stances where a significant relationship holds 
between a symptom and a diagnosis, the degree 
of relationship, as estimated by the contingency 
coefficient, tends to be quite small. In 33 of 
the 67 instances where a significant relation- 
ship occurs, the absolute magnitude of the 
contingency coefficient is .10 or less. In only 
one instance is there a coefficient greater than 
30. 

These significant but relatively small con- 
tingency coefficients indicate that while there 
is a relationship between the two sets of cate- 
gories, both category systems retain a con- 
siderable amount of independent variance. 
The nature of the relationship here seems 
quite clear. The self-deprivation and turning 
against the self category is positively related 
to both the manic-depressive and psycho- 
neurotic categories but negatively related to 
the other two. Self-indulgence and turning 
against others is positively related to the 
character disorders but negatively related 
to the other three diagnostic groups. Avoidance 
of others is positively related to the schizo- 
phrenic category but negatively related to 
the other three. 


DISCUSSION 


A certain degree of relationship has been 
discovered between symptom manifestation 
and diagnosis. However, the most striking 
finding of the present study is that the mag- 
nitude of these relationships is generally so 
small that membership in a particular diag- 
nostic group conveys only minimal informa- 
tion about the symptomatology of the pa- 
tient. One is faced with the perplexing finding 
that the occurrence of a wide variety of symp- 
toms may be related to more than one diag- 
nostic category. These data are essentially in 
agreement with those of earlier studies 
(Freudenberg & Robertson, 1956; Witten- 
born et al., 1953). Since the basis for diagnostic 
classification is ostensibly symptom mani- 
festation, the question arises as to why such 
classification has been found to be reliable in 
spite of the fact that symptoms tend to occur 
with surprisingly comparable frequency across 
diagnostic groupings. 

One possible answer is that although single 
symptoms may not grossly differentiate diag- 
noses, configurations of symptoms do so 
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discriminate. In opposition to this view is 
the finding of Freudenberg and Robertson 
(1956) that complex combinations of symp- 
toms do not lead to greater differentiation 
between diagnoses. Another possibility is 
that symptoms are not sufficiently unitary 
indices. It is open to question whether “with- 
drawal" in the depressive is qualitatively of 
the same nature as "withdrawal" in the 
Schizophrenic, or whether the "suicidal at- 
tempt" of the depressive is the equivalent of 
the histrionics of the psychopath. While cer- 
tain investigators (Wittenborn et al., 1953) 
have attempted to meet this deficiency by 
dimensionalizing symptoms, the criticism 
advanced above has nevertheless been leveled 
at even these efforts (Freudenberg & Robert- 
son, 1956). 

In relation to this problem, one is reminded 
of the position taken by Thorne (1953) that 
the experienced clinician diagnoses not on 
the basis of textbook symptoms but primarily 
on the basis of “the feel of the case." It may 
be that the diagnostician is responding to 
finer gradations of those discernible be- 
haviors which are labeled symptoms than are 
defined by the classical indices of particular 
diagnoses. In addition to this, the clinician 
may be responding to some global configuration 
which transcends symptomatology or to 
criteria which lie beyond the domain of 
psychiatry proper. This possibility receives 
some support from the finding of Hollings- 
head and Redlich (1958) that diagnosis 
is related to social class membership. It re- 
ceives further support from the finding of 
Zigler and Phillips (1961a) that diagnosis is 
telated to premorbid social competence as 
measured by variables which both include 
and go beyond those typically employed to 
define social class membership. To the extent 
that such factors influence the diagnostic 
process, it suggests that psychiatric classifica- 
tion is based not on pathological manifestations 
alone but on the clinician's evaluation of the 
over-all maturity level of the patient. This 
view is supported by both the clinical folklore 
and the systematic evidence which characterize 
the manic-depressive and the psychoneurotic 
as mature, of higher socioeconomic class, 
phallic, etc. In contrast, schizophrenics and 
'character disorders are typically conceived as 


immature, regressed, of lower socioeconomic ^ 
class, oral, etc. } 
Another finding of the present study is 
that there is a positive relationship between | 
the symptom clusters proposed by Phillips 
and Rabinovitch (1958) and psychiatric 
diagnosis. Thus, the self-deprivation and 
turning against the self symptom cluster is- 
related to both the manic-depressive and 
psychoneurotic categories. The self-indulgence 
and turning against others and the avoidance. 
of others clusters are positively related to 
the character disorder and schizophrenia 
categories, respectively. Since psychiatric 
classification is related to these global role | 
orientations also, it may be that we have here 
another basis for diagnosis which is inde 
pendent of the specific pathological behaviors 
manifested. | 
The finding of a relationship between the | 
Phillips and Rabinovitch (1958) symptom 
clusters and conventional psychiatric diagnoses 
is also relevant to the heterogeneity issue | 
raised here earlier. It has been asserted (Rotter, 
1954) that symptoms are completely chance 
occurrences, unrelated to the “deeper” or 
more “dynamic” aspects of individual func- 
tioning. To find this view in an area which 
so heavily emphasizes determinism is some 
what surprising, and it has been refuted at 
some length (Eysenck, 1953; Jenkins, 1953; 
Phillips & Rabinovitch, 1958; Wittenbom 
et al., 1953). Nevertheless, if heterogeneity 18 
defined simply as the number of symptoms 
which appear in any diagnostic category, the 
findings of the present study would support 
the view that our present diagnostic categories 
are heterogeneous in native phenomena. AN 
alternative solution to the heterogeneity- 
homogeneity problem is, however, possible. | 
Homogeneity may be defined in terms of the 
imposition of some organizing principle 
otherwise discrete and apparently random 
pathological events. This is merely to assett 
that homogeneity is a construction of thé 
observer or classifier rather than a quality 
which inheres in symptomatic behavior , 
The Phillips and Rabinovitch symptom ^ 
clusters represent just such an attempt a | 
conceptual classification, and they, thus | 
provide a means for viewing diverse patho- 
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logical behaviors as basically homogeneous in 
nature. 


SUMMARY 


This study was directed towards an exami- 
nation of the relationship between symptom 
manifestation and inclusion in a particular 
diagnostic category. The study also investi- 
gated the nature of the relationship between 
the present diagnostic system and the system 
of psychiatric classification recently advanced 
by Philips and Rabinovitch (1958). The 
most striking finding of the present study is 
that although relationships exist between 
symptoms and diagnoses, the magnitude of 
these relationships is generally so small that 
membership in a particular diagnostic group 
conveys only minimal information about the 
symptomatology of the patient. A relation- 
ship was also found between the Phillips and 
Rabinovitch symptom categories and the 
conventional psychiatric classification. These 
findings were discussed in relation to the issues 
of reliability and heterogeneity. 
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THE INTERPRETATION OF OPINION STATEMENTS AS A 
FUNCTION OF MESSAGE AMBIGUITY 
AND RECIPIENT ATTITUDE 


MELVIN MANIS! 
University of Michigan and Veterans Administration Hospital, Ann Arbor, Michigan 


function of the objective stimulus 
situation and characteristics of the 
behaving organism (e.g., motivation, set, 
attitude, etc.). Starting from this premise, 
perception theorists have suggested that in 
ambiguous stimulus situations, the lack of 
environmental constraint may serve to magnify 
the importance of organismic variables. The 
present study was designed to test this hy- 
pothesis within a communication setting; 
more specifically, the major goal of the study 
was to assess the proposition that in interpret- 
ing an ambiguous statement of opinion, the 
average person would be more strikingly 
influenced by his own views than he would be 
when interpreting a nonambiguous statement. 
This study also represents an attempt to 
replicate results previously obtained by the 
author. In an earlier experiment (Manis, 1960), 
subjects were presented with a series of 
opinion statements designed to explain the 
writers’ views on college fraternities; after 
reading each message, subjects were asked 
to indicate the writer’s probable attitude 
toward fraternities by predicting how the 
writer would have described fraternities on a 
series of evaluative rating scales. The results 
indicated that there was an orderly relation- 
ship between the subjects’ predictions and 
their attitudes concerning fraternities. In 
responding to both profraternity and anti- 
fraternity messages, neutral subjects in- 
terpreted the communicator’s stand as being 
less extreme than did those who were com- 
mitted to either the “pro” or the “anti” 


Bu: is often assumed to be a joint 


1 All statements are those of the author and do not 
necessarily represent the opinions or policy of the 
Veterans Administration. 
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position. In responding to neutral messages, 
there was a direct linear relationship between 
the subjects’ attitudes and their predictions: 
those who favored fraternities thought that 
the neutral communicators were more pro- 
fraternity than did those who opposed them, 
while the neutral subjects occupied an in- 
termediate position. 


METHOD 


In the first part of the experiment, subjects rated 
the concept college fraternities, as well as a series of 
“filler” concepts, on six evaluative scales of the semantic 
differential (Osgood, Suci, & Tannenbaum, 1957). The 
following scales were used: kind-cruel, valuable- 
worthless, honest-dishonest, fair-unfair, nice-awful, 
and good-bad. Attitudes toward fraternities were 
quantified by assigning a value of 1 to the “good” end, 
7 to the “bad” end of each scale, and summing across 
scales. 

Following the attitude assessment, subjects were 
presented with a series of 18 messages concerned with 
the fraternity issue;? 6 of the messages favored fra- 
ternities, 6 opposed them, and 6 were relatively neutral. 
Subjects were asked to predict how each writer had 
described fraternities on a series of evaluative rating 
scales (see above), which were presented below each 
message. A measure of each communicator’s judged 
position on the attitude continuum was obtained by 
summing the subjects’ predictions across scales. 

The messages were assembled in a fixed sequence 
which controlled for possible order effects by spacing 
the messages of each type throughout the series; context 
effects were also controlled by making certain that 
messages of a given type (profraternity messages, for 
example) appeared in a variety of settings. Thus, of 
the six pro messages, two were preceded by anti 
messages, two by neutral messages, and two by pro 
messages; a similar plan was employed for the other 
types of statements. 


Ambiguity 
Messages were made ambiguous by systematically 
deleting every other word of the original statement. The 


? The experimental messages were drawn from a pod! 
of such passages written by undergraduate students at 
the University of Pittsburgh. Some of the passage 
had been used in an earlier study (Manis, 1960). 
messages were adapted so as to appear that they had 
been written by Michigan students. 


ws 


oe 


AMBIGUITY, ATTITUDE, AND INTERPRETATION OF STATEMENTS 77 


following passage illustrates the effect of the ambiguity 
manipulation on one of the profraternity statements: 


I ——- to ----- fraternities ----- because —--- the 
-—-- education ----- can ----- a —--- student. ——- are 
—- many ----- events ----- group —— that ----- mem- 
ber ----- a ----- or ----- must ----- into --— he —- 
probably ----- enter ----- were ----- not ——. I -— 
also -—- since ----- fraternity ---- as ——- group, 
---- gives ----- the ----- of ---— with ----- learning 
----- live ----- the ----- different ----- within ----- 
group. 

---- can ----- help ----- student ----. They —-- 
----- organize ----- groups ----- make - 


—— the ----- who ----- having ----- with —--- grades. 


For half of the subjects, the deletion procedure was 
applied to the first nine messages of the experimental 
series; for the other half, deletion was applied to the 
fast nine messages. This permitted each subject to 
serve as his own control in evaluating the effects of 
ambiguity, while taking account of order effects and 
chance differences between the messages in the first 
and second halves of the test booklets. 


Subjects 
The subjects were 66 male undergraduate students 
at the University of Michigan, who voluntarily par- 


ticipated in the study. The data which are reported 
below were obtained from three selected groups: those 


who were extremely profraternity, those who were 
extremely antifraternity, and those who were relatively 
neutral. There were 10 subjects within each of these 
attitude groups; 5 subjects in each group had received 
the ambiguous (deleted) messages in the first half of 
the experimental series, and 5 had received them in the 
last half. 


RESULTS 


Figure 1 presents the mean predictions of 
the three recipient groups to the pro, neutral, 
and antifraternity messages (predictions that 
are high on the ordinate reflect a belief that 
the communicator disliked fraternities); each 
data point represents the averaged results of 
six messages. The results obtained under 
ambiguous and nonambiguous conditions are 
shown separately. 

Before analyzing the data statistically, 
each subject’s performance was described in 
terms of his average response to each of the 
six situations presented in the experiment 
(two levels of ambiguity X three types of 
messages = six stimulus situations). The 
data obtained from the three types of messages 
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Fic. 1. Mean predictions as a function of recipient attitude and message ambiguity. 
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were then analyzed separately, using Lind- 
quist’s (1953) Type III design. The “between 
effects” were the subjects’ attitudes and the 
order in which the ambiguous and unam- 
biguous messages were presented (ambiguous- 
unambiguous vs. unambiguous-ambiguous) ; 
the “within effect” was the ambiguity variable 
itself. 

The introduction of ambiguity produced a 
displacement of the subjects’ predictions 
toward the midpoint of the scale. For the 
profraternity and the antifraternity messages, 
this effect was significant at the .005 and .001 
levels, respectively; for the neutral messages 
p < 45. Contrary to prediction, ambiguity 
did not lead to increased differences between 
the predictions of the various recipient groups, 
for the curves representing the ambiguous 
and unambiguous messages are essentially 
parallel (see Figure 1); similarly, the attitude 
X ambiguity interactions were all nonsignifi- 
cant. 

In responding to each of the message types, 
the attitudes of the recipients significantly 
affected their predictions (p < .025). To 
evaluate the form of the relationship between 
attitude and message interpretation, the 
analysis of variance for trends was applied 
(Grandage, 1958; Grant, 1956); since, con- 
trary to prediction, the attitude X ambiguity 
interaction was not significant for any of the 
message types, the trend analyses were ap- 
plied without reference to the ambiguity 
variable, i.e., the data obtained under ambig- 
uous and nonambiguous conditions were 
pooled. For the profraternity messages, the 
quadratic (curvilinear) component was signifi- 
cant at the .01 level; for the antifraternity 
messages, the quadratic component yielded a 
p < .06, and the linear component a p < .05. 
For the neutral messages, both the linear and 
the quadratic components were significant 
(p < .01). When the two curves representing 
the neutral messages were analyzed separately, 
for the nonambiguous statements, the linear 
trend was significant at the .001 level, and 
the quadratic trend at the .10 level; for the 
ambiguous statements, both the linear and 


3 The average attitude scores for the three recipient 
groups were used in calculating the orthogonal coeffi- 
cients. 


the quadratic effects were significant at the - 
40 level. 


DISCUSSION 


As noted above, the ambiguous and the | 
nonambiguous curves shown in Figure 1 
were quite similar in shape; if the ambiguity 
variable had functioned as hypothesized, it 
would have enhanced the differences between 
the interpretations of the various attitude 
groups. The introduction of ambiguity did 
not, however, produce this effect; instead, 
the ambiguous messages were consistently 
displaced toward the midpoint of the scale. 
This may mean that the word-deletion pro- 
cedure served to "tone down" the extremity 
of the statements. It is also possible that the 
subjects were somewhat uncertain and rela- 
tively unwiling to commit themselves in 
judging the ambiguous statements. In either 
case, the present data do not support the 
hypothesis that ambiguous statements of 
opinion are more susceptible to attitude- 
related distortions than are monambiguous 
statements. It should be noted that the pre- 
dicted effects could have been superimposed 
upon the observed shift toward the midpoint; 
thus, the occurrence of this shift did not 
eliminate the possibility of obtaining positive 
results. 

These results suggest that it may be neces 
sary to qualify the hypothesis that led to 
the present experiment. For example, it is 
possible that ambiguity does not enhance the 
effects of organismic variables unless the 
decrease in environmental constraint occuts 
in a setting where there are few response 
demands placed upon the individual. In 
the Rorschach test, for example, the subject 
is presented with an ambiguous inkblot and 
asked to indicate anything that it looks like 
or reminds him of. He is, thus, free to respond 
in terms of an almost infinite variety 
response dimensions; his responses may 10 
dicate concern with such diverse areas 48 
dependency, hostility, impulse control, etc; 
and the clinician’s diagnostic impression 1 
largely based on the frequency with which the 
resulting protocol reflects any of these themes: 
In contrast to this “free choice" situation, 
subjects in the present study were “forced” 
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to respond in terms of an evaluative scale, as 
established by the experimenter, and the 
introduction of ambiguity served mainly to 
increase the subjects’ uncertainty as to the 
correct response. Thus, McClelland’s (1958) 
distinction between the subject’s ambiguity 
concerning the response that he is to make, 
and ambiguity in the sense of uncertainty 
in choosing between well-defined responses, 
may be crucial. A related consideration is the 
possibility that ambiguity may amplify the 
effect of organismic variables only if the 
subject is responding primarily in terms of 
fantasy, as in the Rorschach or TAT. In 
these situations, ambiguous stimuli may be 
valuable mainly because they facilitate 
compliance with the examiner’s instructions, 
ie, it would be difficult to obtain fantasy 
productions in response to an unambiguous, 
realistic stimulus. In the present case, on the 
other hand, the subject’s set was for accuracy, 
and it is likely that this approach was main- 
tained under both ambiguous and unam- 
biguous conditions. Finally, it should be noted 
that although the hypothesis which suggested 
this experiment is widely accepted, there 
seem to be few controlled studies in which 
systematic manipulation of stimulus ambiguity 
has been shown to yield the predicted effect. 


Replication Results (Unambiguous Messages) 


The results obtained under nonambiguous 
conditions are in close accord with data re- 
ported previously (see Manis, 1960, Figure 1). 
In both studies, the profraternity and the 
antifraternity statements yielded curvilinear 
relationships between attitude and message 
interpretation; ie. the neutral subjects 
perceived the communicators as occupying 
less extreme positions than did the committed 
subjects! Similarly, the results from both 


* As in the previous study (Manis, 1960), a test was 
performed to see if the results obtained on the profra- 
ternity and antifraternity messages might have been 
attributable to the neutral subjects’ unwillingness to 
use the extremes of the scales, as compared with the 
committed subjects’. The three groups’ ratings of the 
concepts evil and justice were therefore compared; the 
results indicated that the groups did mot differ in their 
willingness to use either the positive or the negative 
extremes. The subjects? willingness to use the extremes 
was also quantified in terms of the extent to which they 


studies indicated that neutral messages were 
interpreted so as to yield a direct linear re- 
lationship between the subjects’ views and 
their predictions.* 

The fact that the two studies were conducted 
in two very different campus settings, one in 
a large city and the other in a campus town, 
suggests that these results may be obtainable 
in a variety of social psychological situations. 
Moreover, the fact that similar results re- 
lating attitude to message interpretation 
were obtained when the messages were “muti- 
lated,” so as to produce ambiguity, indicates 
that these results are probably independent of 
any particular stylistic features that may by 
chance have characterized the original set of 
messages. 

The author has previously discussed the 
relationship between recipient attitude and 
message interpretation in terms of the Hov- 
land, Harvey, and Sherif (1957) theory of 
assimilation and contrast. This theory, which 
relies heavily upon results obtained in psycho- 
physical experiments, asserts that in judging 
opinion statements, the subject may use his 
own views as an anchor, so that when he is 
presented with a message that is in general 
accord with his beliefs, he tends to assimilate 
it toward this standard; if the message favors a 
position that is unacceptable to him, he 
displaces it away from his own stand (contrast 
effect). 

Another interpretation that seems at least 
equally plausible asserts that the distortions 
obtained in this type of experiment mainly 
result from the recipients’ efforts to maintain 
their own beliefs in relatively unchanged 
form, and reduce the communicator’s in- 
fluence potential. This view assumes that 
when the message source is unknown (as it 
was in this study) pressure to change increases, 


deviated from neutrality in responding to the entire set 
of filler concepts; when this measure was used as a 
control variable in an analysis of covariance design, 
the results for the profraternity and the antifraternity 
messages were virtually unchanged. 

5 There was, however, a possibility that the results 
obtained on the neutral messages might have been 
attributable to group differences in the tendency to 
make socially desirable responses; e.g., the profraternity 
subjects might have had a response set to describe 
virtually anything in favorable terms. This hypothesis 
was not supported by an examination of the filler data. 
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COMMUNICATOR- RECIPIENT DISCREPANCY 


Fic. 2. Communicator-recipient discrepancy as a determinant of pressure to change and message distortion, 


when the communicator’s identity is unknown. 


up to a point, as a direct function of the com- 
municator-recipient discrepancy; eventually, 
however, this relationship is reversed, i.e., very 
large discrepancies produce less pressure to 
change than do fairly large discrepancies. 
Thus, when the communicator is anonymous, 
his acceptability may be primarily determined 
by the stand he takes on the experimental 
issue; when his views diverge too widely from 
those of the recipient, he may be regarded as 
biased and unreliable, and may consequently 
be relatively powerless to influence the re- 
cipient’s views (Hovland et al., 1957). 

Figure 2 graphically depicts the hypothe- 
sized relationship between communicator- 
recipient discrepancy and pressure to change, 
and presents some predictions which are 
implicit, if we assume (a) that most messages 
can be interpreted in a variety of ways, and 
(b) that the recipient is motivated to reduce 
the influence potential of incoming messages. 
For example, it can be deduced that with a 
moderate discrepancy between communicator 
and recipient, the message should be assimi- 
lated toward the recipient's own stand; thus, 
Message A is shown as being displaced from its 


veridical position at Ay, to its perceived posi- 
tion at Ay, with a resulting decrease in pressure 


to change. With a fairly large discrepancy ' 


value, influence potential can be most readily 
reduced if the message is displaced to a point 
that is even more discrepant with the T€ 
cipient’s views; accordingly, Message B has 
been displaced from its veridica] position 3 
By, to its perceived position at Bp. Finally, if 
the communicator-recipient discrepancy 95 
sufficiently large, the message should be 
perceived veridically, as shown in the case of 
Message C, since at this discrepancy level, 
veridical interpretations should produce little 
or no pressure to change. 

The present results are in general accord 
with the preceding deductions: 

1. When responding to messages that they 
essentially agreed with, subjects tended 1? 
displace the communicators toward their ow? 
position (This did not, however, produce ? 


ê Statements in this section concerning the displace 
ment of unambiguous messages are based on a C07 
parison of the mean predictions of a given group E 


Subjects and the mean predictions of all subjects com- 


bined. 
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decrease in the perceived discrepancy between 
communicator and recipient for the average 
subject, since in some cases, the communicator 
was displaced to a point beyond the recipient’s 
favored stand.) 

2. When responding to messages that 
deviated moderately from their own stands, 
there was a consistent tendency for subjects 
to displace messages toward the point on 
the scale that they themselves favored; this 
trend was shown by the pro and the anti 
groups in response to the neutral messages, 
and by the neutral group in response to both 
pro and antifraternity messages. 

3. When the subjects responded to state- 
ments that were definitely opposed to their 
own stands (ie, pro’s interpreting anti 
messages and anti’s interpreting pro messages), 
no consistent displacement trends were ob- 
tained, perhaps because veridical interpreta- 
tions of these messages produced little pressure 
to change. 


SUMMARY 


Three groups of college students, who varied 
in their attitude toward college fraternities, 
were presented with 18 written messages, 
designed to explain the writers’ views on this 
issue. There were three types of writers: 
those who favored fraternities, those who 
opposed them, and those who were relatively 
neutral. Each message was presented to half 
of the subjects in its original, unambiguous 
form, and to the other half, in an ambiguous 
form, which was prepared by removing every 
other word. After reading each message, 
recipients estimated the writer’s attitude 
toward fraternities, by predicting how he 
(the writer of the message) would have de- 
scribed fraternities on a series of evaluative 
rating scales. 

The data revealed that there was a 
curvilinear relationship between attitude and 
judgment for both the profraternity and the 
antifraternity messages; in both cases, com- 
mitted subjects perceived the communicators 
as occupying more extreme positions than 
did neutral subjects. The introduction of 


ambiguity did not affect the shapes of these 
curves, although it did lead to a significant 
displacement toward the neutral point. 

The neutral messages yielded an essentially 
linear relationship between attitude and 
judgment; subjects who favored fraternities 
felt that the neutral communicators were 
more profraternity than did subjects who 
opposed them. Again, the introduction of 
ambiguity did not significantly affect the 
shape of the curve, although subjects showed 
some tendency to displace the ambiguous 
messages toward the midpoint of the scale. 

The results obtained with the complete, 
unambiguous messages, are in accord with 
previous research. These findings, relating 
attitude to message interpretation, were 
discussed in terms of the possibility that the 
distortions which were obtained, resulted 
from the recipients’ attempts to minimize the 
influence potential of the messages. 

The ambiguous messages were consistently 
displaced toward the midpoint of the scale; 
contrary to hypothesis, however, ambiguity 
did not lead to increased differences between 
the predictions of the various recipient groups. 
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THE INTERPRETATION OF OPINION STATEMENTS AS A 
FUNCTION OF RECIPIENT ATTITUDE AND 
SOURCE PRESTIGE 


MELVIN MANIS! 
University of Michigan and Velerans Administration Hospital, Ann Arbor, Michigan 


N attempting to account for the repeated 
finding that attitudes often influence 
the interpretations placed upon state- 

ments of opinion, it has been suggested that 
the obtained distortions may result from the 
recipient’s attempts to maintain his own 
views in relatively unchanged form (Kelley, 
1957; Manis, 1961). Thus, given the fact 
that most messages can be plausibly in- 
terpreted in a number of ways, the recipient 
may be expected to select that interpretation 
which is least likely to affect his beliefs. 
On this assumption, there should be virtually 
no distortion in situations where a veridical 
interpretation of the communicator's message 
generates little pressure to change. 

In the present study, this hypothesis was 
tested by presenting a series of opinion state- 
ments in two contexts: in the first experi- 
mental condition, the messages appeared in a 
context which suggested that the various 
authors were people of low prestige; in the 
Second condition, the messages appeared to 
come from high prestige authors. In the low 
prestige condition, the messages were expected 
to have little influence potential and the 
subjects, therefore, were expected to show rela- 
tively little distortion. In the high prestige 
condition, where these same messages would 
doubtless generate considerable pressure to 
change, the recipients! views were expected to 
influence their interpretations to a significant 
degree. 

METHOD 

The present experiment followed the methods 
employed in other studies in this series (Manis, 1960, 
1961). In the first part of the experiment, attitudes 


1 All statements are those of the author and do not 
necessarily represent the opinion or policy of the 
Veterans Administration. 
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were assessed by having the subjects rate the concept 
college fraternities on a series of seven-step evaluative 
scales; the following scales were used: kind-cruel, 
valuable-worthless, honest-dishonest, fair-unfair, nice- 
awful, and good-bad. Attitude scores were derived by 
assigning a value of 1 to the “good” end, and 7 to the 
“bad” end of each scale, and summing across scales, 
Following the attitude assessment, subjects were 
presented with a series of 18 messages, each expressing 
the views of an individual student on the fraternity 
issue (there were 6 profraternity, 6 antifraternity, and 
6 relatively neutral messages); the subject’s task was 
to indicate how he thought the author of each of the 
statements had described fraternities on the above 
mentioned evaluative scales. Each author’s perceived 
stand on the attitude continuum was then assessed by 
summing the subjects’ predictions across scales (as 
above). 

The messages were assembled in a fixed sequence 
that controlled for possible order effects; context effects 


were also controlled by making certain that messages | 


of a given type (profraternity messages, for example) 
appeared in a variety of settings. Thus, of the six “pro” 
messages, two were preceded by anti messages, two by 
neutral messages, and two by pro messages; a similar 
plan was employed for the other types of statements, 

In every test booklet, half the messages representing 
each communicator position (pro, neutral, or anti) 
were preceded by one of three brief statements, depict- 
ing the author as a high prestige student; the other 
messages were accompanied by statements depicting 
the author as a low prestige student. Half the subjects 
received Booklet A, in which messages 1, 3, 5, 8, 10, 
12, 13, 15, 17 appeared within a high prestige setting, 
and the other messages in a low prestige setting; the 
remaining subjects received Booklet B, in which this 
pattern was reversed, i.e. the messages listed above 
were presented in a low prestige setting, and the others 
in a high prestige setting. Table 1 presents the six 
statements that were used to manipulate the prestige 
of the sources; each of these statements was paired with 
three messages—one profraternity, one neutral, and 
one antifraternity. 


Subjects 


The subjects were 65 male undergraduate students - 


at the University of Michigan, who voluntarily pat 
ticipated in the study. The present report concerns 
the data from three selected groups: subjects who were 
extremely profraternity, subjects who were relatively 
neutral, and subjects who were extremely antifraternity. 
There were 10 subjects within each of these attitude 
groups; 5 of the subjects within each group received 
Booklet A and 5 received Booklet B (see above). 


B 
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RESULTS 


Figure 1 presents the mean ratings of the 
three recipient groups in response to the 
three types of messages (pro, neutral, and 
antifraternity). Each data point represents the 
averaged results of six messages. Ratings 
that are high on the ordinate reflect a judg- 
ment that the author disliked fraternities. 
The results obtained under high and low 
prestige conditions are shown separately. 

Before analyzing the data statistically, 
each subject's performance was “summarized” 
by describing it in terms of his average re- 
sponse to each of the six situations presented 
in the experiment (two levels of prestige X 
three types of messages = six situations). 
The data obtained under high and low prestige 
conditions were then analyzed separately, 
using Lindquist’s (1953) Type III design. 

For the high prestige data, there was a 
Significant difference (p < .005) between the 
responses of the three attitude groups. In 
responding to each of the three types of 
messages, the pro's viewed the communi- 
cators as being more favorable toward fra- 
ternities than did the anti's, while the neutral 
Subjects occupied an intermediate position 
(see Figure 1); application of the analysis of 
variance for trends (Grandage, 1958; Grant, 
1956) indicated that over all messages, there 
was a linear relationship between the subjects’ 
own attitudes, and their judgments of others’ 
attitudes, which was significant past the .001 
level? Since the interaction of recipient atti- 
tude and message content did not approach 
Significance (F < 1.00), this linear effect can 
be taken as essentially constant, regardless 
of the message input. 

Under low prestige conditions, the re- 
Sponses of the various recipient groups were 
not significantly different from one another; 
all of the interactions involving recipient 
attitude were also nonsignificant (all Fs « 
1.00), indicating that under low prestige con- 
ditions, regardless of the message input, the 
different recipient groups were in essential 
agreement as to the communicators’ views. 

As a more direct means of comparing the 
results obtained under the high and low 


2 The average attitude scores for the three recipient 
groups were used in calculating the orthogonal co- 
efficients, 


TABLE 1 


INTRODUCTORY STATEMENTS WHICH PRECEDED 
EACH OF THE AL MESSAGES 


High Prestige Sources Low Prestige Sources 


a1. The author of this | 1. Theauthor of this pas- 


is an honors sage was eventually 
student who has shown asked to leave Michi- 
outstanding promise gan because of low 
since arriving at Michi-| grades and repeated 


gan. misconduct. 
2. The author of this | 2. The author of this pas- 
ge is basically sage is basically un- 

stable and mature. stable and neurotic. 
3. The author of this | 3. The author of this pas- 
passage is extremely sage is rather unpopu- 


well-liked and re- ular and is generally 
spected by his fellow considered to be unre- 
students. liable. 


® Messages that were preceded by Statement 1 of the high 
prestige group were paired, in the low prestige condition, with 
Statement 1 of the low prestige set; a similar scheme was followed 
for Statements 2 and 3. 


prestige conditions, a difference score was 
constructed for each subject, by subtracting 
the responses he produced under high prestige 
conditions from the corresponding responses 
which he produced under low prestige con- 
ditions; e.g., subject's average response to 
the profraternity statements, from high pres- 
tige sources, was subtracted from his average 
response to the profraternity statements from 
low prestige sources. The resulting difference 
scores were all raised to a positive value by 
adding a constant, and were then subjected 
to an analysis of variance, again using Lind- 
quist’s (1953) Type III design. 

Table 2 summarizes the results of this 
analysis; as may be seen, over all messages, 
the magnitude of the difference scores tended 
to vary from one attitude group to the next 
(p < .07). Application of the analysis of 
variance for trends (Grandage, 1958; Grant, 
1953) indicated that this main effect was 
primarily attributable to a linear relationship 
(p < .05) between the subjects’ attitudes and 
the difference score. In general, regardless of 
the message content, subjects who favored 
fraternities tended to assume that the high 
prestige communicators would be more pro- 
fraternity than would those of low prestige; 
subjects who opposed fraternities showed some 
tendency to reverse this assumption (they 
seemed to feel that if there was a difference, 
the low prestige communicators were more 
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RECIPIENT ATTITUDE 


Fic. 1. Mean predictions as a function of recipient attitude and prestige of the author. 


profraternity than the “highs”), while the 
neutral subjects typically fell somewhere 
between these two extremes. 

The linear effect just described indicates 
that the regression lines relating attitude to 
message interpretation varied as a function 
of the communicator’s prestige; as may be 
seen in Figure 1, regardless of the communi- 
cator’s views, under the high prestige con- 
dition, the regression line had a steeper slope 
than it did under the low prestige condition, 
where the slope was not significantly different 
from zero. 

Apart from the results attributable to 
recipient attitude, the significant Fs which 
are reported in Table 2 (Booklet A vs. Booklet 
B, and the interaction of Test Booklet X 
Message Content) are relatively unimportant, 
since they reflect differences mainly due to 
the experimenter’s inability to assemble 
alternate test booklets that were truly equiva- 
lent. 


DISCUSSION 


The overall results of this study can be 
summarized succinctly: When the exper » 
mental messages were presented as coming 
from high prestige sources, subjects tended 
to displace the communicators toward the 
end of the scale that they themselves favored; 
when these same messages were presented as 
coming from low prestige sources, there was 
no consistent relationship between the sub- 
jects’ attitudes and their interpretations of 
the messages. [ 

These findings support the hypothesi 
that message distortion often results from the 
recipient’s attempts to reduce the communi- 
cator's influence potential; i.e, it may 
reasonably be assumed that given any ap- 
preciable communicator-recipient discrepancy; 
veridical interpretations would have resulted 
in more pressure to change in the high prestig? 
condition than in the low prestige condition 
In the high prestige settings, there might, 
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TABLE 2 
ANALYSIS OF VARIANCE FOR 
DIFFERENCE SCORES 
(Difference Score = “Low Prestige” Minus 
“High Prestige” Responses) 


Source SS.) lap | SMS SEA 
Between subjects 3,702.23 | 29 
Recipient attitude 530.87 | 2| 265.44 | 3.20 | «.07 
Test booklet (A vs. | 1,144.90 1| 1,144.90 |13.79 | <.005 
B)* 
Attitude X Test 33.26| 2| 16.63 
booklet 
Error (b) 1,993.20 | 24 | 83.05 
Within subjects 5,994.67 | 60 
Message content 237,02 | 2| 118.54 | 1.28] ns 
Attitude X Message 194.46 4 48.62 
content 
Test booklet X Mes-| 760.26 | 2| 380.13 | 4.10 | <.025 
sage content 
Test booklet X 357.28 | 4| 89.32 
Attitude X Mes- 
sage content 
Error (w) 4,445.60 | 48 | 92.62 
Total 9,606.70 | 89 


? The two booklets differed with respect to the specific mes- 
sages that were depicted as coming from high and low prestige 
sources. 


thus, be considerable motivation to distort, 
while there was little to be “gained” by dis- 
tortion in the low prestige settings. 

The fact that the attitude variable did not 
significantly affect the subjects’ judgments 
under low prestige conditions seems somewhat 
at variance with the views of Newcomb (1953), 
and Osgood, Suci, and Tannenbaum (1957). 
These theorists suggest that a more “balanced 
state” results if negatively valued sources 
are perceived as disagreeing with one’s self; 
theoretically then, the recipient should be 
under pressure to either shift his views away 
from those advocated in the message, or to 
displace the message away from his preferred 
stand. It may be, of course, that this predicted 
contrast effect was not obtained because the 
negative sources were not described in suffi- 
ciently unfavorable terms; on the other hand, 
previous research by Hovland and Weiss 
(1951) and Kelman and Hovland (1953) is 
consistent with the present finding, in that 
despite their use of negatively valued sources, 
these investigators did mot find any evidence 
for a “boomerang” effect. Taken together, 
these studies suggest that in many instances, 


opinion statements from low prestige sources 
do not motivate the recipient to magnify the 
discrepancy between his own stand and the 
position that is being advocated. 

It is interesting to contrast the results ob- 
tained under high prestige conditions with 
results obtained when the recipients are 
given no information concerning the message 
source. Previous studies have shown that when 
the communicator is anonymous and his 
views are substantially discrepant from those 
of the recipient (if, for example, the com- 
municator favors fraternities and the re- 
cipient opposes them), the recipient tends 
either to distort the message so as to magnify 
this difference of opinion (Manis, 1960), or to 
interpret the message veridically (Manis, 
1961). In this experiment, on the other hand, 
in the high prestige condition, “opposing” 
messages were typically assimilated toward 
the recipient’s stand. 

How can these varied results be reconciled? 
The available data suggest that when the 
message source is someone whom the recipient 
respects, influence potential increases mono- 
tonically as a function of the communicator- 
recipient discrepancy (Harvey, Kelley, & 
Shapiro, 1957; Hovland & Pritzker, 1957; 
Zimbardo, 1960). Thus, when the source is 
prestigeful, and the communicator and his 
recipient are in substantial disagreement, 
assimilation effects are to be expected, since 
pressure to change would be high, and assimi- 
lation would effectively reduce the communi- 
cator’s influence potential (displacement away 
from the recipient’s position would only lead 
to increased pressure to change). When the 
message source is unknown, however, data 
from attitude change experiments suggest 
that beyond a given point, increases in the 
communicator-recipient discrepancy are ac- 
companied by decreased pressure to change 
(Hovland, Harvey, & Sherif, 1957). Thus, if 
the message source is unknown, and the 
communicator-recipient discrepancy is quite 
large, influence potential can be effectively 
reduced if the statement is displaced to a 
point that is even more discrepant with the 
recipient’s stand (contrast effect), since the 
recipient is likely to regard an anonymous 
communicator with whom he is in marked 
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disagreement, as a somewhat biased and 
unreliable person. When the communicator is 
unknown and his statement is almost totally 
discrepant with the views of the recipient, 
there should be virtually no distortion, since 
veridical interpretation of such a statement 
would produce little pressure to change. 


SUMMARY 


| Three groups of college students, who 
varied in their attitudes toward college 
fraternities, were presented with a series of 
short passages concerning the fraternity issue. 
Half the passages were presented as coming 
from high prestige sources and half from low 
prestige sources. After reading each passage, 
subjects attempted to describe fraternities as 
they had been described by the writer, using 
six evaluative rating scales. 

Under high prestige conditions, regardless 
of the message content, there was a direct 
linear relationship between the subjects’ own 
views and their judgments concerning the 
views of the authors; i.e., those who favored 
fraternities generally felt that the communi- 
cators were more profraternity than did those 
who opposed them, while the neutrals oc- 
cupied an intermediate position. Under low 
prestige conditions, there was essentially no 
relationship between the subjects’ views and 
their judgments. 

These results were interpreted as sup- 
porting the hypothesis that distortion often 
results from the recipient’s attempts to reduce 
the influence potential of incoming messages; 
in the low prestige condition, where veridical 
interpretations would probably generate little 
pressure to change, there was scant motiva- 
tion to distort. 


Metvin Manis 
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INTERRELATIONSHIPS AMONG PERSONALITY VARIABLES, KINES- 
THETIC FIGURAL AFTEREFFECT, AND REMINISCENCE IN 
MOTOR LEARNING' 

MANFRED J. MEIER? 
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ECENT investigations of the interactions 
among personality, learning, and 
perceptual variables (Eysenck, 1955; 

Gibson, 1933; Klein & Krech, 1952; Kohler & 
Dinnerstein, 1947; Köhler & Wallach, 1944; 
Livson & Krech, 1955; Wertheimer & Jack- 
son, 1957; Wertheimer, Levine, & Wertheimer, 
1955) have pointed up the possibility of 
generating and testing hypotheses relating 
behavioral variables hitherto considered in- 
dependent and theoretically disparate. 
Eysenck (1955) extended the Hullian concept 
of reactive inhibition (Iz) to individual differ- 
ences by postulating more rapid generation of 
Ip in hysterics (conversion reactions and 
psychopaths) than in dysthymics (reactive 
depressions, anxiety reactions, and obses- 
sionals). By assuming that the rate of de- 
velopment of kinesthetic figural aftereffects 
(KAE) is a function of generation of Jp and 
personality differences in introversion-extro- 
version, he predicted and found more rapid 
development of KAE in hysterics than in 
dysthymics. 

To account for these data, he postulated a 
cortical correlate of Ig (cortical inhibition) 
such that the generation of Ie corresponds to 
an analogous cortical process which is measur- 
able as the size and persistence of KAE. 

Mein and Krech (1952) had earlier elicited 
differences between neurologically normal and 
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ministration Centers at Tomah and Wood, Wisconsin, 
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Course of the investigation. 
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cortically damaged subjects. Such a finding is 
consistent with Eysenck's (1955) formulations. 
They introduced the "cortical conductivity" 
hypothesis, couched in terms of individual 
differences in the damping and distortion of 
electrical excitation patterns across the 
cerebral cortex upon sensory stimulation. 

Since neither of these studies had simul- 
taneously considered learning, perceptual, and 
personality variables, the necessary empirical 
foundation for extending Iz to individual 
differences and for postulating corresponding 
cortical processes appeared to be incomplete. 
The purpose of this study was to construct 
and test empirically some hypotheses relating 
individual differences in motor learning to 
some perceptual and personality variables. 
More specifically, this investigation sets out 
to examine the interrelationships among 
personality factors as measured by the MMPI 
and psychiatric diagnosis, the magnitude and 
persistence of KAE, and variation in the 
amount of reminiscence in inverted alphabet 
printing. These variables were selected for 
their relevance to converging lines of earlier 
research and theory. 

Five hypotheses were tested: 

Hypothesis I. Individual variation in the 
magnitude of KAE varies negatively with the 
amount of reminiscence in inverted alphabet 
printing. If persons in whom Jp generates 
rapidly and dissipates slowly developed larger 
KAE in Eysenck’s (1955) study, then such 
persons should exhibit low reminiscence. This 
follows from the assumption that rapid dis- 
sipation of Ip relates to high reminiscence; 
conversely, slow dissipation of Ip relates to 
low reminiscence. 

Hypothesis IT. Similarly, individual variation 
in the persistence of KAE varies negatively 
with the amount of reminiscence. 


Hypothesis III. Some personality variable 
or variables, as yet empirically unspecified, 


are related to reminiscence and KAE. 


Hypothesis IV. The size and persistence of 
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KAE vary positively with age and negatively 
with size of vocabulary. 

Hypothesis V. Reminiscence varies nega- 
tively with age and positively with size of 
vocabulary. 


METHOD 


The design required large sampling procedures, 
correlational techniques, and an item-analytic method. 
A wide range of individual differences in the chosen 
variables was sought. 

Subjects. In order to obtain a wide range of variation 
for the required behavioral measures, the sample was 
procured from a psychiatrically mixed and normal 
population at two Veterans Administration Centers in 
Wisconsin. Acceptance of subjects was on a random 
basis from a population meeting the following criteria: 
received no drugs for 2 weeks prior to the investigation, 
had not been treated with EST for 3 months, had no 
known organic pathology of the central nervous system, 
45 years of age or less, right-handed by preference, 
male sex, and Caucasian racial origin by record. One 
hundred twenty-eight male veterans, 64 from each 
center, met these requirements and served as subjects. 

Procedure and measures. Upon arrival for the first 
of two testing sessions, each subject was offered coffee, 
and the situation was structured as research. During 
the first session, groups of four to eight subjects were 
run between 8:30 and 11:45 a.m. The following tests 
were given in a constant order: 

1. Inverted Alphabet Printing. This learning task 
was identical to that used by Kientzle (1946), 
Kimble (1949), and Archer (1954) with minor modi- 

fication. All subjects received 16 30-second trials 
under massed practice, followed by a 5-minute rest 
interval and four postrest 30-second massed trials. 
Group conversation on topics other than the study 
was deliberately encouraged during the rest. The 
difference between the average number of inverted 
letters printed on the last four prerest trials and the 
four postrest trials constituted the measure of the 
amount of reminiscence. 

2. Shipley-Hartford scale. A maximum of 20 
minutes was allowed to complete the Vocabulary 
and Abstraction sections of the Shipley-Hartford. 
The raw vocabulary score was utilized for the study 
as a control variable and as an estimate of intellectual 
capacity. 

3. Minnesota Multiphasic Personality Inventory 
(MMPI). The group form booklet and IBM answer 
sheets were used. Instructions were read simul- 
taneously by the experimenter and subject, The 
“cannot say” response alternative was eliminated. 
Item responses and profiles were retained for analysis. 
Appointments were made with each subject for 

some one-half hour on an afternoon within one week of 
the first testing session. During the second sessions, 
two sets of observations were obtained. 

1. Identifying data. These included birthdate, 
educational level, occupation, and major medical 
complaints. Corroboration of report was gained later 
from the clinical records, 


2. KAE. The procedure outlined by Klein anj a. 
Krech (1952) was used. Three hardwood bars were 
mounted on stable bases and located on wooden 
stands 16 inches in height and 18 inches square. The 
distance between stands was 16 inches so that the 
subjects’s chair could be placed between them. 
The comparison scale for KAE was located on the 

subject’s left and varied continuously from 1 cent. 
meter to 10 centimeters in width. The test and stimula- 
tion objects were situated on the right stand. The test 
object was 1.5 inches wide, and the stimulation object 
was 2.5 inches wide. The blindfolded subject was 
seated in a straight-backed chair with a seat 16 inches 
square and 17 inches from the floor. Instructions were | 
standardized to yield four baseline PSEs by having the 
subject set the comparison with thumb and forefinger | 
of the left hand to the experienced width of the test | 
object on the right. Thirty, 60, and 90-second stimula, 
tion periods on the stimulation object were followed 
by four comparison settings for the test object. Four 
identical settings were made after a 5-minute rest 
period and again after a 15-minute period following ~. 
the final 90-second stimulation period. | 
Two measures of KAE were developed: 
1. Magnitude of prerest KAE, With the average PSE | 
prior to stimulation as a baseline, the decrement in the 
perceived width of the test object was computed foreach 
stimulation period. These measured decrements were 
then transformed into percentage scores expressed rela- 
tive to the baseline before stimulation. An overall 
measure of the magnitude of the KAE was computed by 
summing the percentage scores or each stimulation pe 
riod. The magnitude of KAE was assumed to be a lineat 
function of the amount of tactile stimulation. There 
fore, the percentage decrement in the perceived width 
of the test object following 90-second stimulation was 
divided by three, that following 60-second stimulation — 
by two, and that following 30-second stimulation by 
one. The final measure of magnitude of KAE was the 
algebraic sum of these measures. F 
2. Persistence of KAE following final stimulation 
period. Percentage decrement of the perceived widi 
of the test object was computed against the same ) 
baseline measure for the average PSE obtained by 5 
and 15-minute intervals. The over-all persistence © 
KAE was measured by the mean of the average pet 
centage decrements for both poststimulation settings 


RESULTS AND Discussion 


Since various analyses were necessary p 
test the stated hypotheses, each hypothesis I5 
treated separately here. A number of factors 
prevented the accumulation of all described 
measures for all subjects. Occasionally, ? 
subject failed to arrive for Session II or was 
started on a course of intensive physiological 
treatment between the two testing periods. 
These and other influences resulted in differing 
numbers of individual measures for the 85 
sorted variables. 

Hypothesis I. Individual variation in the 


KINESTHETIC FIGURAL AFTEREFFECT AND REMINISCENCE 89 


TABLE 1 
DISTRIBUTION OF KAE AND REMINISCENCE 
(Rem) Scores 


Magnitude of Persistence of 
KAE KAE 


Measure Rem 

N 100 100 100 
Mdn 4.48 4.92 7.35 
Mn 4.53 5.01 8.14 
Sigma 2.07 3.12 5.84 


magnitude of KAE varies negatively with the 
amount of reminiscence in motor learning. 
The over-all prerest magnitude of KAE was 
correlated with the amount of reminiscence 
(Rem) in inverted alphabet printing. Distribu- 
tions and scattergrams revealed no severe 
departures from the assumptions underlying 
_ application of the Pearson r procedure. Table 1 
resents distribution statistics for these 
is The correlations bearing on this 
and other hypotheses are found in Table 2. 
\The obtained r of —.474 (df = 98; p < .001) 
with age and vocabulary partialed out reached 
an acceptable level of statistical significance 
and supports the prediction. Credence is 
added to Eysenck’s (1955) contention that 
Ir has general significance for individual 
differences in perception and learning. A 
logical inference is that this interrelationship 
may be determined by some other variable. 
Klein and Krech’s (1952) observation that 
brain damaged subjects exhibited larger 
KAE implies a neural process as a possible 
explanation of the relationship between Rem 
and KAE. Measurable manifestations of such a 
neural process might include the rate at which 
the alpha blocking response habituates to 
repeated visual stimulation, conditionability 
of the alpha blocking response to a neutral 
stimulus, and the degree of electroenceph- 
alographic abnormality. Investigation of the 
telationships between these neural variables 
and such behavioral measures as Rem and 
KAE may yield a stronger empirical founda- 
tion for generating more testable and in- 
Cisive neuropsychological constructs. 
Hypothesis II. Individual variation in the 
persistence of KAE varies negatively with 
the amount of Rem in inverted alphabet 
printing. Rem and the over-all poststimulation 
KAE yielded an r of —.375 (df = 98; p < -01) 
with age and vocabulary partialed out. Con- 


firmation of the prediction at an acceptable 
level of significance adds further credibility to 
the general hypothesis of a cortical basis for 
the generation and dissipation of Ip. 

Hypothesis III. Some personality variable, 
as yet unspecified, relates to Rem and KAE. 
Earlier, Eysenck (1955) regarded the rate of 
arousal, size, and persistence of KAE as the 
measurable equivalents of the rate of genera- 
tion, magnitude, and the degree of dissipation 
of Ir. Since operations designed to generate 
and dissipate Ip have emerged in experiments 
involving the evocation of motor responses, it 
was decided to focus upon Rem in motor 
learning as an external criterion to which the 
MMPI data and psychiatric classifications 
might relate. 

This search for personality correlates of the 
dissipation of Iz as measured by the amount 
of Rem in the inverted alphabet printing 
task included the development of two MMPI 
scales that empirically differentiate indi- 
viduals along the Rem criterion, the analysis 
of MMPI configurations for their discrimi- 
nability of high and low Rem scores, and the 
analysis of variation in Rem and KAE as a 
function of psychiatric classification. 

MMPI responses and Rem scores were 
available for a total of 128 subjects. Eight 
MMPI records were eliminated owing to 
evidence of obvious distortions of response 


TABLE 2 
Summary OF CORRELATIONS 


Variable df r ? 


Rem vs. Magnitude of KAE 98 


E «€. 

Rem vs. Persistence of KAE 98 | —.375*|«.01 

R scale A vs. Rem B. 58 | +.279 |<.05 

R scale B vs. Rem A 58 | 4-.251 |«.05 

R scale A vs. Magnitude of 93 | —.133 |» .05 
KAE 

R scale A vs. Persistence of 93 | —.088 |>.05 
KAE 

R scale B vs. Magnitude of 93 | —.122 |».05 
KAE 

R scale B vs. Persistence of 93 | —.086 |>.05 
KAE 

Magnitude of KAE vs. Age 98 | +.080 |» .05 

Magnitude of KAE vs. Vo- 98 | —.260|<.05 
cabular, 

Baert of KAE vs. Age | 98 +.168 |>.05 

Persistence of KAE vs. Vo- 98 | —.181 |>.05 
cabulary 

Rem vs. Age 125 | —.223 |«.05 

Rem vs. Vocabulary 125 | +.282 |«.05 


a Effects of age and vocabulary partialed out. 


TABLE 3 
DISTRIBUTION OF AGE, VOCABULARY, AND REM 
RES FOR GROUPS A AND B 


Group A Group B 
Measure —— — 
M SD M SD 
Age 34.15 6.65 34.18 6.15 
Vocabulary 28.08 5.89 28.33 5.65 
Rem 4.52 2.49 4.48 2.29 


(L scale T score of 75 or higher; F scale T 
score of 80 or higher; K scale T score of 75 or 
higher). The remaining 120 subjects were 
divided into two groups of 60, each equally 
represented by subjects from both hospital 
samples. It was found that one of the groups 
was slightly older, so a few subjects were 
interchanged to balance age. Table 3 presents 
the means and standard deviations for age, 
vocabulary, and Rem scores for the resulting 
Groups A and B. These groups were essentially 
equated for these variables and for sex and 
race by sampling. 

The Rem criterion was dichotomized at the 
median for each group into high and low 
Rem scores. For Group A, the median was 4.48 
with a range from 0.00 to 13.25 Rem score 
units. The Group B median was 4.43 with 
scores ranging from 0.25 to 11.25. 

A separate analysis of the MMPI item 
response frequencies against the dichotomized 
Rem score criterion was run independently 
for each group. This made possible the de- 
velopment of two scales and permitted the 
cross-validation of each scale on the Rem 
scores of the alternate group. Item response 
frequencies for high and low Rem subjects 
in each group were transformed into propor- 
tions. Each item was analyzed for the relation- 
ship of its response-split to the Rem categories. 
Items which differentiated high from low 
Rem scores at the .11 level of significance 
were retained.? 

Table 4 contains distribution data and 
odd-even reliability coefficients for each 
scale, uncorrected and corrected by applying 
the Spearman-Brown formula (12). The 
resultant 7’s (r = .94 for R scale A and .90 


* The items comprised by the two scales and other 
descriptive aspects of this study may be obtained from 
the author upon request or from University Microfilm, 
Ann Arbor, Michigan, for $1.50. 
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for R scale B) are typical of those reported i 
for most personality scales. R scale A con. 
sisted of 76 items; R scale B included 50 
items. Comparison revealed that only 9 items 
were common to both scales. Of these items, 
only 6 were scored in the same direction. To 
determine whether or not the scales were 
correlated, a Pearson r was computed for 
Scales A and B over the entire sample. The 
obtained r = .539 for 118 df confirmed a 
positive relationship despite low item ovalni 

A rough indication of commonality of item 
content between the scales was obtained by 
having three judges, all staff psychologists at | 
the Wood center, check each item on a list of * 
adjectival phrases referring to a wide variety 
of behaviors and self-attitudes. For both. 
scales, the most frequently checked phrases . 
for those item responses made by individuals 
with low Rem scores were lack of self- 
confidence, anxiety, withdrawal from inter- 
personal relationships, schizophrenic ideation, 
lack of motivation, and constriction of affec- 
tive expression. 

Data on the discrimination of high and low 
Rem scorers by both scales appear in Table 5. 
R scale A differentiates between the criterion 
categories at the .001 level (chi square — 21.62, 

1 df) for Group A. Cross-validation of R scale 
A on Group B reveals a chi square of 427 | 
for 1 df, significant at the .05 level. Similarly, | 
R scale B differentiates at the .001 level for q 
Group B (chi square — 22.27, 1 df) and, 
upon cross-validation on Group A, separates 
along the criterion at the .05 level (chi square 
= 4.34, 1 df). Cutting scores for each scale 
were determined by inspection before cross- 
validation was carried out. Once set, they were 
applied consistently. 

The predictive efficiency of the scales on 
cross-validation is supportive of the hy- 
pothesis that some personality variable may 
account for some of the variance in remi- 


" EE. 


TABLE 4 
R SCALE SCORE DISTRIBUTIONS AND 
RELIABILITY COEFFICIENTS 


[: ted. 
Rscale| N Range M SD Ed bart A 
r Even? x 
— |o 
A | 60 | 14-65 | 43.02 | 13.33 | .87 194 
B | 60 | 13-40 | 28.62 | 7.34 | .82 .90 
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^ TABLE 5 
DIFFERENTIATION OF REM CATEGORIES ON THE Basis or MMPI R Scares 
rea | Gating sae Ae o Meo E] FAN 
High Rem Low Rem é High Rem Low Rem a y 

<44—Low 5 24 11 19 

A 21.62 «.001 4.27 <.05 
z44—High 25 6 19 11 
<27—Low 9 21 3:127 

B 434 <.05 22.27 <.001 
>27—High Fie 1B 22 


niscence. In view of the negative relationship 
between amount of Rem and both KAE 
- measures, a negative relationship might be 
anticipated for the R scales and KAE. The ob- 
tained r's are found in Table 2. They all run 
in the predicted direction but do not reach 
the .05 level. Thus, the personality variables 
measured by the R scales do not appear to 
account for variation in KAE. Also, the 
negative correlation between Rem and KAE 
‘is not clearly attributable to personality 
| variables by these data. 
To study MMPI profile patterns for their 
elation to the Rem criterion, the MMPI scale 
res of each subject were coded (Welsh, 
1948). The codes of each group were treated 
4 independently, so that the efficiency of code- 
based predictions could be determined by 
cross-validation on the Rem scores of the 
. alternate groups. 
l By examining a given code for its corre- 
- spondence with a high or low Rem classifi- 
- cation along the criterion, it was possible to 
devise a set of code rules which imi 
— Correct predictions from code to criterion for 
each group. When a code type occurred more 
_ frequently among subjects in one of the 
_ Rem categories selectively, the major char- 
acteristics of the code were translated into a 
Prediction rule. Such judgmental determina- 
tions were continued until the predictive yields 
_ of the codes seemed exhausted. 
— The resultant criterion separation rules for 
1 Groups A and B are found in Table 6. The 
merid data for both sets of code rules 
appear in Table 7. 
The A code rules differentiate between the 
Criterion categories at the .001 level of sig- 
. Rificance (chi square = 11.38, 1 df) for Group 
A. Cross-validation of the A rules by applying 


them to the Group B criterion measures 
resulted in a number of correct predictions 
beyond chance expectation (chi square — 
4.11, 1 df; p < 05). Similarly, the B code 
rules differentiate at the .01 level for Group B 
(chi square = 6.67, 1 df) and, upon cross- 
validation on Group A, separate along the 
criterion at the .01 level (chi square — 6.37, 
1 df). Thus, both sets of code rules hold up 
on cross-validation and maintain levels of 
predictive accuracy comparable to the scales 
with an identical Rem criterion. 

The code rule sets appear to be highly 
consistent and provide a. basis for generating 
some tentative inferences with regard to 
possible personality correlates of reminiscence. 
Examination of the High Rem rules reveals a 
preponderance of codes frequently seen in 
the conversion reactions and character dis- 
orders (3’, 31’, 94’, 49^). Similarly, schizo- 
phrenic code types occur frequently among 
the Low Rem subjects (8 high point codes). 
Between these code groupings, the Sc scale 
score appears to interact with the primary 
elevations in some of the *nonpsychotic" codes 
so that the code type yields a different pre- 
diction, depending on the rank and relative 
elevation. of the Sc score. Similarly, such 
“suppressor effects” seem to occur in 4' codes 
with a primary elevation on the Pd scale 
score when account is taken of the location 
of the Mf scale score in the profile. The sample 
size is insufficient to warrant specific conclu- 
sions about multivariate interactions among 
the MMPI scales in relation to the Rem cri- 
terion. However, by introducing such possible 
interactions into the criterion separations, 
predictive accuracy is increased and main- 
tained on cross-validation. : 

Eysenck (1955) found that “hysterics” 
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TABLE 6 
PREDICTION OF REM CATEGORIES FROM MMPI Copk RULES 
Group A Group B 
Low Rem High Rem Low Rem High Rem 
8 hi; int 9’ high point 8 high point 3' high point 
L4 ipn or 4/9 24' or $4 code unless 8 | 8ranked 4th or higher and} 31’ or 13’ code 
64 or 46 code if either primed T = 60 unless 9’, 3’, 13' | 1234 code combinations 


8 ranked 4th or higher with 
T = 60, unless 13’, 3’, or 
9' high point 

4’ high point unless 5 ranked 
3rd or higher 


ranks 3rd or higher or 
T > 60 

1234, 1237 code combina- 
tions 

13 and 31 codes 

45 codes 

No scales T 2 60 unless 8 


high point or 5 ranks 


9' high point 


is high point 

49 codes unless 8 ranks 
3rd or higher and 
T — 70 

All others 


Sth or higher 94’ and 49’ codes 
4’ high point unless 5 | 42 codes unless 8 ranked 
SERES 3rd or higher 4th or higher 
4’ unless 49, 94, or 45 code | No scales T > 70 unless 
8of T = 70 8 high point 
All others 


TABLE 7 


DIFFERENTIATION OF REM CATEGORIES ON THE 
Basis or MMPI Cope Rures 


Group A Group B 
Rule | Predic- g E 
Pea PERC 
sem MI 
Š X|? |E | gi xt) 2 
High Rem| 23 | 7 2 | 9 
^ teow Rem| 9 | 21 | 1-38/<-001] 45 | 4g [4-11] <.05 
High Rem} 22 | 8 a | 9 
B teow Rem| 11 | 19 | 9| €9| io | 99 [6-67] <.01 


exhibited larger and more persistent KAE 
than "dysthymics" and interpreted this result 
to indicate relatively greater generation and 
slower dissipation of Iz among hysterics. The 
MMPI correlates of reminiscence in the 
present study suggested more rapid dissipa- 
tion of Zp for hysterics in that the conversion 
reaction and character disorder codes were 
associated with High Rem. Schizophrenic 
codes were associated with Low Rem and, b: 
inference, with slower dissipation of Ip . Th 
the MMPI findings imply diagnostic differ- 
ences in opposite directions from those pre- 
dicted by Eysenck for Rem and indirectly 
for KAE. 

Distributions of the Rem and prerest KAE 
scores for the psychiatric groups are shown in 
Table 8. Rem scores are highest for the con- 
version reaction and character disorder 
samples. Schizophrenics exhibit the lowest 
amount of reminiscence. An over-all analysis 


of variance of the Rem scores for the assorted 
groups (F = 7.06 for 5 and 122 df) is signifi- 
cant at the .001 level. Analysis of the Rem? 
scores for dysthymics (anxiety reactions) and d 
hysterics (conversion reactions and character 
disorders) reveals differences (F = 9.25 for 
1 and 35 df) at the .01 level of significance. 
When the sample is divided into psychotic 
and nonpsychotic subgroups, the psychotics) 
show significantly lower Rem scores 
(F = 16.52; 1 and 126 df; p < .001). From 
these and the MMPI data, hysterics appear | 
to dissipate Zz more rapidly than dysthymics, | 
whereas psychotics as a group dissipate Ie 
more slowly. As predicted, these results point | 
up a shift in the direction of psychiatric group 
differences from those reported by Eysenck | 
(1955). However, the data substantiate the 
hypothesized relationship between personality f 
variables and Rem. A | 
Examination of the prerest KAE scores in 
Table 8 indicates mean differences opposite 
to those reported by Eysenck (1955). How- 
ever, variability of KAE within each group 
is considerable, indicating extensive overlap. 
The over-all analysis of variance of | 
scores for psychiatric groups does not allow 
rejection of the null hypothesis at an accept- 
able significance level (F = .743 for 5 and 
93 df). Descriptively, the sample means differ 
in direction from Eysenck, dysthymics show- 
ing higher KAE than hysterics. However, the 
evidence for a relationship between KAE and 
psychiatric classification is not unequivocal. 
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TABLE 8 


DISTRIBUTIONS OF REM AND Prerest KAE Scores 
FOR DIAGNOSTIC CATEGORIES 


Rem KAE 
Category 

N|M|SD| N|M|SD 

| Nonpsychiatric 17 |4.53/2.28| 6 |4.862.35 
Psychophysiologic 9 |4.53]11.93| 8 |2.63)3.22 

| Copwersion reaction 13 |6.71/2.59) 9 |3.62|2.60 
Ay reaction 21 |4.64|1.88| 19 |5.62/2.99 
Chafacter disorder 6 |6.92/1.60| 6 |4.85/4.51 

| Schizophrenia 62 |3.70/1.68| 51 |5.29/3.04 


Hypothesis IV. The size and persistence of 
KAE vary positively with age and negatively 
with size of vocabulary. Table 2 shows the 
critical rs necessary to test the hypothesis. 
Magnitude of KAE and age did not relate 

significantly (r = .08; 98 df; p > .05), nor 
did poststimulation KAE and age (r = .17; 98 

; p > .05). Although the trends suggested 
were in the predicted direction, this segment 
of the hypothesis was not confirmed. Vocabu- 
lary and magnitude of KAE were significantly 
related (r = —.26; 98 df; p < .05) in the 
predicted direction, but persistence of KAE 
and vocabulary were not (r = —.18; 98 df; 
p > .05). Age and vocabulary seem to be 
relatively unrelated to KAE. 

Hypothesis V. Rem varies negatively with 

- age and positively with size of vocabulary. 
. Table 2 reveals correlations at acceptable 

significance levels and in the predicted direc- 
| tion. In view of evidence for the deterioration 
' of cortical structure and function with age 
and the probable cortical correlates of intelli- 
gence, further credence is lent to the hypothe- 
Sis of some central variable in the determina- 
| tion of variation in Rem. Such a variable may 
be independent of personality effects or 
operate in interaction with such effects. The 
data do not afford relevant information for 
_ specific conclusions along these lines, but they 
point up a need for investigation of cortical 
effects on Rem, including the effects of lateral- 
ity, caudality, and size of lesions on Rem in 
motor learning. In addition, electroencephalo- 
graphic measures should be studied for their 
telationships with the behavioral variables 
sampled in this investigation. 

Tn general, the results add empirical support 
to suggestions of Eysenck (1955), Klein and 


Krech (1952), and others that theoretical 
concepts can yield testable predictions of inter- 
relationships among behavioral variables in 
learning, perception, and personality. Exten- 
sion of contructs to encompass individual 
differences in seemingly disparate phenotypes 
appears to be a fruitful endeavor and signifi- 
cant for the development of a general behavior 
theory. For the behaviors treated in this 
investigation, extension of the concept of 
reaction inhibition seems especially promising, 


SUMMARY 


The present study was designed to demon- 
strate some interrelationships among kin- 
esthetic figural aftereffects (KAE), reminis- 
cence (Rem) in motor learning, and personal- 
ity variables. Five hypotheses were tested: 
Individual variation in the magnitude of KAE 
varies negatively with the amount of Rem in 
inverted alphabet printing; individual varia- 
tion in the persistence of KAE varies nega- 
tively with amount of Rem; personality 
variables, as measured by the MMPI and 
psychiatric classification, are related to Rem 
and KAE; the size and persistence of KAE 
vary positively with age and negatively with 
size of vocabulary; Rem varies negatively 
with age and positively with size of vocabulary. 

For a sample 128 mixed psychiatric and 
nonpsychiatric hospitalized male veterans, 
the results largely supported the hypotheses 
under examination. The amount of Rem 
varied negatively with the magnitude and 
persistence of KAE. Two empirically derived 
MMPI scales differentiated High from Low 
Rem scores upon cross-validation, but failed 
to account for variation in KAE. Similarly, 
two sets of MMPI code rules separated along 
the Rem criterion when cross-validated. Rem 

ried with psychiatric classifications as psy- 


‘@otics exhibited lower scores than nonpsy- 


chotics and hysterics showed higher scores 
than dysthymics. KAE failed to relate with 
psychiatric classification at conventional sig- 
nificance levels. Size of vocabulary and age 
varied with Rem and KAE in accord with 
prediction, but the correlations did not reach 
significance for the age and KAE relationship. 

Although some of the variation in Rem could 
be accounted for by MMPI variables and 
psychiatric classification, these measures of 


94 


personality did not relate to KAE. The 
observed relationships between Rem, KAE, 
age, and vocabulary may derive from some 
neural process involving the generation and 
dissipation of 7; . 
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ASSOCIATIVE STRUCTURE AND THE SERIAL REPRODUCTION 
EXPERIMENT! 


JAMES DEESE 
Johns Hopkins University 


Img serial reproduction experiment or, 

as it has sometimes been called, the 
rumor or gossip experiment has long 
heen regarded as a model for the social 
communication of retained information. It 
was so considered by its inventor (Bartlett, 

1932), and it has provided the most important 

experimental tool in the analysis of the phe- 

nomena of rumor made by Allport and 

Postman (1947). The terms, leveling, sharpen- 
ing, and assimilation, used to describe the 

processes at work in producing the changes in 
successive reproductions by different in- 
dividuals have been of great theoretical im- 
“portance both in social psychology and in 
— theories of personality. 

Some recent experimental evidence (Deese, 
1959a, 1959b, 1960) has suggested that the 
‘changes occurring in successive reproductions 
of verbal material as that material moves from 
— one person to another could be predicted by 
—— free association frequencies obtained from the 

population of which the transmitting individ- 

uals are members. The present experiment is 

designed to explore this possibility. Such a 
. demonstration would not lessen the usefulness 
of the concepts of leveling, sharpening, and 
assimilation; it would, however, provide a more 
general framework, associative structure, to 
which these concepts could be related. 

In one experiment (Deese, 19592), the in- 
fluence of associative structure of lists of Eng- 
lish words upon recall was studied. Associative 
Structure was evaluated by computing the 
| average frequency with which every word 
_ ia particular list elicited every other word 
‘in the list as a free associate. An index of 
associative structure so computed for 18 
different lists ranged from 0 to nearly 30%. 
_ This index correlated .88 with the mean cor- 
_ rect recall for these lists. The same index cor- 
-. related negatively with frequency of intrusions 
| Or importations in recall and positively with 
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the tendency for the intrusions that did occur 
to be identical from subject to subject. Thus, 
the index of associative structure predicts the 
tendency for lists of words to remain intact in 
immediate free recall. Furthermore, a second 
study (Deese, 1959b) illustrated that con- 
vergence of associative strength from words in 
the list to particular words not in the list pre- 
dicted very precisely the frequency with which 
the latter words occurred as intrusions in re- 


Thus, if lists of words are associatively re- 
lated they are recalled with greater accuracy. 
Xf the words in the list are unrelated to one 
another fewer correct words occur in recall and 
a larger number of idiosyncratic intrusions 
tend to occur. Also, if the words in the list are 
part of an associative structure that has repre- 
sentative words not in the list, those words 
will tend to occur in the recall of different sub- 
jects, and, thus, recall for such lists will sys- 
tematically change in a particular direction 
for most subjects, j 

These changes that occur between the 
original list and the subjects" recall of it seem 
strikingly similar in nature to the kinds of 
changes described in the serial reproduction of 
meaningful material by Bartlett (1932) and 
others. It remains to be seen whether the 
changes that occur between the original list 
and serial reproduction of it would also mirror 
the associative structure of the original list. 
Tt seems reasonable to suppose that such would 
be the case, since the changes in serial repro- 
duction represent the pooled effects of proc- 
esses going on in different individuals. The 
free association norms themselves are the 

led result of processes in different indi- 
viduals. Thus, the sample values from the free 
association norms should predict the average of 
changes in the transmission of verbal materials 
from one group of individuals to another. The 
present experiment studies the changes that 
occur in the serial reproduction of lis of 
words that vary in their associative struci 
The changes are studied through three t 
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missions, including that from the original 
list to the first subjects. 


PROCEDURE 


A suitable design for an experimental study of serial 
reproduction is difficult to produce. Typically, the 
problem has been studied by the more or less clinical 
procedure of examining the changes that occur in indi- 
vidual protocols as the material is transmitted from 
one subject to another. One of the important contribu- 
tions of the present approach, however, is the idea that 
such changes can be predicted for and examined in sam- 
ples of individuals. Thus, in the present study the ma- 
jor results are based upon central tendencies derived 
from a number of individuals, 

For even a small series of transmissions, the number 
of subjects required for a complete examination of all 
changes becomes very large. It is, as a matter of fact, 
N raised to the kth power, where k is the number of 
transmissions. For only 10 subjects recalling the original 
passage and three transmissions, 1000 subjects would 
be required. Therefore, a compromise had to be made, 
In the present instance 30 subjects recalled the original 
passage. From the 30 recall protocols that resulted for 
each list, 3 protocols were chosen at random. These 
were reproduced and presented for recall to 10 addi- 
tional subjects. Thus, a total of 30 subjects recalled the 
first derived lists. One protocol per list from each of the 
3 groups of first derived lists was then randomly chosen 
to provide stimulus material for the second derived 
lists. Thus, there were three second derived lists pre- 
sented to 10 subjects each; this made a total of 30 sub- 
jects for the second derived lists. A total of 90 subjects 
took part in the recall portion of the experiment; these 
subjects were all undergraduate male students at the 
Johns Hopkins University. 

This design is a compromise, a compromise in which 
a small number of protocols is sampled with a fair 
number of subjects recalling each protocol. It would 
have been possible to sample a larger number of proto- 
cols (with, perhaps, fewer subjects). Such a procedure 
would be suitable for a problem in which it is possible 
to predict stable differences between individual subjects 
in the recall of particular protocols. 

Each subject was presented with 18 lists. One group 
of subjects was presented with the original 18 lists, 
which were 15 words in length, and each of the other 
groups was presented with 18 lists, derived by succes- 
sive recalls from the original lists. The original lists 
were those used earlier (Deese, 1959a) because they 
were known to produce stable differences in associa- 
tive structure and because they are well balanced for 
frequency of usage. The lists themselves are reproduced 
in the earlier paper. 

All lists were presented to the subjects with the 
usual instructions for free recall (Deese, 1959a). Both 
instructions and the lists were presented by tape record- 
ing. The lists themselves were presented at the rate 
of 1.5 seconds per word. Subjects were provided with 
answer sheets on which they wrote their recalls at the 
end of the presentation of each list. There was a brief 
pause between each list, and all lists were presented in 
one session. Subjects were tested in groups of two, Five 


different randomizations of list order were used to 

minimize cumulative transfer effects, though such 

effects have not yet been detected in studies of free re 
call using numbers of lists. 

For each of the original lists an index of associative 
structure was computed by finding the mean percentage 
frequency with which every word elicited every other 
word on the list. These indexes were based upon associa- 
tion norms for all words used in the experiment, and 
these norms were obtained from a sample of 50 under- 

luate males at the Johns Hopkins University, Care 
řas taken to make sure that no subject served in both 
parts of the experiment. 

These indexes of associative structure computed 
from this sample ranged from 0.1% for one list to 28.2% 
for the highest. These differed only very slightly from 
the values obtained in an earlier experiment on Univer | 
sity of California undergraduates (Deese, 1959a). t 


RESULTS 


The important results of the experiment 
consist of (a) a comparison of the words re 
called at each reproduction with the associa: 
tive structure of the original lists and (b) the 
nature of the changes in recall at each repro- 
duction. 

A comparison of the number of words cot 
rectly recalled at the first reproduction and 
the associative indexes for the various lists 
provides a repetition of the data obtained in 
an earlier experiment (Deese, 1959a). The 
appropriate correlations in Table 1 indicate 
that the basic results of the earlier experiment 
are confirmed. There is a high and statistically 
significant correlation between the number of 
words correctly recalled in the first reproduc 
tion and the index of associative structure: 
There is a slight discrepancy between the pre 
vious and present data on the mean number o 
words correctly recalled for all lists, but this 
no doubt reflects a difference in the populations 
tested in the two experiments. a 

The only serious discrepancy from the orig 
nal study is in the correlation between the 
number of intrusions into original recall and 
associative structure. In the original exper 
ment this correlation was significantly nega- 
tive. In the present data this correlation § 
negative but not statistically significant. 

Table 1 also reveals that the same pattern of 
results holds for both derived recalls. There 
are positive correlations of about the same 
magnitude for recall of these lists, when sco? 
against the original lists, and associative struč 
ture. Likewise, there are low negative correla- 
tions of intrusions, compared with the righ” 
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nal lists, and associative structure. As is to be 
expected, Table 1 shows that recall scored 
against the original lists deteriorates with suc- 
cessive reproductions with, perhaps, greater 
change occurring between the original re- 
production and the first derived reproduction 
than between the first and second derived re- 
productions. Likewise, the number of intru- 
- ions increases slightly through each successive 
reproduction. Y 
s An important result is revealed by the least 
squares slope and intercept constants for the 
regression of recall scores upon associative 
structure for each reproduction. The slopes of 
the linear regressions are practically the same 
for original and both derived reproductions 
and well within the standard error of estimate. 
Thus, we must conclude that, irrespective of 
the inherent organization of any list, the drop 
| in correct recall from reproduction to repro- 
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duction is a constant for all lists. All of the 
advantage in reproducing the original of the 
more highly structured lists occurs in the 
initial reproduction. This advantage is merely 
passed on from reproduction to reproduction. 

Table 2 presents the percentage of correct 
words for each list on each reproduction when 
scored against (a) the original list and (b) 
against the list actually presented to the sub- 
ject. In the case of the first reproduction this 
latter figure is the same as the former, but for 
the other two successive reproductions it is the 
derived list presented to those subjects against 
which the reproductions are scored. The major 
interests in these data are in (a) the absolute 
percentage correct at each reproduction and (6) 
the correlation between percentage correct and 
associative structure. For scoring against the 
original list, of course, the correlations are 
those presented in Table 1. 


TABLE 1 
SUMMARY OF RELATIONSHIPS BETWEEN CORRECT RECALL AND THE INDEX OF ASSOCIATIVE STRUCTURE 
Mean Number Correlation ^ Correlation? Intercept? 
Correct Correct vs. Intrusions vs. Correct vs. Slope? Correct 
Against Mean Number Association Association Association vs. Association 
riginal) of Intrusions Structure Structure Structure Structure 
Original recall 7.89 0.63 .675 —.207 7.42 .058 
__ First derived recall 5.99 0.64 -706 —.109 5.49 .062 
__ Second derived recall 4.98 0.88 -700 —.216 4.63 044 
5 
® Correlation of 485 significant at approximately .01 level. 
P Least squares. 
TABLE 2 


INDEX OF ASSOCIATION STRUCTURE AND PERCENTAGE CORRECT RECALL SCORED AGAINST ORIGINAL AND 
PRESENTED Lists FOR ALL Lists 


3 Ti e Cor- Percentage Cor- Percentage Cor- 
Tuto SA. rdstion À EET fato A eot relation 4 RC uum eun i SESS ‘Aeainat y 
DM Ses (a Ean ETE) (nd derived) (st Derived) (2nd derived) 
H 3.3 45.3 32.0 31.3 73.8 83.5 
2 0.8 31.3 20.0 18.0 63.0 94.8 
3 4.8 41.3 36.7 29.3 88.9 84.5 
" 4 28.2 67.3 53.3 41.3 73.4 89.0 
5 16.8 52.0 32.0 31.3 79.4 90.8 
$ 1.6 51.3 34.7 36.7 65.0 83.2 
7 3.5 54.0 33.3 37.3 79.0 80.1 
8 0.1 50.7 33.3 32.7 67.5 93.2 
? 1.5 54.7 41.3 32.7 71.9 85.5 
10 14.0 66.0 54.7 52.0 77.3 85.7 
1 1.3 58.0 44.7 38.0 66.5 78.8 
12 0.7 46.7 30.0 20.0 94.2 94.7 
13 8.5 47.3 42.0 39.3 70.5 90.2 
14 0.1 50.7 42.7 38.7 86.4 94.4 
15 20.1 62.7 56.0 43.3 84.7 86.7 
d 0.3 47.3 42.0 30.0 86.6 88.1 
H 23.1 58.7 45.3 40.0 81.3 89.6 
18 16.9 62.0 46.0 30.0 83.2 87.0 
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Scored against the original list, the percent- 
age correct for each list falls to a very low level 
by the reproduction of the second derived list. 
The best list produces an average of only 41% 
correct and the worst list produces an average 
of only 18%. 

The percentage of words recalled in derived 
reproduction scored against the list actually 
presented to the subject remains very high 
through all reproductions. For the reproduc- 
tion of the second derived recall, the lowest 
mean percentage correct recall is 78.8. This is, 
of course, what one would expect, since after 
the original reproduction all the material is 
filtered through the memory span of the origi- 
nal subject. Thus, this subject has reduced all 
of the lists to manageable proportions so far 
as immediate memory is concerned. 

The fact that after the first reproduction 
everything left has been filtered through a 
human memory span produces another im- 
portant result. The correlations of percentage 
correct recall scored against presented lists 
and associative structure are drastically re- 
duced. The first derived reproduction scored 
against original reproduction produces a corre- 
lation with associative structure of .122; the 
second derived list scored against the first 
produces a correlation of .024. These nearly 
zero correlations occur because the memory 
span flattens out the differences between the 
lists. The original subjects recall more of the 
lists high in associative structure than they do 
of those low in structure. This means that sub- 
jects presented with the original reproductions 
are presented with lists uneven in length. The 
lists low in associative structure are short, 

those high in associative structure relatively 
long. The difference in length tends to counter- 
balance the difference in organization between 
the lists with the result that all lists in derived 
reproduction are about equally well recalled, 
and the correlations of correct recall with asso- 
ciative structure is reduced. 


DISCUSSION 


The present experiment produces two im- 
portant results: the correlations between re- 
call when scored against original lists and 
associative structure remains high through at 
least three successive reproductions, and the 


correlations between recall of the derived re. ` 
productions scored against the lists actually pre- 
sented and associative structure drops to near | 
zero. It is very likely that the latter result is 
in part at least the result of lists originally loy - 
in associative structure being restructured by | 
successive subjects as the result of converging 
associations. There are two facts that make 
this likely: it is known that associative struc 
turing does produce stable, relevant intrusions 
in recall (Deese, 1959b), and it is known that 
even short random collections of ordinary 
words possess some small amount of associative 
structuring (Deese, 1960). Thus, the original 
subject in a transmission series is likely to make ` 
a list low in associative structure somewhat 
higher than it was originally by adding a rele 
vant intrusion. The next subject in this series 
will maintain this added word in recall if it is | 
highly associatively related to the list. The | 
next subject may also alter the list further in 
the direction of the structural change suggested 
by the new list. i 

Unfortunately there are not available ex 
tensive enough free association norms to make 
a direct examination of this process possible 
in the present data. The recall protocols for 
the present experiment, however, amply con- 
firm the analysis made on the basis of the corre- 
lations found here and the previously known 
facts about the recall of short lists. 

For example, List 6, which has an index of ^ 
associative structure of 1.6, consists of the 
following words in the original: home, exactly, 
step, devotion, morning, wave, fine, paper, 1m- 
prove, cat, glitter, company, weapon, watch, 
evening. Almost the only high frequency 8530 
ciations in the list are evening-morning aN 
morning-evening. In all three of the original 
reproduction protocols used as first order de- 
rived lists, the evening-morning pair occurs. It 
remains in all subsequent reproductions. Ih 
addition, however, two of the three original te- 
call protocols used as first order derived lists 
included the word “home.” This remains 1 
12 out of the 20 recall protocols for which it 
was presented on the first order derived lists: | 
All three of the first order derived protocols 
that were used as second order derived lists | 
included the word home. For two of thesé 
the word "house" was added by subjects E 
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an intrusion. Thus, 20 subjects were presented 
with second order derived lists including both 
the words home and house. In the recall proto- 
cols for these second order lists 20 out of 20 
subjects produced house and 16 out of 20 pro- 
duced home (note that the intrusion house is 
apparently stronger than home). Thus, for 
many subjects recalling the second order de- 
rived lists, the recall protocols consisted of 
nothing more than the associative doublets, 
house-home and evening-morning. 

Thus, it is likely that some organization is 
introduced by subjects into the material they 
transmit and that this organization is based 
upon the associative frequencies common to 
the population of which the subjects are mem- 
bers. The new organization produces stability 
for the added material; stability for the old ma- 
terial has been produced by the associative 
Structure inherent in the original lists. This 
stability produced for all of the material by 


the associative processes in the original list 


and in the original reproductions is like the 
Stability found in the protocols of successive 
reproductions of meaningful material, and it 
Provides a basic associative model for the 
Processes of leveling, sharpening, and assimi- 
lation as these have been used to describe 
effects in successive reproductions. 

Doubt may be expressed that the results ob- 
tained here with lists of words would apply to 
the recall of connected discourse. Several facts, 
already mentioned, argue that such a transla- 
tion is possible. For one thing we know that 
associative structures are rather well organized 
around “topics” or “concepts” (Deese, 1959b), 
and we also know that associative structures 
form networks even in randomly selected words 
(Deese, 1960). Thus, it seems almost certain 
that associative structures will appear in con- 
nected discourse. No more than this can be 
Said at present, but the simplicity of an asso- 
Clative model for the social transmission of re- 

tained information makes it an appealing one. 
. A brief word should be said about another 
Important implication of the present results 
- or social transmission of verbal information. 
These results imply that such transmission 


. depends in some part upon the associative net- 


Works existing within individuals. Individuals 
. 8Té members of different groups whose 
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members characteristically exhibit different 
associative frequencies; these typically reflect 
class, occupational, and attitudinal interests. 
Successive recalls, of course, should on the 
average conform to the associative charac- 
teristics of the population within which the 
transmission occurs. Thus, starting out with 
the same material one would expect different 
populations to show differences in successive 
reproductions and one would expect these to 
reflect associative frequencies within these 
populations and to be correlated with other 
characteristic differences (Deese, 1959b). 


SuMMARY 


In the present experiment the successive 
reproductions of lists of words in immediate 
recall are studied through three transmissions. 
One transmission is from the original lists of 
words to the original subjects. The second 
transmission is from randomly selected recall 
protocols of subjects receiving the original 
list, and the third transmission is from ran- 
domly selected recall protocols of those sub- 
jects receiving the first derived lists. The lists 
differ in the mean frequency with which every 
word in the list elicited every other word as a 
free associate. This mean percentage frequency: 
for each list provides an index of associative 
structure. Associative structure is highly cor- 
related with correct recall when recall is scored 
against the original list; this high correlation 
is maintained through all transmissions. Scored 
as percentage recall against the list actually 
presented, however, the derived lists produce 
recall with nearly zero correlation with associa- 
tive structure. This is because the first trans- 
mission reduces all lists to the limits of the 
human memory span. It does this by making 
the lists low in structure shorter than the lists 
high in structure. The net result is that the 
original structure remains for the highly 
structured lists in derived recall but it does not 
remain for lists originally low in structure; the 
percentage of words that an individual can 
reproduce of what he has heard in a derived list 
is approximately a constant for all lists, how- 
ever. The results were related to the basic 
phenomena in the social transmission of infor- 


mation. 
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AUTHORITARIANISM, TENDENCY TO AGREE, AND 
INTERPERSONAL PERCEPTION! 


HERBERT C. SCHULBERG? 
Teachers College, Columbia. University 


MPRESSIVELY consistent results have been 
obtained in recent studies of the ability 
of college students scoring relatively high 

and low on the F Scale to judge peers' scores 
on the same scale. Low scorers invariably judge 
the person with whom they have had a brief 
interaction to be different from themselves and 
to have a score in the middle range of the F 
Scale distribution; high scorers think partners 
are like themselves and assign a score similar 
to their own (Crockett & Meidinger, 1956; 
Scodel & Freedman, 1956; Scodel & Mussen, 
1953). These results have been interpreted as 
indicating that those low and high on the F 
Scale have differing stereotypes of their fellow 
Students. However, under conditions of brief 
interaction, neither group is able to determine 
which partners have the high or low F Scale 
Scores (Christie & Cook, 1958; Rabinowitz, 
1956). The consistency of results in this area 
certainly belies part of Cronbach's (1958) wry 
summary of results in the area of interper- 
sonal perception as being “.. . interesting, 
Statistically significant, and exasperatingly 
inconsistent” (p. 353). 

Although the results are interesting and 
Statistically significant, they are, however, 
theoretically inconsistent with the general 
assumptions underlying The Authoritarian 
Personality (Adorno, ^ Frenkel-Brunswik, 
Levinson, & Sanford, 1950). Low F Scale 
Scorers should display greater sensitivity to 
others than the high F Scale scorers, rather 
than simply having different stereotypes. The 
presumed projectivity of high scorers is also 
Puzzling. Rather than referring weaknesses in 
themselves onto other people, as is theo- 


.! This paper is based on a doctoral dissertation sub- 
ee to Teachers College, Columbia University, New 
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tion to his advisors, Rosalea A. Schonbar and Laurance 
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encouragement in all phases of the study. 
Now at Fairfield State Hospital, Newtown, Con- 
necticut. 


retically assumed by Adorno et al. (1950, pp. 
239—241), these experiments have found that 
high F Scale scorers apparently assume that 
other students share their opinions about 
human nature and ideology. 

The previous studies of authoritarian per- 
ceptiveness used the California F Scale, which 
contains only positively worded items agree- 
ment with which is considered indicative of 
authoritarianism. A unilateral scale of this 
type is susceptible to a response set to agree 
(Cronbach, 1946, 1950), and fails to dis- 
tinguish between people who are consistently 
authoritarian (CA) and those who are highly 
agreeing (Ha). Since there are personality 
differences between CA and Ha individuals 
(Cook, 1958; Couch & Keniston, 1960), group- 
ing them together is an invalid procedure. Re- 
cent research suggests that among college 
samples persons who are high on the California 
F Scale by virtue of indiscriminant agreement 
outnumber the consistent authoritarians by a 
ratio as high as 4:15 The consistent previous 
findings are thus probably based upon a com- 
parison of low scorers (consistent nonauthori- 
tarians—CN) with a mixed group of high 
scorers. The latter group is not predominantly 
ideologically authoritarian, but finds F Scale 
items sufficiently ambiguous to agree indis- 
criminately. 

Concern about these problems led to two 
modifications of the Scodel and Mussen (1953) 
experimental design. The first was to isolate 
the three groups of subjects: CN, CA, and Ha. 
The latter two groups were distinguished by 
considering their responses to the reversed F 
Scale items constructed by Christie, Havel, 
and Seidenberg (1958) as well as their re- 
sponses to original California F Scale items. 

The presence in past studies of people who 
are high scorers on the F Scale because they 
simply agree with ambiguous items rather 
than because of such ideological orientation 
may be assumed to account for the reported 
tendency of high scorers to view others as 


3 Personal communication from Richard Christie. 
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similar in F Scale scores. The “acquiescent” 
behavior of this group, on this view, reflects a 
lack of concern rather than an active tendency 
to agree with anything. Cook (1958) found 
that this Ha group was unaffected by a per- 
suasive communication whereas both the CA 
and CN groups showed significant changes in 
the direction of the communication. The Ha 
group was also noted by others as generally 
presenting a “bland” exterior with little evi- 
dence of overt anxiety. 

The hypothesis is thus posed that the CNs 
are the most accurate group in making evalua- 
tions of others since they are thought by 
Adorno et al. (1950) to be the most insightful. 
Although CAs have been primarily thought to 
project their own attitudes onto others, they 
have been found by Siegel (1954) and Lipetz 
(1960) to show some ability in discriminating 
between their own and others’ attitudes. How- 
ever, the CA group sees others only in terms 
of complete agreement or disagreement with 
itself and should therefore be relatively in- 
accurate in making evaluations of others. 
People displaying the response set of highly 
agreeing to both the original and reversed F 
Scale items are assumed to be unaware of 
differences and should consequently be the 
least accurate of the three groups in evaluating 
others. 

The second modification of this study was to 
include an additional measure of interpersonal 
perception. Previous findings that even the 
presumably interpersonally perceptive non- 
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authoritarians were unable to predict accu 
rately partners’ scores on the F Scale suggests 
that college undergraduates might not or. 
dinarily evaluate others along this particular 
dimension. Harris and Kalis (1954) and 
Haythorn, Haefner, Longhorn, Couch, and 
Carter (1956) attempted to have psycho 
logically sophisticated observers judge after a 
period of observation which respondents were 
high and low on the F Scale. The observers 
were unable to do so at better than a chance 
level even though there were significant be- 
havioral differences between the high and low 
scorers. Harris and Kalis pointed out that the 
judges had erroneous stereotypes of the be 
havior associated with differing F Scale scores, 

Such considerations lead to the possibility 
that asking undergraduates to fill out the F 
Scale as their partner would leads to stereo- 
typed judgments. The respondents are neither 
experienced with nor do they usually think in 
terms of an ideological continuum when evalu- 
ating others. Perhaps greater accuracy could 
be attained by the respondents if they made 
their judgments upon overt behavioral cor- 
relates of F Scale scores. People are probably 
more proficient at making judgments about 
others when using familiar descriptive terms 
that have behavioral relevance than when 
forced to guess how others would respond to 
such esoteric items on the F Scale as a com- 
parison of the sex life of the ancient Greeks 
and Romans with current practices. 

To test this possibility an Adjective Rating 


TABLE 1 
DISTRIBUTION OF STUDENT AGREEMENT TO ORIGINAL AND REVERSED F SCALE ITEMS 
(W = 237) 
Ed a à Number of Original Items Agreed With 
Agreed With 0 1 2 3 4OMES 6 7 8 9 4 
10 
3 ae ud i 3 1 1 1 (1) " 
(Dy 155 2 4 (3) 
8 2 (1) 1 7 m MURUS 8 (5) 24220) 1() 
7 2:0)54.0)552300:.: 9 Ds. 2 10 8 (6) 4 (1) 
6 1 9 96.12 7 3 1 1 
5 1 1 6 3 593011 4 SONG) iNT Q) - 1 
4 3 2 3 OS BCR TES) BOA SSI 1 1 
1 1 1 ? 3 (2) 1 1 a 
1 1 gu 
CA 


1 (1) 


Note.—The CN subjects are at the upper left, the Ha sub; 


jects at the upper right, and the CA subjects at the lower right. The num- 


bers in parentheses are the numbers of eligible students included in the test sample. 
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TABLE 2 
F Scare Score or RELIGIOUS GROUPS 
Religion 
F Scale 
Protestant Catholic Jewish 
Original items 4.01 4.60 3.83 
Reversed items 3.21 3.84 3.20 
Full scale* 3.61 4.22 3.52 


* Significantly different on the Full scale by analysis of variance 
beyond the .05 level. 


Scale was constructed consisting of bipolar 
traits that have been observed to differentiate 
between the three groups. By hypothesis, 
judges should be more accurate in making 
evaluations on the Adjective Rating Scale than 
on the F Scale. 


METHOD 


Subjects 


The F Scale and the Adjective Rating Scale were ad- 
ministered to 237 students in a private college in Nassau 
County, New York, during regular class sessions. The 
criterion for selection of the three experimental groups 
was the pattern of response to the full F Scale (Table 1). 
The CN and Ha groups easily met the criterion for con- 
sistent belief in nonauthoritarian ideology, and general 
high agreement, respectively. The CA group was not 
as ideologically authoritarian as would be desirable but 
it was probably as authoritarian a sample as could be 
obtained in this area. The CN group had a mean full 
F Scale score of 2.73, the Ha group had a mean full 
F Scale score of 3.80, and the CA group had a mean 
full F Scale score of 4.69. By analysis of variance, the 
groups’ self-ratings are significantly different beyond 
the .01 level and the groups may therefore be considered 
discrete, 

A product-moment correlation of agreement with 
original and reversed items was computed on the basis 
of the data in Table 1. The obtained r of —.46 indicates 
a strong degree of sophistication in the student body 
tested, in view of its ability to discern between the con- 
Hates ideologies reflected in the original and reversed 
items. 

A letter from the investigator was distributed to all 
of the eligible students asking them to signify times at 
which they would be available to meet further with him. 
The students’ availability at specified times determined 
the final sample of 20 CN, 20 CA, and 20 Ha students. 

+ No other restrictions were placed on the experimental 
sample. 

All experimental subjects were asked to list their 
Dame, sex, age, race, religion, year in college, and 
major subject. The only characteristic significantly dif- 
ferentiating the experimental groups was religion (Table 
2). An analysis of variance revealed greater acceptance 
of original items and rejection of reversed items among 
the 20 Catholic subjects as compared with the 20 Jewish 
and 18 Protestant students. 
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Measures 


The 20-item F Scale developed by Christie et al. 
(1958) was the version used in this study. It consists of 
10 original items from the California F Scale, and 10 
items designed to have a reversed psychological mean- 
ing from the original F Scale items. 

The Adjective Rating Scale was composed of com- 
mon descriptive adjectives. The literature was reviewed 
for previous findings of adjectives that might differen- 
tiate authoritarian, nonauthoritarian, and highly agree- 
ing persons. Other adjectives were added that might 
theoretically differentiate the three groups, and these 
several hundred adjectives were then roughly grouped 
into six categories: impulse control, conventionality, 
intraceptiveness, submissiveness, destructiveness and 
cynicism, and life orientation. 

The final selection of 26 pairs of adjectives was made 
from these six categories by the investigator and three 
other psychologists. A subjective attempt was made 
by the investigator to equate the social desirability of 
each adjective in the pair, and the pairs were then ar- 
ranged on a seven-point scale similar to that used with 
the Semantic Differential Adjective scales. 

The median reliability of the Adjective Rating Scale 
for self-description was determined by the test-retest 
method with 34 students, otherwise unconnected with 
this study, at a 2-week interval. It was found to be .78. 

The median reliability for judging a partner's self- 
description was found to be .76 by test-retest as a result 
of the data obtained from 55 of the 60 judges 2-5 days 
after they had first estimated their partners’ scores. 


Procedure 


Self-descriptions on the Adjective Rating Scale and 
the F Scale had been obtained from every subject during 
the initial group testing sessions. The experimental pro- 
cedure involved having judges attempt to guess the self- 
descriptions of their partners. To provide a basis for the 
estimates, each judge was given the opportunity to in- 
teract with a partner for 15 minutes in a discussion of 
radio, television, and movies. This mode of interaction 
was employed to provide a replication of Scodel and 
Mussen’s (1953) and Scodel and Freedman’s (1956) 
studies. 

Since there were three experimental groups, six com- 
binations of interaction were arranged: Ha-Ha, Ha-CA, 
Ha-CN, CA-CA, CA-CN, CN-CN. All male judges met 
with male partners, while all female judges met with 
female partners. In every interaction, each person 
served as a judge of his partner and as the subject of his 
partner's judgment. After the termination of the conver- 
sation, each judge was asked to respond to the F Scale 
and the Adjective Rating Scale in the manner in which 
he thought his partner had responded to them. In addi- 
tion, every judge was asked to give his own personal 
description of his partner by evaluating him on the Ad- 
jective Rating Scale. 


RESULTS 
Accuracy of Perception. 


Previous studies of authoritarian social per- 
ception have generally determined accuracy 
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TABLE 3 


F SCALE Scores OBTAINED AND ESTIMATED BY THE 
JUDGES IN THE THREE GROUPS 


AE F 
bye esl e 
Groups N 
M |SD| M |SD| M |SD 
Consistent Nonauthori- 
tarian (CN) Judges:| 
Own scores 20 |2.66 |.56 |2.79 |.54 |2.73 | .40 
Estimates on 
CN subjects 10 |3.37 2.85 3-11 
Ha subjects 5 |3.86 3.18 3.52 
CA subjects 5 [4.18 3.14 3.66 
All subjects 20 |3.70 |.68 |3.01 |.56 |3.35 | .47 
Own-estimate differ- 1.04**| x -62°*| 
ence 
Highly agreeing (Ha) 
Judges: 
Own scores 20 44.77 |.56 |2.83 |.51 |3.80 | .43 
Estimates on 
Ha subjects 10 44.65 3.47 4.06 
CA subjects 5 [4.56 3.48 4.02 
CN subjects 5 4.78 3.38 4.08 
All subjects 20 4.67 |.84 |3.44 |.64 |4.06 | .62 
Own-estimate differ- 410 -61**| +26 
ence 
Consistent Authoritar- 
ian (CA) Judges: 
Own scores 20 14.88 |.37 |4.51 |.57 |4.69 | .34 
Estimates on 
CA subjects 10 |4.53 4.17 4.35 
Ha subjects 5 |4.54 3.64 4.09 
CN subjects 5 49 3.68 4.30 
All subjects 20 4.63 |.65 |3.91 |.86 |4.27 | .61 
Own-estimate differ- 25 .60* .42**| 
ence 


* Significant beyond the .05 level, 
** Significant beyond the .01 level. 


by comparing the mean F score of each group 
with that attributed to it by its judges. When 
the full F Scale scores in this study were con- 
sidered, the size of the own-estimate difference 
was in the direction predicted (Table 3). The 
nonauthoritarian judges were most capable of 
distinguishing that others were not like them- 
selves, and estimated the greatest difference be- 
tween their own and partners’ scores. The con- 
sistently authoritarian judges did not see as 
much difference between themselves and others 
as did the nonauthoritarians, but they too 
estimated their partners to have significantly 
different scores. The high agreeing judges saw 
no difference between themselves and others, 
and estimated a score similar to their own for 
all of their partners. 

Since both original and reversed F Scale 
items were used, each type was considered in- 
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dividually. On the original items, the Tesults 
are closely similar to those reported in the 
earlier studies (Crockett & Meidinger, 1956; 


Scodel & Freedman, 1956; Scodel & Mussen, 


1953). The nonauthoritarians again made 
estimates significantly different from their 
own scores, while the authoritarian and high 
agreeing judges made estimates very similar 
to their own scores. An opposite pattern, how- 
ever, appeared on the reversed items where it 
was the authoritarian and high agreeing judges 
who estimated their partners’ scores to be 
significantly different from their own. The 
nonauthoritarians did not distinguish in their 
estimates on the reversed items between their 
own and their partners’ scores. 

Judging the relative accuracy of the experi- 
mental groups by comparing their mean F 
Scale estimate raises the question of the 
possible effect of stereotyping. For this reason 


it was considered preferable to determine the 


relative accuracy of each judge by comparing 
his estimate with the actual score of his partner 
on each item of the F Scale. Using the 
Cronbach and Gleser (1953) discrepancy score, 
it was found that the mean discrepancy score 
for each group of judges was as follows: non- 
authoritarian, 11.47; authoritarian, 12.25; and 
high agreeing, 12.44. Testing these differences 
by analysis of variance showed them to be 
nonsignificant. z 

The first hypothesis concerning relative 
judgmental ability of the three groups of 
judges tended to be supported by a comparison 
of the mean F Scale scores. However, analysis 
of the data by the preferable method of dis- 
crepancy score did not support the hypothesis. 

The relative judgmental ability of the judges 
on the Adjective Rating Scale was also tested 
by use of discrepancy scores. The mean dis- 
crepancy scores were: nonauthoritarian, 9.93; 
authoritarian, 10.49; and high agreeing, 10.65: 
Analysis of variance showed these differences 
between groups to be nonsignificant. 

The lack of significant differences in per- 
ceptiveness between groups of judges on both 
the F Scale and the Adjective Rating Scale 
confirms previous findings. Nonauthoritarians 
employ different stereotypes than author 
tarians but do not display greater perceptive’ 
ness. 


v" 
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- Comparative Accuracy on the Test Instruments 


To determine whether significantly greater 
accuracy was achieved on the Adjective Rating 
Scale than on the F Scale, a correlation was ob- 
tained of the relationship between a subject’s 
self-description and a judge's estimate of this 
subject’s score for each test instrument. The 
mean correlations of .18 on the F Scale and .32 
on the Adjective Rating Scale were then con- 
verted to z scores to permit testing the sig- 
nificance of the differences. The difference be- 
tween correlations was found to be nonsignifi- 
cant. The hypothesis is, therefore, not sup- 
ported that it is easier to estimate a partner's 
score on the Adjective Rating Scale than on the 
F Scale. 


Personality Descriptions 


The three groups of judges had been shown 
to differ significantly in their attitudes to 
ideological statements. The question arose as 


TABLE 4 

SELF-DESCRIPTIONS OF THE THREE GROUPS 
Num-| Adjective | Ha | CA | CN Adjective 
1 | Ambitious 3.60 | 2.85 | 2.95 | Happy-go-lucky 
2 | Many interests | 2.90 | 3.60 | 2.95 | Specific interests 
3 | Warm 2.65 | 1.85 | 2.60 | Cold 
4 | Stubborn 4.40 | 5.00 | 4.50 | Easygoing 
5 | Intellectual 4.55 | 4.30 | 3.95 | Practical 
6* | Cooperative | 4.20 | 3.40 | 2.90 | Competitive 
7 | Nervous 4,30 | 4.35 | 4.70 | Calm 
8 | Neat 3.30 | 2.30 | 2.40 | Careless 
9 | Smooth 3.40 | 2.75 | 2.75 | Rough 
10 | Questioning 2.55 | 3.55 | 2.75 | Accepting 
11** | Simple 5.25 | 4.20 | 5.45 | Complex 
12* | Submissive 5.30 | 5.00 | 4.20 | Aggressive 
18 | Awkward 5.45 | 5.25 | 4.95 | Poised 
14 | Introverted 4.65 | 4.15 | 4.20 | Extroverted 
15* | Soft 3.90 | 2.75 | 3.15 | Hard 
16 | Optimistic 2.75 | 3.65 | 3.05 | Pessimistic 
17 | Skeptical 4.15 | 5.10 | 4.55 | Trusting 
18 | Restless 3.10 | 3.60 | 3.30 | Placid 
19* | Respectful 3.20 | 2.20 | 3.55 | Sarcastic 
20 | Flexible 3.20 | 3.25 | 2.35 | Rigid 
21* | Far 4.25 | 5.40 | 4.55 | Close 
22 | Average 4.40 | 3.00 | 3.95 | Different. 
23 | Self-satistied | 4.90 | 5.20 | 4.95 | Concerned with 

others 

24** | Flexible morals | 2.95 | 5.55 | 4.10 | High morals 
25 | Friendly 2.00 | 1.80 | 2.45 | Restrained 
26 | Nonconforming | 3.40 | 4.55 | 3.90 | Conventional 


^ 5 Significantly different by analysis of variance beyond the .05 
level, 


i 3 Significantly different by analysis of variance beyond the .01 
rel, 
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to whether these differences are the result of 
artifacts in the construction of the F Scale, 
or whether they are reflective of deeper per- 
sonality traits. Some evidence about this can 
be obtained from the self-descriptions of the 
three groups on the Adjective Rating Scale 
(Table 4). The lower the numerical score, the 
more a subject felt the left-hand adjective in 
the pair to be self-descriptive. The higher the 
numerical score, the more the right-hand ad- 
jective was felt to be self-descriptive. Possible 
scores ranged from 1 to 7. 

An analysis of variance was performed of the 
self-descriptions on every pair of adjectives. 
Seven of the 26 pairs showed significant differ- 
ences, seemingly indicating the presence of at 
least some inherent personality differences. Of 
greatest surprise were the self-descriptions of 
the high agreeing subjects relative to the other 
two groups. They did not view themselves as 
the conforming, simple, submissive persons 
that others think they are, but rather viewed 
themselves in almost opposite terms. This 
finding supports Couch and Keniston's (1960) 
conclusion that "yeasayers" are “id-domi- 
nated” personalities, with little concern about 
or positive evaluation of an integrated control 
of impulses. 

The manner in which each of the three 
groups was described by the judges on the Ad- 
jective Rating Scale was compared. The data 
used were the ratings made by the judges when 
asked what their partners are really like. 
Differences between each of the three groups 
on every pair were subjected to an analysis of 
variance but now only Pairs 1, 2, 3, and 11 
were discriminating. In view of the closely 
similar profiles that result, these data are con- 
sidered indicative of a stereotyped manner of 
judging others that is independent of the F 
Scale characteristics of the judges. 

Comparisons were also made of the differ- 
ences between each group’s self-description 
and its description by others. A common core 
of adjectives appear to be significantly differ- 
entiating between the manner in which all of 
the subjects describe themselves and the way 
they are viewed by others (Table 5). 

Tt seems that the present social environment 
has made it quite desirable for all of the sub- 
jects to view themselves as complex, aggres- 
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TABLE 5 
ADJECTIVES SIGNIFICANTLY DIFFERENTIATING 
BETWEEN SELF-DESCRIPTIONS AND 
DESCRIPTIONS Bv OTHERS 


Group] Seli-Descriptions |  Descziptions by t 

CN | Complex Simple 3.00** 

Restless Placid 2.73* 

Sarcastic Respectful 2.54* 

Flexible Rigid 2.65* 

Flexible morals | High morals 2.61* 

Nonconforming | Conventional 2.54* 
Ha Competitive Cooperative 3.13** 
Questioning Accepting 2.76** 
Complex Simple 3.05** 
Aggressive Submissive 3.32** 
Restless Placid 3.39** 
Sarcastic Respectful 2:33** 
Flexible morals | High morals 4.87** 
Nonconforming | Conventional 3.50** 

CA Ambitious Happy-go-lucky | 3.49** 

arm Cold, y 2.67* 

Complex Simple 2.62* 
Aggressive Submissive 3.42** 

Restless Placid 2.40* 

High morals Flexible morals | 2.42* 


* Significant beyond the .05 level, 
** Significant beyond the .01 level. 


sive, restless, and nonconforming. However, 
when fellow judges are asked what these stu- 
dents are really like, they are described as 
being significantly less complex, even simple; 
placid; more on the submissive than aggressive 
Side; more respectful than they realize; and 
certainly more conventional. 


Discussion 


This study contributes further evidence to 
the findings that there is no difference in the 
ability of authoritarian, nonauthoritarian, and 
high agreeing individuals to estimate the scores 
of partners on the F Scale or on an Adjective 
Rating Scale. It, therefore, becomes important 
a clarify the extent to which this finding might 

e geni lj 

In the present and previous studies, judges 
have had misconceptions about the relation- 
Ship of behavior and ideological belief. In addi- 
tion, the nature of the interaction was such 
that there were no behavioral manifestations 
of the traits to be judged, and judges resorted 
to stereotypes. A similar problem unfortu- 
nately arose regarding the Adjective Rating 


Scale as the social desirability of many pairs of 


HERBERT C. SCHULBERG 


items was found to be unequated. This m. 
sulted in stereotyped descriptions, and pn. 
vided little valid basis for discriminatin be 
tween groups on parts of the scale. 
Objections can be raised about the expe 
mental design of investigations of relative 
accuracy between the groups studied. Thed 
velopment of a more appropriate form of ip 
teraction in which a judge can obtain dire th 
pertinent evidence is necessary. Upon thi 
basis of reported research it cannot be con 
cluded whether there is any relationship be 
tween general perceptiveness and ideological 
beliefs. At this time the investigator is stud inp 
the relationship of perceptiveness and ideolog: 
cal belief in a different manner. The exper 
mental interaction has been eliminated ant 
instead the three groups are being tested for 
perceptiveness on the Insight Test developel 
by Tolor and Reznikoff (1960). J 
The failure of judges to make more accuratt 
judgments on the Adjective Rating Scale than 
on the F Scale may be due to various factors) 
Judging others may be a broad general ability 
unaffected by the type of judgment required, 
as has just been reasserted by Cline and 
Richards (1960). However, it is also possible 
that varied cues and sensitivity are requirel 
in making different judgments, making ont 
instrument superior to the other. The in 
Struments used in this study, though, may 
depend upon the same cues. j 
The nature of the stereotyped F Scale judg 
ments obtained in this study indicate the neal 
for a re-examination of this issue. On thé 
original conservative items the previously t€ 
ported stereotypes were again obtained (Table 
3). However, an opposite stereotyped pattem 
appeared on the reversed liberal items. It wa’ 
the nonauthoritarians who thought others 
Were just as liberal as they, or that others weit 
like themselves. The authoritarians thought 
others were more liberal than they, or that 
others were unlike themselves. The present 
findings seem to indicate that both authori 
tarians and nonauthoritarians think that 
others agree with what they agree. It is only 
about things that they oppose that they feel 
themselves to be isolated from others. b 
This phenomenon cannot be explained on the 
basis of response set. The high agreeing group, 
chosen specifically on the basis of its demor- 
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strated agreeing response set, should have 
considered others to be like itself. However, it, 
too, saw others as significantly more conserva- 
tive when estimating their scores on the liberal 
items. The “mirror image" nature of these 
findings are quite unexpected but raise a 
theoretically interesting point. Nonauthori- 
tarians, too, are capable of systematic distor- 
tions because of inner needs. It would be de- 
sirable to have this facet of the study retested 
on a different sample to determine the gen- 
erality of the present findings. 

Significant differences between one’s own 
self-description and the judgment of another 
were found on primarily the same cluster of 
adjectives (Table 5). The current social en- 
vironment has made it quite desirable for 
everyone to view himself as complex, aggres- 
sive, restless, and nonconforming. The ideal 
of the maturing college student has thus be- 
come quite similar to the individualistic, 
striving personality favored by Whyte (1956). 
Alas, when fellow judges are asked what these 
students are really like, many of the traits of a 
future “Organization Man” are still very 
prominent. 

It is quite likely that the judges are very 
accurate in describing their partners as really 
being easygoing, good-natured, submissive in- 
dividuals. Although the judges are unaware of 
these traits in themselves and instead have an 
opposing idealized self-image, they are quite 
capable of recognizing these traits in others. 
Further research is necessary to cross-validate 
the differences obtained. This is particularly 
necessary for the high agreeing group because 
of the striking disparity between the way it 
has been described, (e.g., Barron, 1953), and 
the way in which it views itself, (e.g., Couch & 
Keniston, 1960). Also of value would be experi- 
mental clarification of the relationship between 
an agreeing response set and conforming or 
yielding social behavior. 


SUMMARY 


Previous studies of authoritarian social per- 
ception failed to distinguish between consistent 
high and agreeing high scorers on the Cali- 
fornia F Scale. This distinction was now made 
by use of the revised items developed by 
Christie et al. (1958). No differences were 
found in the relative accuracy of the three 
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groups studied, but each employed a new, 
characteristic stereotype. Personality differ- 
ences were found in the way the three groups 
described themselves. The discrepancy be- 
tween the high agreeing group’s self-descrip- 
tion and its behavior on the F Scale gives rise 
to the possibility that it may be employing 
some form of reaction-formation. 
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SOCIAL REINFORCEMENT IN ATTITUDE CHANGE! 


ELEANOR E. MACCOBY, NATHAN MACCOBY, A. K. ROMNEY, anp J. STACY ADAMS? 
Stanford University 


HE effects of a persuasive communica- 

tion have been shown to change with 

time. In some instances there are de- 
layed changes in the direction advocated by 
the communication—“sleeper effects” that 
do not appear in immediate postcommunica- 
tion measurement. Furthermore, some of the 
effects that do appear immediately after the 
communication are relatively stable, while in 
other instances, most of the effects are lost 
with the passage of time. In their pioneer 
studies of attitude change over time, Hovland, 
Lumsdaine, and Sheffield (1949) reported that 
their sleeper effects tended to occur among sub- 
jects who were initially opposed to the point 
of view advocated by the communication but 
who belonged to a group (e.g., an education 
group) most of whose members usually took 
the position being advocated. Two possible 
explanations of these findings are: (a) for a 
“predisposed” person, other attitudes that he 
already holds are consonant with the message 
of the communication, and serve to reinforce 
it during the interval following the com- 
munication when he rehearses the message to 
himself; (5) following exposure to a persuasive 
communication, subjects discuss it with one 
another; “predisposed” persons tend to be 
surrounded by people who agree with the 
message and therefore these conversations 
usually tend to reinforce it, while “nonpre- 
disposed” people are typically exposed during 
the postcommunication discussions to con- 
trary points of view that weaken the effects of 
the message. 

The second of these possibilities serves as a 
Starting point for the present study, but we 
have broadened the issue somewhat. Other 
Tesearch has suggested that interpersonal 
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.” Now at Behavioral Research Service, General Elec- 
tric Company, 
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communication is a highly selective matter. It 
is by no means a foregone conclusion that an 
individual who is surrounded by people who 
have a different point of view from his own 
about a particular issue will either discover 
this fact (see Allport, 1924, on pluralistic ig- 
norance), or, having discovered it, discuss the 
difference of opinion with his fellows 
(Maccoby, Matthews, & Morton, 1954). He 
may simply arrive at a tacit agreement with 
them not to discuss divisive matters. 

We do expect, however, that there are cer- 
tain conditions under which an individual will 
actively seek conversation with others about a 
communication to which he has been exposed. 
Our thinking on this matter stems in part from 
Festinger’s (1957) theory of cognitive dis- 
sonance. In the case of a person who is exposed 
to a persuasive communication with which he 
disagrees, and who is not changed by it, dis- 
sonance should be produced and the subject 
should engage in a number of strategems to 
weaken the impact of the communication and 
strengthen his original position. As Brodbeck 
(1956) has shown, one of these stratagems is 
for the subject to talk to people who he knows 
agree with his initial position. Adams (1961) 
has also demonstrated a tendency for a sub- 
ject to seek support for his initial position, 
following communication. For people who do 
change in the direction advocated by the com- 
munication, the act of change itself presum- 
ably serves as a reducer of dissonance. Never- 
theless, some dissonance should stem from the 
remaining elements of the subject’s original 
position; and he should strive to get rid of these 
elements by seeking conversation with people 
whom he can expect to tell him he was right 
to accept the point of view advocated by the 
communication and help him to refute re- 
maining elements of his old belief. That is, if 
a person is changed by a communication, he 
should behave quite differently from what 
Brodbeck and Adams reported: he should 
seek support for his new position rather than 
his initial one. 

Tf it is true that people tend to insulate 
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themselves from discussion with others who 
disagree with them and to seek out conversa- 
tion on a topic in order to bolster an existing 
point of view (and discharge the tensions re- 
sulting from actual or threatened attitude 
change), can it be said that social interaction 
plays any direct role in attitude change? Is 
social interaction following a communication 
only a kind of epiphenomenon, which accom- 
panies but does not directly affect the final 
impact of the communication? 

In pursuing these issues, we ask the fol- 
lowing specific questions: 


1. Is the content of postcommunication conversations 
such as to provide additional pressure on the indi- 
vidual either to accept or reject the communication? 

T. Specifically, to what extent are people exposed to 

* opposing points of view in discussions? 

2431s there any relationship between the initial effects 
of a communication and the kinds of conversations 
subsequently engaged in? 

3. Is the net change in an attitude over a period of time 
related to the occurrence and nature of conversations 
occurring in the interim? 


Out of a series of studies on communication 
. to parents and subsequent attitude change, we 
have two studies that contain data bearing 
upon these questions. For both studies, our 
persuasive communication was an illustrated 
booklet advocating late toilet training. The 
booklet advocated that training should begin 
when the child is at least two years old, and 
this message was at variance with the initial 
position of a large majority of our subjects, 
The two studies differed somewhat in design 
and sample, and they are described briefly be- 
low. 


METHOD 
San Francisco Study 


The subjects for this study were three groups of mar- 
ried women, the first group having no children, the sec- 
ond having a first young baby, and the third having 
older children only. The comparison of these three 
groups in relation to attitude change has been presented 
separately (Maccoby, Romney, Adams, & Maccoby, 
1959). For the present report, the three groups are 
combined. 

An initial interview was conducted in the Tespond- 
ents’ homes, during which each respondent was asked to 
express her opinion about a number of aspects of child 
training, including the proper age to begin bowel train- 
ing. About 2 weeks later, another interviewer gave the 

toilet-training booklet to the respondent and asked her 
to read it, immediately after which another interview 
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state her opinion about the proper age for beginning 
toilet training. A third home interview was conducted 
approximately 6 months later including the same ques- 
tion on the proper age for toilet training, and also includ- 
ing questions on any discussions the respondent had 
had about the booklet following the second interview, 
Interviews were also conducted at each stage with con- 
trol groups who did not read the booklet; but for our 
present purposes, these groups have been omitted since, 
of course, they could not be asked about postbooklet 
discussions. The size of the sample (booklet groups only) 
obtained in each round of interviewing was as follows: 


was conducted, and each respondent was again asked to 
1 


First interview, November 1958 167 
Second interview, December 1958 151 
Third interview, May 1959 


San Jose Study 


In this study, the subjects were all married women 
with a young baby. Some had older children as well, 
some did not. A home interview was conducted initially 
to provide a baseline measure on opinions about age of | 
toilet training and other child rearing topics. Then the < 
subjects were invited to attend one of a series of evening - 
meetings during which, among other activities, they 
read the toilet-training booklet, During the evening 
meeting, a paper-and-pencil questionnaire was adminis- 


136 


tered, including a question on the proper age for toilet [ 


training. Approximately 2 weeks after the evening meet- ~~ 
ings, the subjects were telephoned and asked about any | 
postmeeting discussions they had had, as well as about 
their current position on age of toilet training. The num- 

ber of cases for each of the three rounds of data collec- 
tion were: 


Initial home interview, March 1959 1040 
Questionnaire at meeting, early April 1959 228° 
Telephone interview, late April 1959 2248 
RrsurTS 
Content of Conversations 


A large proportion of the women who read 
our booklet did discuss it subsequently with 
Someone. Sixty-three percent did so in the 
San Francisco sample, 68% in San Jose. Within 
the San Francisco sample, the women with 
young babies were more likely to discuss the 
booklet with someone than were the women 
who did not yet have a baby or whose children 
were all old enough so that toilet training was 
Not an issue. 

We asked subjects to describe to us the 
nature of the conversations that took place— 
whether they could estimate ahead of time 
what the person they talked to about the 
booklet would think of its position and 


* There was an additional control group of 32 cases 
who came to the meetings, but who were not given the 
toilet-training booklet to read. 
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specifically what position the two parties to 
the conversation took, if any. A consistent 
picture "of the way conversations are circum- 
scripéd in the interests of maintaining har- 
monious social relations emerged from these 
descriptions. It seems that women feel that a 
good deal of tact is required in talking about 
child rearing with other women. They try to 
avoid giving advice, lest they be considered in- 
trusive, and sometimes do not give advice 
even when directly asked. Even when it is 
not a matter of giving advice, they avoid 
making a direct statement of their own 
opinions, except under very favorable cir- 
cumstances. They tend to ask for the other 
person's point of view first before stating their 
own, and then simply remain silent if they 
differ. Or, sometimes a woman presents a 
point of view as emanating from someone else, 
so that she herself does not become the target 
if the point of view draws fire. Perhaps the 
most common technique for avoiding con- 
troversy is to make some estimate in advance 


“ of the nature of the other person's point of 


view, and then to bring up the topic only if 
one is fairly sure that agreement already exists, 

These processes are illustrated by the fol- 
lowing facts from our data: slightly over half 
the time, respondents thought they knew in 
advance how the person they talked to would 
react to the booklet. And, by their own re- 
Ports, their expectations were almost uni- 
versally borne out—out of 191 instances in 
Which a respondent said she expected a par- 
ticular reaction from another person, there 
were only 4 instances in which she reported 
that an opposite reaction actually occurred. 
Of course, there is probably a good deal of 
tetrospective bias in the reported expectations. 
Furthermore, it is possible that some of our 
respondents predictions were self-fulfilling in 
the sense that the persons with whom they 
talked may have politely gone along with the 
Tespondents' expectations and not expressed 


their real opinions in all instances. Neverthe- 


less, let us assume that our respondents did 
know something in advance concerning the 
opinions that many of the people they talked 
to would probably express. Armed with such 
knowledge, they seem to have initiated a con- 
Versation about the booklet more often with 
People they thought would agree with it. For 


among the people they talked to, they report 
that there were twice as many people who sup- 
ported late toilet training as who disagreed 
with the booklet and advocated early training. 
We know from our initial interviews that this 
distribution of opinion would not be at all 
typical of our respondents themselves before 
reading our booklet, nor presumably of the 
people they commonly associate with. So, if 
people did reasonably often express their real 
opinions to our respondents, there appears to 
have been a selective factor at work in the 
Choice of people to talk with. We discuss this 
point further below. 

To return to the nature of the conversations 
held: our respondents reported that they them- 
selves, in talking about the booklet with others, 
most commonly simply described it and asked 
for the other person's opinion, rather than 
offering an opinion of their own. When they 
did offer an opinion, they more often supported 
the booklet than disagreed with it. 

In view of the relatively small number of 
women who openly disagreed with the booklet 
themselves, in conversation with others, and 
the relatively small number of the people our 
respondents talked to who expressed disagree- 
ment, we would not expect a very high level of 
expressed differences of opinion. Even so, the 
number of differences of opinion between the 
two people conversing were far fewer than 
would be predicted if the pairs of conversers 
were randomly assorted. Table 1 shows the 
relationship between the views of the two 
people involved in those cases where both 


TABLE 1 
RELATIONSHIP BETWEEN OPINIONS EXPRESSED BY 
Two PARTIES TO CONVERSATION 
(Included are those conversations in which both parties 
expressed definite opinions—San Jose and 
San Francisco combined) 


Opinion About Booklet Ex- 
pressed by Respondent in 
Conversation—Number 
of Subjects* 


Agreed with Disagreed with 
It: Advocated It: Advocated 


inion of Person Respondent 
Sate Talked to 


Late Training Earlier Training 
eed with booklet 60 5 
Degen! with booklet 13 20 


x = 30.59, p «0.01 


® There were 293 subjects in the two studies who reported at 
least one conversation, but only 98 subjects reported a conversa- 
tion in which both participants expressed a definite point of view. 
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people expressed a definite opinion about the 
booklet's point of view. For this table, the 
instances of such conversations in the two 
studies—San Jose and San Francisco—are 
combined. There is a close relationship between 
the opinion expressed by our respondent and 
the opinion she reports having been expressed 
by her co-conversationist. In all the conver- 
sations for which we have relevant data, there 
were only 18 instances in which the two parties 
to the conversation expressed different points 
of view about the booklet's message (see Table 


1). 
Postchange Selectiveness in Conversation-Seeking 


We mentioned earlier a prediction from 
Festinger’s (1957) dissonance theory: that 


TABLE 2 


AMOUNT OF IMMEDIATE POSTBOOKLET CHANGE IN THE 
Supyect’s OPINION ABOUT AGE OF BEGINNING 
TOILET TRAINING IN RELATION TO THE 
CoNTENT OF SUBSEQUENT CONVERSA- 

TIONS ABOUT THE BOOKLET 


No 
Opinion Expressed b» | OPE | Some. | Considerable 
Ec ME | ane | “eat 
arlier 
Training 
San Jose 
Subject talked te: 
Only people who agreed | 21 2 31 
with booklet = 1.57 
Only people who disagreed | 10 7 1 
with booklet. 
Both agreers and dis- | 11 13 7 
agreers 
Neutrals only 9 6 5 
Did not discuss booklet | 22 (30%)| 25 (35%)| 21 (30%) 
73 72 71 
San Francisco 
Subject talked to: 
Agreers only 13 6 23 
Disagreers only 12 3 "a ante 
Both 6 3 T 
Neutrals only 1 3 5 
Did not discuss booklet | 21 (40%)| 10 (52%)| 14 (26%) 
53 25 53 


5 “Some change” means that these subjects shifted to advocat- 
ing a beginning of toilet training 1-5 months later than in their 
initial interview; “considerable change” means a shift of 6 months 
or more in this direction. 

* These /'s have been computed by comparing the mean initial 
change scores of those who talked only with agreers with the mean 
initial change scores of those who talked only with disagreers. The 
combined probability for the two studies of these differences oc- 
curring by chance is less than .01. 


once people have reacted to a communication 
by accepting the new position that the com- 
munication advocates, they should try to re- 
duce any remaining dissonance arising from 
remnants of their previous beliefs by seeking 
support for their new belief. We have already 
seen that our respondents engaged in a sur- 
prisingly large number of conversations in 
which the other person involved expressed 
agreement with the booklet’s message, and 
that this was probably not altogether a 
chance matter, since the respondents often 
had knowledge in advance about what position 
the people they talked to would take. The 
question raised by Festinger’s theoretical 
point is this: is the seeking of support for the 
position of the booklet more common among 
people who were influenced by the booklet 
than by those who were not? In attempting to 
answer this question, we subdivided our 
respondents into the following groups: (a) 
those who talked only to people who agreed 
with the position of the booklet; (b) those who 
talked only to people who disagreed with it; 
(c) those who talked to both kinds of people; 
(d) those who talked only to neutrals—that is, 
people who expressed no opinion; and (e) those 
who did not discuss the booklet; and compared 
these groups on the amount to which they 
were influenced by the booklet immediately 
upon reading it. Our prediction from dis- 
sonance theory is that those who were changed 
by the booklet in the direction of advocating 
later toilet training should be concentrated in 
Group a, and those not influenced by the 
booklet should be found primarily in Group b. 

As Table 2 shows, in both studies a certain 
amount of selectiveness in the choice of 
discussants appears to have occurred, for we 
find that the people who talked only with 
others who agreed with the message tended 
to be those who had been initially most 
changed in the direction advocated by the 
booklet, while talking only with people who 
disagreed with the booklet was more confined 
to people who had not initially accepted the 
booklet’s point of view. (In San Jose, the 
differences had a p value of .12, in San Fran- 
cisco, of <.02, and the combined probability 
is <.01.)* 


* The findings reported in Table 2 are not an artifact 
of either initial position or final position. That is, if we 


| 
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We must consider an alternative explanation 
of the findings in Table 2. Perhaps our re- 
spondents were not actually selecting people 
to talk to who previously advocated the 
point of view of the message, but were instead 
talking to an unselected assortment of people 
whom they induced to express polite agreement, 
or whom they perceived as being advocates of 
late toilet training whether or not they in 
fact were. Although this explanation cannot 
be ruled out, it does not seem a likely one 
when the expectations of our subjects con- 
cerning the positions of the persons sought 
out are separately analyzed. We reason that 
if our respondents were selecting people to 
support their new views, they would have to 
do so on the basis of what they expected 
(tightly or wrongly) that the other person 
believed. If, on the other hand, they were 
persuading rather than selecting the people 
they talked to, then the other person’s opinion 
should be related to the respondent’s new 
position whether or not the respondent 
estimated in advance what the other person 
would think. In the San Francisco study, 
we find that among the people who said they 
knew what opinion to expect, the relationship 
between the amount of change produced by 
the booklet and the opinions they expected to 
hear expressed in their postbooklet conversa- 
tions was greater than the relationship reported 
in Table 2 (¢ = 4.32), whereas, when the 
respondents did not know what to expect, the 
telationship is nonsignificant (though in the 
same direction). In the San Jose study, the 
relationship reported in Table 2 was essentially 
maintained in the two parts of the sample, 
those who did not and those who did say they 
knew what to expect. 

Thus, in one of our two studies, we have 
some evidence that the opinion change 
produced by the booklet was inducing people 
to seek out conversations with people they 
expected to support their new position. In the 
other study, we cannot rule out the possi- 


subdivide the sample into those initially advocating 
very early training, and those advocating somewhat 
later training (almost none advocated training as late as 
that recommended by the booklet) we find that the rela- 
tionship between the amount of change following the 
reading of the booklet and the points of view expressed 
by co-discussants still hold; and this is also true if we 
subdivide the sample on final position. 


bility that such relationship as there was 
between opinion change and the nature of 
postbooklet conversations was produced by 
the respondents’ inducing, rather than select- 
ing, supporting opinion. 


Influence of Postbooklet Discussion 


We have seen that people tended to expose 
themselves to other people’s views about the 
booklet in a rather guarded and selective 
fashion. Very rarely did any open disagree- 
ments about the issue being discussed occur. 
Tf one person expressed an opinion, the other 
usually either agreed or was noncommittal. 
Furthermore, people often made predictions 
about the probable views of other people 
with whom they might talk and talked about 
the booklet mainly with people whom they 
expected to agree with it. The more the 
respondent's views changed in the direction 
advocated by the booklet at the time of 
reading it, the more likely he was to seek 
out supporting opinion. We earlier raised 
the question as to whether, in view of all 
this selectiveness, all this self-insulation from 
opposing points of view, discussion serves 
any function whatever in producing, or main- 
taining, opinion change. 

Our evidence is that the discussion itself 
does have an effect. Its major function appears 
to be that of maintaining opinion change once 
it has been produced rather than producing 
additional change. The analysis bearing upon 
this point will be reported for the San Fran- 
cisco study only, since in San Jose, there was 
too little "backsliding" to permit studying 
the maintenance of change.* 

In San Francisco, we have again divided 
our sample into those who talked only with 
people who agreed with the booklet, only 
those who disagreed, etc. For each of these 
groups, we have inquired as to whether the 


5 There area number of possible reasons why the atti- 
tude changes produced in San Jose were so lasting—for 
one thing, the time interval over which the maintenance 
of change was studied was only 2 weeks; furthermore, 
the sample was entirely made up of women who had 
young babies, and the involvement in the toilet-training 
issue for such women is great enough to produce con- 
siderable resistance to backsliding. And finally, the con- 
ditions of presentations of the message at the group 
meetings in San Jose were undoubtedly more prestigeful 
than in the home interviews in San Francisco. 
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TABLE 3 


RELATIONSHIP BETWEEN DISCUSSION PATTERNS AND 
THE MAINTENANCE OF OPINION CHANGE 


t Talked to: (Number 
Reve j (NI 
" Both Did 
Dis- 
ers or [pact | Total 
- [Discuss 
kar Ed 
Initially changed in| 
direction of book|- 
let, and: 
Change maintained^ | 25 2 10 8 45 
Change lost? 4 5 7 14 | 30 
Initially not changed 
in direction of 
booklet, and: 
Continued no such 9 8 16 19 42 
change 
Delayed change 4 4 2 4 14 
(sleepers) 
N= 42 19 25 45 131 


= “Change maintained" means that the respondent initially 
changed in the booklet’s direction and did not revert to or below 
his original position during the 6-month interval. One third of this 
group did “backslide?” to some degree, but not all the way. 

5 “Change lost" means that, after 6 months, the individual had 
reverted to his original position or below it—that is, advocated as 
early or even earlier toilet training as he had done in the first 
interview. 


initial changes in opinion following the 
reading of the booklet were maintained over 
the 6-month period that elapsed between the 
second and third interviews. As Table 3 shows, 
people who talked only with others who agreed 
with the position of the booklet tended to 
retain the initial effect of the booklet, at least 
in part. In contrast, the very small group who 
had initially changed in the direction of the 
booklet but subsequently talked only with 
people who disagreed with it tended to revert 
to their initial position. Perhaps more interest- 
ing is the group who initially changed in the 
direction of the position advocated and then 
did not talk to anyone about the booklet during 
the next 6 months. Such people tended to 
backslide to their initial positions. If we select 
respondents who had changed toward the 
advocated position after first reading it, and 
within this group compare those who talked 
only to “‘agreers” subsequently with those who 
did not talk with anyone, the difference in 
the maintenance of opinion change is highly 
significant—the chi square value is 13.61, 
df = 1, p = <.001. 

The relationship shown in Table 3 is not 
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an artifact of either group membership, 
degree of initial change, initial position, or 
final position. That is, while it is true that more 
of the women with young babies discuss the 
booklet with agreers only, and more women 
in this same group maintained their opinion 
change, if one studies the relationship between 
discussion and maintenance of change within 
each of the three sample groups, the direction 
of the relationship shown in Table 3 obtains 
within each group. Similarly the tendency for 
those who talk with “agreers” only to maintain 
that opinion change is found for people with 
different degrees of initial change, differing 
initial positions, and differing final positions. 

An interesting point has to do with the ef- 
fects of conversation in producing “‘sleepers”— 
delayed change in the direction advocated by 
the booklet. This was the problem originally 
raised by Hovland and his co-workers (1949). 
We did not find a very large number of such 
delayed changes in our study. But those that 
did occur were not concentrated among initial 
nonchangers who, after reading the booklet, 
engaged in conversation with people who 
supported the position that the booklet advo- 
cated. Rather, the effect of conversations 
appeared to occur in the maintenance of 
opinion change already produced by the 
booklet. 


SUMMARY 


Two studies were conducted in which an 
experimental communication was administered 
to groups of women, advocating later toilet 
training than most of the subjects initially 
supported. The post-exposure conversations 
that these women held with others were then 
studied. Our major findings are: 

1. The conversations were selective, in that 
people who were influenced by the communica- 
tion tended to have conversations with people 
who expressed agreement with, and support of, 
the respondent’s new position, while people 
who continued to advocate early toilet training 
after reading the booklet tended to have 
conversations with people who expressed 
support of this position. 

2. The major effect of the conversations 
was not to produce additional or delayed 
opinion change, but to prevent “backsliding” 
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on the part of respondents who had been 
initially influenced by the communication. 
People who did not engage in change-support- 
ing conversations in the period following 
exposure tended to revert to their initial 
position. 
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FATHER-ABSENCE AND DELAY OF GRATIFICATION: 
CROSS-CULTURAL COMPARISONS! 
WALTER MISCHEL* 


Harvard University 


Hr dual purpose of this study is to 

investigate the effect of father-absence 

in the home on children's ability to 
delay gratification, as measured by their 
preference for immediate, smaller reinforce- 
ment (ImR) over delayed, larger reinforcement 
(DelR); and test hypotheses based on eth- 
nographic data concerning cross-cultural and 
intracultural differences in preferences for 
ImR or DelR. Opportunities to study sys- 
tematically the effect of father-absence on 
the child's behavior, in this case on his delaying 
the immediately available but relatively 
trivial for the sake of later but larger outcomes, 
are rare. Either fathers are not absent for 
long periods or other conditions tend to be so 
deviant that generalizations to less extreme 
cases must be most tentative. The possible 
effect of prolonged absence of a parent is 
nevertheless of such import for any theory of 
identification and personality development 
that data are much needed. For example, 
Whiting (1959) is persistently concerned with 
developmental and identification. differences 
between children reared in mother-child and 
children reared in nuclear (father-present) 
households. 

The present research, based on investigations 
in the southern Caribbean, presents data on a 
large number of father-present and father- 
absent boys and girls, sampled from a variety 
of settings, and differing sufficiently in age so 
that the role of the latter can be tested. 
Father-presence or absence is related in this 
study to the individual’s choice preference for 
ImR or DelR. Such choice preferences are 
fairly easily elicited and provide lifelike 

1 This study was carried out under the partial support 
of a grant from the Laboratory of Social Relations, Har- 
vard University, and Grant M-2557 (A) from the 
National Institute of Mental Health. 

2 Grateful acknowledgment is made to the Education 
Office of the Government of Trinidad and Tobago, and 
of the Government of Grenada, and to their officials 
for kind cooperation in testing within the school 
systems. 
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behavior which is readily quantified and some 
of whose empirical correlates have been 
explored independently. Namely, preference 
for DelR (as opposed to ImR) has been 
related positively to accuracy in time state- 
ments and to social responsibility (Mischel, 
1961b). The latter was measured by an 
independently validated scale (Harris, 1957) 
and found to correlate substantially with 
other measures of personal and social adjust- 
ment and maturity. Preference for DelR has 
also been negatively related to delinquency, 
positively to strength of n Achievement, and 
negatively to acquiescence or “yeasaying” 
(as opposed to “naysaying”) tendencies 
(Mischel, 1961a). The preference for DelR- 
ImR distinction may thus be thought of as 
delineating two empirically-elaborated clusters, 
associated with significant differences in 
“maturity,” responsibility, delay over time, 
long-term goal direction, and autonomy. 
Singer and his associates (e.g., Singer, 
Wilensky, & McCraven, 1956) have investi- 
gated “delaying capacity,” primarily as 
inferred from the frequency of human move- 
ment (M) in Rorschach responses, and have 
pointed to relevant correlates of delaying 
which overlap to some degree with those 
just indicated. Conceptually, the empirical 
correlates of preference for DelR and ImR 
are related to Freud’s (1922) distinction 
between functioning on the pleasure principle 
as opposed to the reality principle and are 
relevant to Mowrer and Ullman’s (1945) 
position. The latter maintain that the inability 
to delay gratification is an important factor 
in immature, neurotic, and criminal behavior. 
The data on preference for DelR or ImR 
from the previous studies in this program all 
come from the same general culture area 
outside the United States and are based on 
West Indian children’s responses, tested in 
comparable settings and with techniques 
similar to those used in the present study. 
Thus a network of correlates associated wi! 
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the two reinforcement preference patterns 
with respect to delay has begun to be estab- 
lished. The findings of this study on choice 
behavior of father-absent and father-present 
children become meaningful in that context. 
In an earlier exploratory study (Mischel, 1958), 
using a small sample of Trinidadian rural 
children aged 7-9, a significant relationship 
was found between father-absence and pref- 
erence for ImR. Further, Negro subjects were 
found to show greater ImR preference when 
compared with East Indian subjects in the 
same cultural setting, the differences between 
the two groups seemingly being due to greater 
father-absence in the Negro sample. The 
present study is in part directed at replicating 
this relationship and testing its generality by 
sampling male and female children from a 
large variety of settings. 

More important, however, is the attempt to 
test also the effect of father-absence on older 
children. Thus far investigations on the 
effects of father-absence (Bach, 1946; Lynn 
& Sawrey, 1959; Sears, Pintler, & Sears, 
1946; Tiller, 1958) have all been based on 
samples of children under age 10. Lynn and 
Sawrey (1959) summarized these findings. 
Their own research included hypotheses of 
greater immaturity and poorer peer adjust- 
ment on the part of father-absent boys as 
compared to father-absent girls and a control 
group of father-present boys. These (and 
other) hypotheses were generally supported 
by data based on Norwegian boys and girls, 
ranging in age from 8 years to 9 years 6 months. 
The present study offers data for this age 
group (ages 8 and 9) as well as for a second, 
older group, aged 11-14. Lynn and Sawrey 
rightly point out that the findings of the 
earlier studies are limited by the fact that 
the father-absence with which they were 
concerned was a temporary state of affairs 
during wartime. In contrast, Lynn and 
Sawrey’s subjects came from Norwegian 
sailor families where the father is often 
away from home for periods of 2 years or 
more with only infrequent visits. These 
authors recognize that “unknown cultural 
factors specific to the samples may be operating 
instead of or in addition to the variable of 
father-absence. Women who marry sailors 
may, for example, be such a select group that 
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the variable ‘sailor wife’ may be the crucial 
one” (p. 259). The present data on father- 
absence from the Caribbean have the addi- 
tional advantage of coming from a culture in 
which father-absence is a common and widely 
accepted pattern. Common-law marriage is 
extremely popular and according to some 
estimates a permanent father or father-figure 
is prolongedly absent in over one-third of all 
households. 

The general hypothesis here expects an 
inverse relationship between father-absence 
and preference for DelR. Father-absent 
children should show relatively less preference 
for DelR and relatively more preference for 
ImR in choice situations, when compared 
with comparable father-present children. 
This relationship is expected to reflect the 
relatively greater immaturity and other 
correlates of preference for ImR which, it is 
anticipated, should characterize the father- 
absent group more than the father-present 
group. 

The second aim of the study is to test 
experimentally an anthropological hypothesis 
concerning gross personality (or “national 
character”) differences with respect to prefer- 
ence for DelR or ImR existing between the 
populations of two islands, Trinidad and 
Grenada, both in the southern Caribbean. 
The two islands may be thought of as sub- 
cultures within the same larger culture of the 
West Indies. The significance of this second 
aim lies in the application of quantitative, 
experimental techniques for testing a cross- 
cultural hypothesis, in this case a hypothesis 
dealing with an important psychological 
dimension that has been used frequently to 
make global but largely untested characteri- 
zations of groups and societies. This requires 
some explication of the cultural contexts 
within which the anthropological hypothesis 
was formulated. 

On the basis of anthropological observation, 
a major personality difference between the 
Negro groups of the two islands was noted. 
This difference, as expressed by many inform- 
ants with experience in both subcultures, is 
that the Trinidadian tends to be more impul- 
sive, indulges himself, and settles for relatively 
little if he can get it right away. He is described 
as not working or waiting for larger rewards 
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in the future, but tending instead to prefer 
immediately available relatively trivial grati- 
fications. In contrast to this, the Grenadian, 
although on the whole poorer, is said to be 
more willing and able to postpone immediate 
gains and gratifications for the sake of larger 
rewards and returns in the future. These 
observed personality or “national character” 
differences are supported by the further 
observations that the Trinidadian’s savings 
(e.g., for the education of children) tend to be 
less than the Grenadian’s of equally poor or 
even poorer circumstances. 

Such differences between the islands may 
be related historically to the fact that the 
Grenadian, in contrast to the Trinidadian, has 
tended to hold on to his own land from genera- 
tion to generation, and has been able to 
develop a much more autonomous role with 
respect to dependence on non-Negroes and 
colonials in government, in the professions, 
and in all forms of business. 

The observed distinction between Trini- 
dadians and Grenadians may be thought of as 
reflecting differences with respect to preference 
for ImR as opposed to DelR in choice situ- 
ations, with the Trinidadian preferring the 
former more and the latter less than the 
Grenadian. This difference between individuals 
from the two islands may be conceptualized, 
at least in part, as related to differences in 
“trust” experiences with respect to the 
actual anticipated occurrence of delayed 
reward. 

Indeed, in accord with Rotter’s (1954) 
social learning theory we have conceptualized 
earlier (Mischel, 1958) that the person's 
choice of an immediate, smaller or delayed, 
larger reward in a choice situation is, at least 
to some extent, a function of his expectation 
that the reward will actually occur and the 
reward value, or preference value, of the 
particular reinforcement. Mahrer (1956) has 
already shown that the person’s expectancy 
that reinforcement would follow from the 
social agent making the promise, even after 
time delay, is an important variable in such 


3 The distinction in this respect between Trinidadians 
and Grenadians has been made previously to differen- 
tiate the Negro and East Indian subgroups within 
Trinidad (Mischel, 1958), and will be referred to again, 
with replicating data, later in this paper. 
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choice behavior. In common sense terms, this 
kind of behavior may be thought of as “trust” 
or the belief that the agent promising delayed 
reinforcement will actually supply it. 

The Grenadian may be thought of as having 
had a longer history of actual reward occur- 
rence in delay situations (e.g., in the form of 
long-term gratifications accruing from land 
ownership) and as living in a cultural situation 
in which there is a good deal of trust with 
respect to promise-keeping and with respect 
to other means through which reward delay 
may be mediated, (i.e, long-term "payoff"), 
Certainly direct observation suggests the 
Grenadian to be far less suspicious and 
skeptical in his relations to strangers as well 
as to his own peers than is the Trinidadian. 
In sum, the former may be thought of as 
having long experiences in a relatively stable 
"delayed reward culture" (at least in its 
economic aspects) within which he himself has 
had a relatively autonomous and "trusted" 
role; in contrast, the Trinidadian may be 
thought of as participating in a relatively 
"immediate reward culture" within which 
he has had a highly dependent and “untrusted” 
role. These differences should, in turn, be 
reflected in the expectations with respect to 
reward delay transmitted to the children 
within the two cultures. The prediction follows 
that there would be greater preference for 
ImR than DelR on the part of Trinidadian as 
compared to Grenadian children of compara- 
ble age and socioeconomic background. 

Lastly, a hypothesis concerning differences 
between the Negro and East Indian popula- 
tions of the same island, Trinidad, on the 
same dimension of preference for ImR or 
DelR is tested. Ethnographic observation 
suggested differences between these two 
intra-island groups similar to those noted 
between Trinidadians and Grenadians (where 
reference is made only to the Negro popula- 
tions of both islands). Namely, when compared 
to Trinidadian Negroes, the East Indian 
group of Trinidad appears to show clearly 
greater preference for DelR in daily behavior. 
As previously indicated, this observation 
has been made and supported in earlier 
research (Mischel, 1958). Here, an attempt 
is made to replicate these findings, taking 
into account any possible differences in extent 
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of father-absence in the samples tested.* This 
also permits comparisons of intra-island 
differences between large groups on the 
dimension of concern. 


METHOD 


Subjects 


A total of 68 Trinidadian Negro children (30 boys and 
38 girls), and of 69 Grenadian Negro children (36 boys 
and 33 girls), all in theage group 8 through 9 were tested. 
The Trinidadian subjects came from two schools, both 
on the outskirts of the capital city. The Grenadian sub- 
jects similarly came from two schools, both located on 
the outskirts of Grenada’s capital city. Care was taken 
to select all schools so that subjects of roughly compa- 
rable socioeconomic backgrounds (all lower middle and 
lower class) would be obtained. This selection was 
guided by impressions formed on the basis of inspection 
of a variety of schools, participant observation in both 
cultures, and the counsel of local school officials. In- 
spection of the father's occupation reported by father- 
present subjects in all samples reveals no striking 
differences. Over 90% of the responses seemed classi- 
fiable as “unskilled workers.” 

Further data were collected on two samples of Trini- 
dadian East Indian subjects aged 8 through 9 from the 
same Trinidadian schools. The first sample consisted of 
22 boys and 15 girls; the second of 20 boys and 23 girls. 
In addition to these young subjects, similar data were 
collected from a total of 112 older Trinidadian Negro 
subjects (68 boys and 44 girls) aged 11 through 14 at 
one of the Trinidadian schools indicated above, and on 
75 Negro subjects (40 boys and 35 girls) in the same age 
range, coming from the second Trinidadian school. 


Task and Measures 


The reinforcements used with the younger group 
were selected through pretesting with a random sample 
of 20 boys and girls, aged 8 and 9, tested at another 
government school on the outskirts of the capital city in 
Trinidad. In these pre-experimental sessions subjects 
were seen in individual sessions and their preferences 
for specific reinforcements, all candy, were elicited. As 
a result, two candy reinforcements, varying markedly 


‘With respect to household structure in the three 
samples studied, the following tentative observations 
were made. Trinidad and Grenadaare alike in the house- 
hold structure of lower class and lower middle class 
Negro families. In both islands common-law marriage is 
extensive and accepted, fathers are frequently pro- 
longedly or permanently absent, and the mother-child 
relationship is the family nucleus. The child usually 
knows who his natural father is or was, although the 
mother's partner may change at irregular intervals. In 
contrast, in East Indian Trinidadian families the father 
tends to be permanent and common-law marriage is 
rare, 

5 The 112 older subjects were aso administered other 
measures, the results of which have been previously re- 
ported (Mischel, 19612). 


in price, size, and packaging (ie., a $.02 and a $.10 
candy bar) were selected. These met the desired re- 
quirements inasmuch as the larger reinforcement was 
uniformly preferred in a straight choice situation 
(“which one of these two would you like to take?”), but 
when the choice was “you can have this one (the 
smaller) today or this one (the larger) in one week,” 
approximately 50% of the group chose the former and 
approximately 50% the latter. The purpose of this pro- 
cedure was to select a reinforcement pair which would, 
as closely as possible, dichotomize the choices of subjects 
in one of the two ethnic groups being compared supply- 
ing an approximate 50-50 split in the Trinidadian sam- 
ple which could be used as the criterion for comparison 
with the Grenadian data. 

With the older subjects, a different reinforcement 
pair, namely, a $.10 and a $.25 candy bar were used on 
the basis of the procedure reported earlier (Mischel, 
1961b). In addition, two verbal items, to which the sub- 
ject was to respond with “yes” or "no," one inserted 
near the beginning of the total procedure and the other 
in the middle, were included with the older subjects. 
The items were: 

1. I would rather get ten dollars right now than have 

to wait a whole month and get thirty dollars then. 

2. Y would rather wait to get a much larger gift much 

later than get a smaller one now. 

The behavioral choice (candy) of preference for ImR 
as opposed to DelR was given as the last item of the 
total battery. Previous research (Mischel, 1961b) has 
shown that the use of these three measures in combina- 
tion is more fruitful than the use of single items. 

Although there are no formal data available on the 
comparative subjective value of the candy bars in the 
two cultures to be compared, direct observation, con- 
firmed by those with intimate knowledge of both cul- 
tures, indicated that in both Grenada and Trinidad the 
particular candies used were of similar appeal and avail- 
ability. In both cultures such candies are sold in vir- 
tually every corner store and are popular with, and com- 
monly consumed by, middle and lower class children 
alike. Candy allowances in the two cultures appear to 
be essentially the same. 4 

The measure of father-absence or presence consisted 
of the response ("yes" or *tno?) to the question: “Does 
your father live at home with you?" Twenty-eight out 
of 30 such responses, randomly selected, were in agree- 
ment with the child's mother's response to the question, 
“Does his (specifying the child's name) father live at 
home with you?" asked in subsequent interviews with 
mothers of subjects in the older Trinidadian sample. 


Procedure 

The same experimenter was used in all testing. He 
was introduced as an American from a college in the 
"United States interested in gathering information on the 
children in the various schools of the island. To help 
with this, subjects were asked to answer a number of 
simple questions, e.g., name, age. The details of this pro- 
cedure have been described previously (Mischel, 1958). 

Upon completion of the questions, the experimenter 
expressed his wish to thank the group for their coopera- 
tion. He displayed the two kinds of reinforcement and 
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TABLE 1 


NUMBER or FarHER-PREsENT (Fa+-) AND FATHER-AssENT (Fa—) SUBJECTS CHOOSING IMMEDIATE, SMALLER 


OR DELAYED, LARGER REINFORCEMENT* 


Grenada (Negro) 


Trinidad (Negro) 


School 1 School 2 Totalb School 3 School 4 Total® 
Reinforcement Preference Fat Fa— Fa} Fa— Fat Fa— Fat Fa— Fa+ Fa— Fa+ Fa- 
Immediate, smaller 5 4 4 4 9 8 11 10 8 7 19 17 
Delayed, larger 16 6 26 4 4 10 16 6 10 0 26 6 


Note.—Grenada versus Trinidad X Reinforcement Preference: X? = 10.40, p < .01 (corrected for continuity). 


® Subjects are aged 8 and 9. 


b 42 = 5.15, .05 > P .02; x! = 3,80, 10 > p > .05, if corrected for continuity. 


© x? = 4.93, .05 > p > .02 (corrected for continuity). 


said: “I would like to give each of you a piece of candy, 
but I don’t have enough of these (indicating the larger, 
more preferred reinforcement) with me today. So you 
can either get this one (indicating the smaller, less pre- 
ferred reinforcement) right now, today, or, if you want 
to, you can wait for this one (indicating) which I will 
bring back next Friday (one week delay interval).” 
These instructions were repeated in rephrased form to 
insure clarity and both reinforcements were carefully 
displayed. It was stressed that getting the (smaller) 
candy today precluded getting the (larger) one next 
week and vice versa. Subjects were asked to indicate 
their choice by writing T (today) or F (Friday, next 
week) on their questionnaires, The response made here 
was the measure of preference for ImR or DelR. Sub- 
jects were seated sufficiently far apart from each other 
to insure reasonably that their choices were made inde- 
pendently, All testing was done in group settings, and 
conducted on the same weekday (but in different weeks) 
and at similar times of day, within each of the schools 
in large classrooms. 


RESULTS AND DISCUSSION 


The data from the present study were 
analyzed in terms of differential preference 
for ImR as opposed to DelR in relationship 
to cultural group (Trinidad as compared to 
Grenada), and in relationship to presence or 
absence of the father within the home in 
each culture. In addition, the relationship 
between the experimental choice of preference 
for ImR as opposed to DelR and presence or 
absence of the father in the home was examined. 
in data from the group of older Trinidadian 
subjects, for whom a Grenadian comparison 
group is not available. All statistical analyses 
were made with the chi square test. 

Careful examination of possible sex differ- 
ences with respect to either extent of father- 
absence or preference on the experimental 
choice, within each of the subsamples and in 
combination, revealed no differences approach- 


ing statistical significance. Indeed, no con- 
sistent sex trends were observed in examining 
all reported relationships separately for each 
sex. Consequently, the male and female data 
were combined for all subsequent statistical 
analyses. The similar choice preferences with 
respect to delay of father-absent boys and 
girls in this Caribbean sample is of interest in 
view of Lynn and Sawrey's (1959) Norwegian 
data, in which differential effects of father- 
absence on the sexes were found. 

The data used for relating the experimental 
choice with cultural group are given in Table 
1, which shows the number of Negro subjects 
(ages 8 through 9) within each of the schools, 
in Trinidad as compared to in Grenada, 
preferring ImR or DelR. Comparison of the 
number of subjects preferring ImR or DelR 
within each of the two cultures (i.e., comparing 
School 1 with School 2 in Grenada and School 
3 with School 4 in Trinidad) results in no 
chi square which reaches significant prob- 
ability levels, and consequently the data from 
the two schools within each culture can be 
combined for further comparisons. This 
combination, namely, a comparison of the 
total number of Negro Trinidadian subjects 
preferring ImR or DelR with the total number 
of Negro Grenadian subjects preferring ImR 
or DelR, results in a chi square (using Yates 
correction for continuity) of 10.40, with $ 
<.01. As predicted by the cultural hypothesis, 
a significantly larger proportion of the Trini- 
dadian subjects, as compared to the Grenadian 
subjects, chose ImR rather than DelR. 

The data used for relating father-absence 
and the experimental choice, in Trinidad and 
in Grenada, are also presented in Table 1. 
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The chi square test applied to the relevant 
data yields uncorrected chi square values of 
6.14 for Trinidad and of 5.15 for Grenada. 
These values are significant (p <.02 and 
p «05 respectively) and indicate a greater 
proportion of father-absent subjects, as 
compared to father-present subjects choose 
ImR rather than DelR in each culture. If 
extreme caution is used and the Yates correc- 
tion is employed on all fourfold contingency 
tables (in spite of the fact that all expected 
and marginal frequencies are relatively large) 
the chi square value for Trinidad becomes 
4.93 (p <.05) and for Grenada 3.80. A value 
of 3.84 is required for p <.05 when df = 1. 

In view of this support for the relationship 
between father-absence and preference for 
ImR, it becomes important to examine the 
differences between the two cultures with 
respect to degree of father-absence. Table 1 
shows the number of father-absent and father- 
present subjects within each of the samples 
from the two cultures. Among the Grenadians, 
18 of the 69 subjects reported father-absence; 
among the Trinidadians, 23 of the 68 subjects 
reported father-absence. Application of the 
chi square test to these data results in no p 
values approaching statistical significance, and 
suggests that the difference between the two 
cultures on the experimental choice cannot be 
attributed to differential father-absence, al- 
though within each culture father-absence is 

, related to the choice behavior. 

This finding is of special interest in view of 
the finding obtained earlier (Mischel, 1958) 
in a comparison of Negro and East Indian 
subjects, both within the Trinidadian culture, 
on preference for delayed reinforcement. 
Although the two Trinidadian subgroups 
differed significantly in the choice behavior 
(the Negro subjects more frequently preferring 
the immediate, smaller reinforcement), the 
differences appeared attributable to differences 
in father-absence (the Indian subjects being 
characterized by more father-present house- 
holds). The present data permit an attempt 
to replicate this relationship. The new data 
for Trinidadian East Indian subjects aged 
8 through 9 were as follows. Among the 
Indians, 26 chose the ImR and 54 the DelR. 
Only 4 subjects in the Indian group were 
father-absent, 2 of these choosing ImR and 
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2 DelR. Comparing the Indians with the 
Trinidadian Negro subjects on preference for 
ImR as opposed to DelR results in a corrected 
chi square of 5.50 (p <.02). As was found 
earlier, the East Indian group again shows 
greater preference for DelR as opposed to 
TmR relative to the preferences of the Trinida- 
dian Negro group. Note that there are no 
significant differences in reinforcement prefer- 
ence with respect to delay between the 
Trinidadian Indian group and the Grenadian 
group, although the latter shows significantly 
greater father-absence (chi square = 11.47, 
df = 1, p <.001). Comparison of the Trini- 
dadian Indian group with the Trinidadian 
Negro group on extent of father-absence 
results in a corrected chi square of 18.5 (df = 
1, p <.01); clearly the latter show significantly 
greater father-absence than the former. If 
only father-present subjects from the two 
groups (Trinidadian Indian and Trinidadian 
Negro) are compared on preference for ImR 
or DelR, no significant relationship holds. 
Tn contrast, comparison of only father-present 
Trinidadian Negros with Grenadian father- 
present subjects on reinforcement preference 
still yields a corrected chi square of 5.85 
(p <.02), the Grenadians showing greater 
DelR preference. Thus the intra-island 
observed differences between the two Trini- 
dadian groups appear to be largely related to 
differential father-absence between the two 
groups. However, the Trinidadian-Grenadian 
Negro differences with respect to reinforcement 
preference cannot be attributed to differential 
father-absence. 

Summarizing the results thus far, the 
following relationships, condensed in Table 2, 
should be noted. Grenadian Negro children 
showed significantly greater preference for 
DelR as opposed to ImR when compared with 
Trinidadian Negro subjects. This difference 
cannot be attributed to differential father- 
absence since the two groups show no signifi- 
cant difference with respect to that variable, 
and since the difference in DelR-ImR prefer- 
ence holds even when only father-present 
subjects from the two islands are compared. 
Tn contrast, comparisons between Trinidadian 
Negro subjects and Trinidadian East Indian 
subjects reveal that the difference between 
these two, namely, the relatively greater 
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TABLE 2 
Summary or RELATIONSHIPS BETWEEN ETHNIC 
Grovp,* SUBJECTS WITH FATHER-PRESENT (Fa-+) 
OR FATHER-ABSENT (Fa—), AND PREFERENCE 
FOR DELAYED, LARGER (DelR) As OPPOSED TO 
IMMEDIATE, SMALLER (ImR) REWARDS 


Comparison etes dien 
Trinidadian Negro vs. Grena- | Grenadian | <.01 
dian Negro (Fa+ and Fa—) 
Trinidadian Negro vs. Grena- | Grenadian | <.02 
dian Negro (Fa+- only) 
Trinidadian Negro vs. Grena- — ns 
dian Negro (Fa— only) 
AENEIS Negro Fa+ vs.| Fa4- «.05 
Fa— 
Grenadian Negro Fa+ vs. Fa— | Fa+ «.07 
Trinidadian East Indian vs, | East Indian} <.02 
ie anes Negro (Fa-+ and 
a 
Trinidadian East Indian vs. — ns 
Trinidadian Negro (Fa+ 
only) 
Trinidadian East Indian vs. — ns 
Pan Negro (Fa+ and 
a 
Trinidadian East Indian vs. — ns 
Grenadian Negro (Fa+ only) 


Note.—Comparisons with East Indian subjects on father-ab- 
sence in relation to other variables cannot be made in view of the 
small N (4) of such father-absent subjects. Trinidadian Negro sub- 
jects as well as Grenadian Negroes, show significantly greater 
father-absence (p < .01) than Trinidadian East Indian subjects, 
but the difference between the two Negro groups in extent of 
father-absence is ns. 

“Using Trinidadian Negro, Trinidadian East Indian, and 
Grenadian Negro subjects, all aged 8 and 9. 

b Based on chi square comparisons, corrected for continuity; 
ns = p value > .10. 


. DelR preference on the part of the East 
Indian group, does appear attributable to 
differential father-absence. This seems the 
case since the Trinidadian Negro subjects 
show significantly greater father-absence than 
do the East Indian subjects, and since the 
latter’s relatively greater DelR preference 
does not hold when this differential father- 
absence is taken into account, ie, when 
comparisons are based only on father-present 
subjects. Also note that Negro subjects from 
Trinidad and Grenada do not differ in their 
choice behavior if only father-absent Negro 
subjects from the two islands are compared 
(see data in Table 1). The small N of father- 
absent East Indian subjects prevents their 
being compared with other father-absent 
groups. Finally, the Trinidadian East Indian 
group and the Grenadian Negro group are 
not significantly different in their reinforce- 
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ment preferences, although the latter show 
significantly greater father-absence. 

The present results thus suggest that the 
difference between the Trinidadian and 
Grenadian subjects on the experimental | 
choice must be sought in sources other than 
family constellation with respect to father- 
absence or father-presence. This illustrates a 
situation commonly encountered in cross- | 
cultural research. In spite of overlapping 


rearing conditions, not merely in terms of the i 


father in the family constellation but also in 


terms of numerous patterns that link the two. 
cultures and make them homogeneous in | 


many ways, other unspecified sources of 
variance must account for gross observed | 
differences, in this case differences in the 
delay of gratification patterns. The writer is 
inclined to focus on some of the cultural and 
economic differences, especially differences in 
autonomy and independence, as indicated 
earlier. The lack of a significant difference 


between father-absent Trinidadian and Grena- 


dian Negro subjects in preference for DelR 
as opposed to ImR (Table 2) also suggests the 
possible argument that when fathers are 


absent differences in cultural values are less | 
likely to be internalized by the children, ` 


since fathers may be needed to transmit such 
cultural values. i 
We turn next to the data on the relationship 
between father-absence and reinforcement 
preference obtained from the 112 older (ages 
11 through 14) Trinidadian subjects. Here, 
as previously described, three measures of 
reinforcement preference with respect to 
delay were available. Application of the chi 
square test resulted in no p value less than 30. 
This lack of relationship was obtained in 
comparisons using the candy choice alone, a$ 
well as all possible combinations of preferences 
over the three measures. To inquire further 
into this failure of the anticipated relationship 
ior older subjects, the same experimental 
procedure was repeated on another sample of 
75 Trinidadian subjects of the same age 
Tange, tested in another but highly similar 
Government school, comparably located, and 
with subjects of similar socioeconomic an 
ethnic backgrounds. Again, no relationship 
between the experimental choices and pres 
ence or absence of the father in the home was 
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obtained that reached or approached signifi- 
cant probability levels. Thus although the 
expected relationship was repeatedly obtained 
at the younger age levels, the null hypothesis 
cannot be rejected for seemingly similar sub- 
jects at older age levels. 

The possibility exists that the instructions 
for the experimental choice, specifically the 
phrase “but I don’t have enough of these 
(DelR) with me today,” may have partially 
confounded the results by producing a conflict 
in older subjects, eager for the DelR yet 
reluctant to make the experimenter return. 
However, this seems extremely unlikely. First, 
during the administration, that phrase, used 
only as an introduction to make the choice 
plausible, was probably overshadowed by the 
frequent repetitions of the actual choice 
conflict and the implication that the experi- 
menter would definitely return next week to 
hand out the DelR to those in the group who 
chose it. Second, and more definitive, in over 
20 pre-experimental sessions with other 
comparable subjects from the same culture, 
used to select the particular reinforcements 
sets, postchoice inquiries into reasons for the 
choices yielded nothing remotely suggestive 
of such an “obliging response." 

It is tempting to argue that for older 
subjects the measures of preference for ImR 
as opposed to DelR is meaningless or invalid, 
and consequently relations with father- 
absence should not be expected. It must be 
recalled, however, that the same measures 
used with Caribbean subjects of this age, and 
indeed including subjects from the present 
older sample, have yielded significant correla- 
tions with conceptually relevant variables 
(social responsibility, need for achievement, 
nonacquiescence), all of the above being 
positively correlated with preference for 
DelR (Mischel, 1961a, 1961b). In this context, 
it should also be noted that no relationships 
approaching statistical significance were ob- 
tained in attempts to relate father-absence 
with these correlates of preference for DelR. 
This suggests that father-absence at older 
age levels may not have some of the effects 
(e.g., “immaturity,” poorer adjustment) ap- 
parently associated with it at the earlier age 
levels (e.g, Lynn & Sawrey, 1959). The 
question cannot be answered at present in 
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view of the lack of research on father-absence 
with older children. Of course, the present 
data are seriously hampered in the gross 
measure of father-absence or presence used, 
which gives no information on the extent or 
duration of the father’s absence or presence. 
Nevertheless, the same gross measure did 
yield the expected results with the younger 
subjects, and excellent agreement between 
older subjects’ reports of father-absence and 
mother’s reports on the same question was 
obtained. 

The post hoc interpretation being suggested 
for these findings is that as the individual 
develops and matures, he begins to participate 
in an environment that extends beyond the 
immediate family and consequently his 
expectations with respect to promise-keeping, 
his “trust,” and his choice behavior with 
respect to delay of gratification become 
contingent upon numerous factors and experi- 
ences other than those within the household 
itself. Similarly, at least in southern Caribbean 
cultures, sources of influence outside the 
immediate family may become important 
determinants of other variables correlated with 
ImR-DelR preferences, such as “social re- 
sponsibility” and need for achievement, at 
relatively early periods in the person's life.® 

Although this interpretation is clearly 
tentative, pending further and more controlled 
research, the data illustrate the complexity of 
some of the conditions that may influence 
such choice behavior. In spite of these com- 
plexities, significant support was obtained 
for the anthropological hypothesis concerning 
cross-cultural differences between Trinidad 
and Grenada with respect to delay of gratifica- 
tion. Earlier findings of differences with respect 
to ethnic groups on the same variable were 
replicated. It is recognized, of course, that the 
findings are necessarily limited by sampling 
problems which make any generalizations 


6 Tt should be noted that in the cultures studied, the 
individual becomes independent of the family and 
autonomous, e.g., vocationally, relatively early in life. 
For example, official “student-teachers” in public 
schools are often 13 years old. In all three samples, the 
child from age 10 or 11 on, becomes increasingly free to 
move about, visiting with other relatives (eg. uncles, 
grandparents, etc) who may or may not be living in 
the immediate neighborhood, and whom he may visit 
for a day or longer periods. 
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tentative. Nevertheless, the findings are 
construed as illustrative of the possibilities of 
applying quantitative simple experimental 
techniques for testing cross-cultural differences 
on important psychological dimensions. 


SUMMARY 


This study tested an anthropological 
hypothesis concerning differences in preference 
for immediate, smaller (ImR) as opposed to 
delayed, larger reinforcement (DelR) between 
two subcultures (Trinidad and Grenada) 
within the West Indies. The data, in the form 
of choice preferences for ImR or DelR elicited 
from children in both islands, supported the 
cultural hypothesis. Further, a significant 
relationship between absence of the father 
within the home and greater preference for 
ImR as compared to DelR was found in both 
cultures, using subjects aged 8-9. However, 
this relationship was not found for older 
subjects (aged 11-14). These results were 
discussed in terms of increased sources of 
variance relevant to the formation of trust 
behavior and expectancies outside the house- 
hold with increasing age. Data replicating 
previously obtained differences with respect 
to preference for ImR as opposed to DelR 
between two ethnic groups (Indian and Negro) 
within the island of Trinidad were also 
presented. 
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SOME PERSONALITY CORRELATES OF CONCEPT CHANGE 
THROUGH ROLE PLAYING: 


0. J. HARVEY 
University of Colorado 


Hrs study was concerned with the 

interactional effects of three variables 

on opinion change: role playing, authori- 
tarianism, and status of the source of a role 
relevant communication. Persons differing in 
authoritarianism were requested to present 
the best arguments they could muster in 
support of a position that conflicted with their 
strong personal beliefs. 

That playing a role—acting “as if"—can 
alter behavior and attitudes has been recog- 
nized by many, not least among whom have 
been William James in his theory of emotion, 
Cooley in his concept of the looking-glass 
self, and G. H. Mead in his discussion of the 
value of taking the role of the "significant 
other. Experimental evidence has also 
begun to identify some of the conditions 
under which role playing modifies the player's 
opinion in the direction espoused in the role 
(eg., Janis & King, 1954; King & Janis, 
1956; Scott, 1957). 

Various theoretical positions are doubtless 
consonant with the effects of role playing: 
especially ones that envisage a direct and 
more or less causal connection (even synonym- 
ity) between such internal structures as 
attitudes, on the one hand, and behavior, on 
the other (e.g, Doob, 1947). The present 
study, however, arises from theoretical 
considerations of a cognitive sort that have 
found validation in research on the effects of 
labeling (e.g., Bartlett, 1932; Carmichael, 
‘Hogan, & Walter, 1932; Gibson, 1929; Hallo- 
well, 1951). From the assignment of a label to 
an object or class of events stems a tendency 
to attribute characteristics to the object that 
are consistent with the label even though 


l'This study was carried out under support of the 
Group Psychology Branch, Office of Naval Research, 
Contract Nonr 2149(02), with Vanderbilt University, 
while both authors were at Vanderbilt. Reproduction of 
this study in whole or part is permitted for any purposes 
of the United States Government. 

? Thanks are extended to ONR for its support, to the 
students and instructor who participated in the study, 
and to Mike Wertheimer for his editorial suggestions. 
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such attributions may be quite inappropriate 
to the “real nature” of the thing to which the 
name is affixed. Labels and concepts thus 
seem to provide a system of ordering, a mold 
into which impinging events tend to be fitted, 
a series of channels that set the limits and 
direct the course of cognitive outcomes. 
Perceiving, thinking, judging, and related 
activities are profoundly affected by—perhaps 
even wholly dependent upon—a pre-existing 
system of ordering or conceptual placement 
(Smith, Bruner, & White, 1956). And yet this 
very dependence on a system of categories 
leads to a kind of conceptual closedness, 
reflected in a functional blindness to alterna- 
tive evaluations that are not embodied in the 
conceptual framework employed at the 
moment. This observation, far from being new, 
is closely akin to the conclusion of Levy-Bruhl 
(1923) that the differences in the cognitive 
outcomes of "primitive" and “civilized” 
man are not attributable to the former’s 
thinking illogically and the latter’s thinking 
logically, as some of his contemporaries argued, 
but are due instead to starting from different 
premises. 

We assumed, as we have elsewhere (Harvey 
& Caldwell, 1959; Harvey & Rutherford, 1958), 
that once a concept has evolved as a facet of 
self it is rather resistant to change; and 
further, that one tends to evaluate a situation 
ina way maximally congruent with the con- 
cept(s) or aspects of the self that are operative 
at the time. Hence concepts may be viewed 
as varying on a dimension of openness- 
closedness, with the more closed state resulting 
from concepts or opinions of high ego involve- 
ment or centrality. Such closedness may be 
perpetuated through a variety of mechanisms: 
contrast, failure to perceive an incongruity, 
discrediting the source of the incongruity, as 
well as by other modes of distortion (Harvey, 
1959; Harvey, Kelley, & Shapiro, 1957). In 
the present study playing a role under condi- 
tions of minimal threat was viewed as one 
means by which a person might be induced 
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to accept alternatives generally excluded 
from his fairly closed system. 

The effect of role playing on inducing 
acceptance of alien alternatives probably 
depends on many personality variables, 
including those which predispose the person 
toward intolerance of ambiguity, a bifurcated 
either-or view of the world, and a need for 
highly specific and concrete categories for 
ordering situations. Authoritarianism appears 
to be such a cognitive variable with which 
role playing might be expected to interact. 
The anticipated nature of the interaction will 
become more apparent when role playing and 
authoritarianism are considered in relation to 
the third independent variable of this study, 
the status of the source of the pro-alcohol 
communication to which subjects were exposed 
prior to the presentation of their own pro- 
alcohol arguments. 

Authoritarianism appears to dispose toward 
very different behaviors in relation to persons 
high or low in status: toward deference to and 
high influence by those high in status and 
toward opposite reactions in relation to those 
low in status (Adorno, Frenkel-Brunswik, 
Levinson, & Sanford, 1950; Crutchfield, 1955; 
Harvey & Caldwell, 1959; Roberts & Jessor, 
1958; Thibaut & Riecken, 1955; Wagman, 
1955). Characterized by a poorly differen- 
tiated and integrated conceptual system or 
self-structure, which minimizes the self as a 
causal agent in transaction with the environ- 
ment, the more authoritarian person is more 
dependent on external criteria in defining his 
reality and evaluating his surroundings. That 
is, the referents to which the more authori- 
tarian person relates in the formation of his 
self are more in terms of such external scurces 
of authority as power, convention, and large- 
scale organizations, to all of which he seems 
to attribute connotations of omnipotence and 
omniscience.? 

The experimental treatments of the three 
independent variables provided a 2 X 2 x 2 
factorial design: role-no role, high-low source 
status, and high-low authoritarianism. The 
following predictions arise from the concep- 
tualization of these variables: 


3 The rationale of this assertion, both theoretical and 
empirical, is elaborated in Harvey, Hunt, and Schroder 
(1961). 
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1. Role playing should result in a significant 
positive change in the target opinions. l 
2. Role playing should interact with authori- 
tarianism, with greater positive opinion Change 
in the high than in the low authoritarians]| 
under the Role condition. q 
3. Role playing should interact with status 
of the precommunication source, with greater 
positive change in the High Status condition, | 
4. Status should interact with authoritari | 
anism, so that under the no role treatment 
greater positive change should occur in the. 
high than in the low authoritarians when the | 
status of the source is high, but conversely 
greater change should occur in the low than P 
in the high authoritarians when the status of | 
the source is low. 


METHOD 
Subjects 


The subjects represented four classes of students (a | 
total of 137) who were attending the 1958 summer ses- 
sion of a small Southern college that operates under | 
the auspices of the Church of Christ. This population | 
was selected because of its strong religious opposition to 
the sale and drinking of alcohol, the attitude issue of | 
the study. 4 


Materials 


Subjects’ attitudes toward alcohol before and after. k 
the experimental treatments were assessed by a 27-item 
graphic rating scale, constructed especially for this ] 
study, which requested the subject to indicate his post- 
tion on each of the items by checking at the appropriate 
point on the 16-centimeter line between the extreme 
points of complete agreement and complete disagree- 
ment. Representative items included: moderate drink- 
ing is not against the teachings of the Bible, it would be 
sinful to use in any way revenue from the sale of whisky 
for better schools for our children, and it is impossible 
to prevent some sort of manufacture and sale of whiskey. 
The response to each item was scored in centimeters, 
zero representing the response most favorable to alcohol. 
The measure of the subject’s attitude on the issue was 
the sum of the value in centimeter units of his ratings 
on all 27 items, with the higher total scores representing 
greater opposition to alcohol. 

Authoritarianism was determined by the scale of 
Webster, Sanford, and Freedman (1955), which accom 
panied the first administration of the scale of opinions | 
toward alcohol. Following exposure to a 2,500-word 
speech that favored the sale of alcohol under rigidly 
prescribed conditions, a revision of a communication 4 
employed earlier by Hovland, Harvey, and Sherf | 
(1957), subjects wrote on a standard form what they 
considered to have been the main arguments advanced 
in the fictitious address. This provided a measure of the 
accuracy in reproduction of the contents of the com- - 
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munication, a measure later related to the three inde- 
pendent variables in this study. On the same form 
subjects were requested to present the best reasons they 
themselves could muster in favor of the sale and use of 
alcohol, and to make believe that these pro-alcohol 
arguments were being marshalled for use in a collegiate 
debate. This latter exercise, which placed subjects in the 
position of contradicting their own beliefs under circum- 
stances aimed at production of minimal threat, consti- 
tuted the role playing situation. 

Tn the strictest sense, perhaps the treatment in this 
study should not be termed role playing since, unlike 
the usual role playing procedure in which the subject is 
asked to present arguments to some public audience, 
the present condition had the subjects present their 
self-negating arguments in private. The substitution of 
the private for the more typically public environment 
in which acting of a role is executed resulted from our 
belief that it is the entertainment of an alien premise 
that is the essential factor in the effectiveness of role 
playing in changing opinions, irrespective of whether 
the premise is entertained in private or public. Valida- 
tion of this assumption requires a direct comparison of 
role playing effects under private and public conditions 
that was not undertaken in this study. 


Procedure 


The experiment was carried out in two sessions, a 
week apart. In the first session subjects completed the 
scales of authoritarianism and opinions toward alcohol, 
each of which was administered by the class instructor. 
In the second session subjects again indicated their 
opinions toward alcohol, approximately half of them 
immediately following exposure to the communication 
espousing conditional sale of whiskey (No role condi- 
tion) and the other half after having written their own 
arguments in favor of the sale of alcohol (Role condi- 
tion). In each of the role and no role treatments the 
communication was presented to the subjects as having 
emanated from two different sources, one anonymous 
and the other of a clearly high status. We assumed that 
the anonymous source would have low status for the 
subjects, especially the more authoritarian ones to whom 
source differences were expected to be more important, 
In the High Status condition the communication was 
depicted by the instructor as a speech that had been 
delivered by a Church of Christ preacher to a group in 
Oklahoma City which was concerned with the then 
perennial question of prohibition in Oklahoma. In the 
anonymous or Low Status condition the communica- 
tion was pictured by the instructor as a speech that 
had been made in the same city by an unidentified 
speaker on the same issue and to a similar audience. 
The High-Low status source and Role-No Role varia- 
tions thus provided four experimental conditions, with 
the 137 subjects distributed fairly equally among the 
four cells. (In analysis of the results these four cells 
were subdivided further into high and low authoritari- 
anism to yield an eight-cell factorial design.) 

The class instructor, the same for all subjects, carried 
out the classroom operations of the experiment. He jus- 
tified the request for attitude information by saying 
that he was conducting an opinion survey of several 
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TABLE 1 
MEAN CHANGE OF OPINIONS TOWARD ALCOHOL 


No Role 


Source Authoritarianism Authoritarianism 
Status 

High Low High Low 
High 42.86 32.45 21.41 15.40 
Low 40.75 29.29 9.75 26.67 


colleges on a number of issues. He explained the presen- 
tation of the communication to subjects as a means of 
providing them with information on what certain other 
people who also were directly concerned with the issue 
of alcohol thought about the problem. Dittoed copies 
of the speech, presented in authentic-appearing format, 
were passed out by the instructor to three or four sub- 
jects who were invited to share them with other students 
in order to allay possible doubt of their authorship. 


RESULTS 
Opinion Change 

The measure of opinion change on the 
issue of alcohol was the total difference in 
centimeter units between the subject’s pre- 
and post-experimental responses on the 27- 
item scale. The means of these changes for 
the eight experimental conditions are presented 
in Table 1. Comparison of the appropriate 
means by the ¢ test for independent samples 
yielded the following results relative to the 
hypotheses: 

1. Role playing, considered across all 
conditions, had a significantly greater positive 
effect on opinion change than did no role 
playing (X difference = 20.55; ¢ = 2.675; f, 
135 df, <.01). 

2. Role playing interacted with authori- 
tarianism, tending to effect greater positive 
change in the High than in the Low Authori- 
tarianism condition (X difference = 10.829; 
t = 1.682; p, 76 df, <.10). 

3. Role playing failed to interact signifi- 
cantly with status of the precommunication 
source (X difference between high and low 
status = 2.63;1 = .281; p, 76 df, >.50). 

4. Status interacted significantly with 
authoritarianism, so that under the no role 
treatment greater positive change occurred in 
the High than in the Low Authoritarian 
condition when the status of the source was 
low (X difference between these two differences 
= 22.93; t = 3.471; p, 57 df, <.01). 

The greatest change occurred in the role 
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playing, high status, and High Authoritarian 
condition (X = 42.86); the least in the No 
Role, Low Status, and High Authoritarian 
condition (K = 9.75), both of which results 
follow from our hypotheses. Neither of these 
changes is significantly different, however, 
from the means of the other seven conditions 
combined, probably as a result of the small 
effects of interaction between role and source, 


Reproduction of the Arguments of the Com- 
munication 

From their summary of the main points of 
the communication, subjects were scored from 
0 to 4, depending on how many of the fr 
main arguments of the speech they reproduced 
correctly. Separate comparisons were made of 
these scores by chi square analyses for the 
three comparisons of Role-No Role, High- 
Low status, and High-Low Authoritarianism 
conditions. The status of the source and 
assuming a role had almost no effect on 
subjects' reproduction of the points of the 
communication (x? = .20 and .45, respec- 
tively). Authoritarianism did, however, affect 
significantly subjects’ dimensionalization of 
the speech, with the more authoritarian 
subjects scoring significantly Wower than the 
less authoritarian subjects in the number of 
the speaker’s arguments they reproduced 
correctly (x? = 6.83, p < 01). 


Perception of the Speaker's Intent 


On the same form and in the same order in 
which subjects summarized the main points of 
the communication and presented arguments 
counter to their own opinions, they also 
wrote out what they considered to have been 
the speaker's motives or personal reasons for 
endorsing the sale of whiskey. A content 
analysis showed no relationship between the 
quality (positive or negative) of the motives 
attributed to the speaker and authoritarianism, 
status of source, or playing of the role. 


Discussion 


The outcome of all of the comparisons was 
in the direction predicted, although in one 
instance the magnitude of the difference was 
significant at only the .10 level and in another 
instance was even less significant. The most 
plausible reason for the lack of sharp con- 
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firmation, we believe, is our only partial success 
in manipulating the source variable. Post- 
experimental discussion with the subjects 
indicated that some of them received the 
impression that their instructor endorsed, at 
least implicitly, the contents of the pro- 
alcohol communication which he presented to 
them as having come from either a Church of 
Christ preacher or from an anonymous 
speaker. The instructor, as we post-experi- 
mentally discovered, was also an ordaintd 
preacher in the Church of Christ. His failure 
to have disavowed actively the contents of’ 
the fictitious speeches apparently led some 
of the subjects to believe that he favored the 
sale of alcohol; at least, that he was mot 
strongly opposed to it. For subjects so believ- 
ing, the anticipated differences between 
attributed sources would be minimized, and | 
the instructor rather than the intended 
persons would serve as the more relevant 
source. Not all source effects were cancelled, 
as the results show. But a better controlled 
manipulation of this variable should be ex- 
pected to have increased its interactional | 
effects predicted, both with role playing and 
with authoritarianism. It also should have led 
to a greater discrediting of the low status 
speaker by the more authoritarian individuals. 

The fact that role playing under the private i 
conditions imposed in this study produced 
significant changes in opinions implies that 
the public performance of a role is not neces- ” 
sary for concept or opinion modification. It q 
Suggests that the effect may be more a function | 
of the extent to which a contradictory premise | 
is maintained than of the environment in | 
Which it is entertained. This is not to imply, 
of course, that environmental factors (public 
vs. private, for example) are irrelevant to the 
extent to which the premise is entertained. 
Indeed it is probable that some persons would 
be led to a more serious and effective considera- 
tion of alien premises under one set of environ- 
mental circumstances while others would be $0 
induced under different conditions. Thus 
persons who are more dependent upon external 
criteria as guidelines to their judgments and 
acts, as the more authoritarian persons tend j 
to be, may well be more influenced by the: 
public than by the private performance of à 
role. 
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f 
SUMMARY AND CONCLUSIONS 


This study was concerned primarily with the 
interactional effects of three variables on 
opinion change: role playing, authoritarianism, 
and status of the source of a role relevant 

- communication. Subjects divided according to 
high and low authoritarianism were requested 
to write in private the best arguments they 
could muster in favor of alcohol, the side of 
the issue that was strongly counter to their 
own beliefs. Immediately prior to playing 
of this role they were exposed to a communi- 
| cation that favored the sale of alcohol under 


narrowly prescribed conditions. The com- 
munication was presented to the subjects as 
. a speech that had been made by either a high 
status or low status (anonymous) individual. 
— Following exposure to the communication, 
E approximately half of the subjects were 
"retested immediately on their attitudes 
toward alcohol (No Role condition) while 
the other half were retested after they had 
written the pro-alcohol arguments (Role 
condition). Subjects were also tested on their 
ability to reproduce the main arguments of 
the communication and on the extent to which 
they attributed negative motives to the 
source of the incongruous communications. 
The findings were: 
1. Role playing, represented by presenta- 
tion of self-negating arguments, had a signifi- 


cant positive effect on opinion change. 
2, Role playing interacted with authori- 
tarianism, with persons higher in authoritari- 


anism being influenced more positively by 
playing the role than were persons lower in 
authoritarianism. 

3. Role playing failed to interact signifi- 
cantly with status of the precommunication 
source. 

4. Status interacted significantly with 
authoritarianism, with the higher authori- 
tarians being more influenced than the lower 
authoritarians by the high status source in the 
No Role condition, and with the reverse 
effects in the No Role, Low Status source 
condition. 

5. Authoritarianism had a significant nega- 
tive effect on reproduction of the main points of 
the negative communication, with higher 

/ authoritarians reproducing accurately signifi- 
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cantly fewer of the points than did the low 
authoritarians. Despite the failure of the 
more authoritarians to get the points of the 
speaker’s arguments, they nevertheless 
changed their opinions more than did the 
less authoritarians in the No Role condition 
when the source of the communication was 
high, although not significantly more so. 
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LOSURE, or the tendency to complete 
what is incomplete and to organize 
what is perceived into “good figures,” 

is central to the gestalt theory of perception. 
In Koffka’s (1935) words, “Tf the unit is ‘open’ 
or ‘incomplete,’ then that part of the field 
which corresponds to the gap will be a seat 
of very particular forces, forces which will 
make the arousal of processes of closure easier 
than the arousal of any others. The closure 
will, of course, be that closure which is 
demanded by the rest of the figure, a closure 
of good continuation." 

Perceptual closure is said to be an autoch- 
thonous process which operates to stabilize 
the perceived external world. The term has 
since been so expanded that Drever’s (1953) 
definition of closure refers to an all-embracing 


. function by means of which “percepts, 
- memories, and actions attain stability." 


The phenomenon of closure has been demon- 


' strated experimentally with normal subjects 


in a number of studies of form perception. 
Gibson (1929) had subjects reproduce two 
series of simple geometrical stimulus forms, 
a number of which contained two to four 
contour gaps, and found that absolute or 
partial completion occurred in 92% of the 
reproductions of the incomplete figures, 
absolute completion occurring in 15% of the 
cases. Hempstead (1901), Tiernan (1938), and 
Siedenfield (1938) also found evidence for 
Closure, the latter two experimenters reporting 
an increase in closure when subjects reproduced 
the stimulus several weeks later. 


*Snyder is a medical student at Georgetown Uni- 
versity who conducted this study at Taylor’s suggestion 
under Rosenthal’s supervision, The study was made 
Possible by a United States Public Health Service 
Fellowship Grant /2M-5908-C7 to Snyder. 

* The authors wish to thank Lois Raphling, who did 
the statistical computations, and F. M. Waldrop and 
Margaret Mercer who made possible the testing of 
Patients at St. Elizabeth’s Hospital. 
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Thurstone (1944), in a factor analytic study 
of perception, proposed the existence of a 
factor which involved the ability to form a 
perceptual closure and hold it against distrac- 
tions; it was associated with the ability to 
manipulate two configurations, a process 
characteristic of reasoning. While probably 
related to the factor operative in the studies 
of simple form perception, it appears that 
Thurstone’s closure factor is more complex 
than that examined by earlier experimenters, 

Taylor (1960) studied closure in individuals 
with extreme social attitudes. All subjects 
tended toward partial completion of incom- 
plete figures; extreme scorers on a social 
attitude scale tended to close gaps in figures 
to a greater extent than those who were not 
extreme in their views, supporting the hy- 
pothesis that extreme liberals and extreme 
conservatives have similar personality char- 
acteristics. 

While there is ample evidence for the occur- 
rence of closure in normal subjects, closure in 
subjects with personality disorders ` has 
apparently been investigated in only two 
studies. 

Street (1931, 1934) administered a gestalt 
completion test to 218 cases of “functional men- 
tal disorder” and 45 general paretics. In the . 
completion test, subjects were shown a series 
of pictures, altered so that parts of the figure 
form the ground. In order to recognize the 
pictures, subjects must complete perceptually 
the structure of each figure. The number of 
correctly completed figures is the individual's 
test score. When the scores of control subjects 
were compared with the scores of the groups 
with mental disorders, significant differences 
were found. The control subjects had a mean 
test score of 7.38, the functional cases, 6.33, 
and the organic cases, 5.27. 

Roseman (1952) tested the hypothesis that 
closure behavior, as measured by a jigsaw 
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pattern test, was a function of personality 
integration. The test of closure consisted of 39 
jigsaw patterns, each made up of two pieces 
which could be fitted together to form a 
closure or nonclosure item. There were 29 
closure items which would form a compact 
geometrical shape when put together, and 10 
nonclosure items. The latter were interspersed 
among the closure items to break any set for 
closure that might develop. Items were pre- 
sented in decreasing order of difficulty, each 
for a maximum of 10 seconds. The subjects 
were 42 male “psychiatric patients” and 23 
male normal controls, instructed to work as 
rapidly as possible to complete the pattern in 
the time provided. Normal subjects closed an 
average of 21.83 figures as opposed to an 
average of 17.86 for the patient group. The 
difference was significant at the 1% level. 
The mean closure time per item and the mean 
time for completing all test items were signifi- 
cantly faster for the normal subjects. The main 
hypothesis was supported using both clini- 
cal status and MMPI scores as measures of 
personality integration. 

Two aspects of the procedure which may 
have accounted for the differences between the 
two groups were the imposition of a time limit 
(since psychiatric patients often perform more 
slowly on various tasks than normal sub- 
jects) and starting with the most difficult 
items (since patients tend to be more readily 
discouraged and alienated by failure). 

The present study attempts to determine 
whether perceptual closure occurs to a less 
extent among schizophrenic than normal 
subjects when time pressure, difficulty level, 
and complexity of figure-ground relationships 
are not confounded in the experimental task, 
and to indicate alternative formulations which 
could account for such a finding. 


PROCEDURE 


Experimental materials. Each subject was given 
Taylor's (1960) modification of his own perceptual 
closure scales. These consist of two test figures and one 
"control" figure. The test figures are line drawings of a 
Boy and a House. In each figure the lines are broken at 
eight points to create eight gaps of 20 millimeters each, 
The Boy is presented first. Below it, parts of the figure 
which do not include the gaps are already reproduced. 
The subject’s task is to copy the figure by reproducing 
the parts containing the eight gaps. Instructions are 
printed on the sheet as follows: “Copy the rest of the 
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| 
boy below just as you see him above.” The arrangement 
and instructions are exactly the same for the House, 
which is presented next. | 

The “control” figure consists of four ticktacktoe's 

n 
n 
of the upper and lower ticktacktoe's, and the left and 
right ticktacktoe's are also 20 millimeters wide. Below 
are reproduced the 16 points at which the lines of the 
ticktacktoe's intersect, and the subject is required 
to reproduce the entire figure as it appears above, us- 
ing the points as guides. 

Subjects. Thirty-six male schizophrenics completed 
the task and 10 did not. Six of the 36 were patients at 
the Clinical Center of the National Institutes of Health, 
30 at St. Elizabeth’s Hospital in Washington, D. C. Sub- 
diagnoses were as follows: 3 hebephrenic, 10 paranoid, - 
13 catatonic, 1 simple, 7 chronic undifferentiated, and 
2 unspecified. Thirty were white and six Negro. Mean l 
age was 35.2 years, range 19-53. Four had less than 
an eighth grade education, 16 were at the high school 
level, and 11 had some college education. Education 


arranged as follows: The gaps between the lines 


level of four subjects was not noted. Nine patients | ' 


were on tranquillizing drugs, 8 of the 9 on a regime of - 
500 mg/day of chlorpromazine. 

Control subjects were obtained from two sources. | 
One group consisted of 14 volunteers who served as | 
normal controls for many studies at NIH. Three others | 
were employees at NIH. The second group consisted 
of 20 medical and surgical patients at a general hospital | 
in Washington, D. C. All subjects were male. Thirty-one 
were white and six Negro. Mean age was 24.1 years, | 
range 18-46, for the NIH group and 45.6 years, range 
21-59, for the GM &S group. Four had less than an 
eighth grade education, 13 were at the high school level, 
and 20 had some college education. 

Procedure. Most of the control subjects at NIH were 
tested in a group. The remainderand all GM &S control ' 
subjects were tested individually. Presharpened, soft 
lead pencils were always used. All schizophrenic subjects 
were tested individually. Instructions were repeated 
and clarified if necessary. 

The gaps in each subjects reproductions were 
measured in millimeters and subtracted from 20 milli- 
meters, the standard gap size, to obtain a “closure 
score.” The higher the score, the greater the degree 
of closure. Negative scores occurred when the gap was 
greater than 20 millimeters, | 
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TABLE 1 It 
CORRELATIONS OF MEAN CLOSURE Scores ON TEST | 
AND CONTROL FIGURES: PATIENTS AND CONTROL | 


SUBJECTS 
x (Boy + 
Group N | Boy:House BNET 
All schizophrenic subjects | 36 | +.61* | —.27 
Control subjects: NIH 17 | +.66* —.05 
Control subjects: Medical | 20 | -F.61* | —.02 4 


and surgical cases 


* Significant at .01 level. 
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RESULTS 


Table 1 shows some relationships among the 
mean closure scores on the test and control 
figures for both the patient and control groups. 
Closure scores were consistent for subjects 
when the test figure was either Boy or House. 
The degree of consistency was much the same 
whether the subjects were schizophrenic, 
physically ill, or healthy nonschizophrenic 
individuals. Thus, the mean closure scores on 
the test figures were quite stable. However, 
these scores did not correlate with the mean 
closure scores of the control figure, the tick- 
tacktoe, indicating that the mean closure 
scores were a function of the configurational 
conditions built into the test figures but not 
built into the control figure. 

The main results of the study are shown 
in Figure 1. Each dot in Figure 1 represents 
the mean closure score for the 16 gaps in the 
Boy and House drawings. Both control groups 
were combined because there were no signifi- 
cant differences between them with respect 
to closure scores on the test or control figures. 
Neither age nor education level was signifi- 
cantly related to performance on these tasks. 
Among all 37 control subjects there was only 
one who did not show some degree of closure. 
The mean closure score for the group as a 
whole was 3.50 millimeters, SD being 2.25. 


SCHIZOPHRENIC Ss % °% 
(on drugs) 
SCHIZOPHRENIC Ss EL. Ad 


(not on drugs) 


CONTROL Ss 


-5 OPENING ——— O—- CLOSING +5 
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The schizophrenic subjects who were on 
drugs have been separated from the others. 
Interpretation of differences between the two 
groups of patients is contingent on the fact 
that the group on drugs was also judged by 
the hospital staff to be “sicker,” that is, more 
disorganized. Thus, the drug and severity of 
illness conditions were confounded. The 
authors, naturally, had no control over such 
matters. Differences between the groups are 
interesting nonetheless. 

Of the 27 schizophrenic subjects who were 
not on drugs, 11 showed no closure. The mean 
closure score for the group as a whole was 1.47, 
SD being 3.32. It can be seen that four sub- 
jects contributed disproportionately to the 
group variability, showing an exaggerated 
degree of closure. Of the four, two were pre- 
dominantly paranoid and two catatonic. We 
are not able to account for the markedly 
different performance of these four subjects. 
The difference in mean closure scores between 
the nondrug schizophrenics and the control 
subjects was significant at the .01 level. 

Among the nine patients on drugs, there 
were virtually none who showed any appreci- 
able degree of closure, six of the nine showing 
a definite tendency to open the gaps rather 
than to close them. The mean closure score 
for this group was —1.48, and the SD was 


E t 


basoe did codes o 


+10 


MILLIMETERS 


Fic. 1. Closure scores of schizophrenic and control subjects. 
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TABLE 2 
INTRA-INDIVIDUAL VARIABILITY OF CLOSURE ScORES ON TEsT AND CONTROL FIGURES 
Closure Scores 
Boy and House Ticktacktoe 
N 
3 Largest |3 Smallest | Range 3 Largest | 3 Smallest | Range 
x e|x|e|Xl|e x e|x|e|&i|sc 
Patients: Nondru 27 |—4,12/4.91/6.34)3 .31/13.48)6.53/—1.34/4.80/3.48/5.35/6.59/4.35 
Pidente Drag i 9 |—7.24:2.934.83/1.20115.56/3.48| —2.78/3.952. 594.44 8.22|3.30 
Control Subjects 37 |—0.88/1.77/7.92/3.02/11.51/3.62] —1.94/3.75/2.35/3.70/6.30/3.44 
Tests of significance 
t Po rz prr TAN p t Puts. | 4 |» 
Patients: Nondrug vs. Controls 3.31| .0111.95| .06| 1.42| ns | 0.54] ns |0.95| ns |0.29| ns 
Patients: Nondrug vs. Patients: Drug 2.29| .05/2.01| .05| 1.22| ns | 0.89| ns |0.49| ns 1.18| ns 
Patients: Drug vs. Controls 6.30} .01/4.83) .01| 3.11| .01| 0.58] ns |0.15| ns |1.56| ns 


1.49. Differences between closure scores for 
this group and the other two groups were 
significant at well beyond the .01 level. 

Mean closure scores for the eight gaps of the 
ticktacktoe were 0.19, 1.04, and —0.67 for 
the control subjects, nondrug schizophrenics, 
and schizophrenics on drugs, respectively. 
None of the differences between groups was 
statistically significant. 

In Table 2, the performance of schizophrenic 
and control subjects is examined in greater 
detail. It was possible that normal subjects 
may have had relatively little variability 
among the gaps of any drawing, whereas the 
schizophrenic subjects may have both opened 
and closed gaps in any given drawing in a 
more extreme fashion, although opening more 
than closing, relative to control subjects. 
Thus, their mean closure scores may have been 
an inadequate representation of what they 
actually did. To examine this possibility, we 
computed the mean of the three largest gaps 
and the three smallest gaps in the Boy and 
House drawings combined and in the tick- 
tacktoe reproductions by each subject. As a 
measure of the overall variability in each 
reproduction, we took the difference between 
the largest and the smallest gap, viz., the 
range. 

From Table 2, it can be seen that the mean 
of the three largest gaps by control subjects 
was less than 1 millimeter larger than the 


standard; they seldom opened any particular 
gap, and then only very slightly. In contrast, 
the patients with and without drugs had 
extreme gap-opening scores of over 7 and 4 
millimeters, respectively. However, it was not 
the case that the schizophrenic subjects also 

: had more extreme closures, since the scores 
of control subjects were appreciably higher 
than both patient groups in this respect. Thus, 
the results shown in Figure 1 do not merely 
capitalize on differences between the groups 
in variability of performance but represent 
basic differences. With respect to the range 
of closure scores, control subjects did not differ 
significantly from schizophrenic subjects not 
on drugs, but had reliably narrower Tanges 
than the patients on drugs. Clearly, however, 
the effect of the drugs or the more severe 
mental condition of the latter group was not 
to increase variability randomly, but to exag- 
gerate further the tendency to open the gaps 
in their reproductions, increasing their range 
of scores in a specific direction. 

On the control figure, the ticktacktoe, 
there were no significant differences between 
the groups with respect to any of the three 
aspects of variability measured. 


Discussion 


From the point of view of gestalt theory, 
the gap in an incomplete figure sets up forces 
in the perceptual field which arouse processes 


ed 
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leading to closure. These forces occur, or have 
their counterpart, in the brain. In a brain 
where the potential for mobilizing such forces 
does not exist or is reduced, the degree of 
perceptual closure should be proportionately 
lowered. Thus, the finding of a *closure deficit" 
in schizophrenic subjects would be compatible 
with theories of schizophrenia which hold a 
deficit in the function of energizing or arousing 
systems in the brain to be responsible for the 
illness (Altschule, Promisel, Parkhurst, & 
Gruenbaum, 1950; Bychowski, 1943; Meadow 
& Funkenstein, 1952; Nielsen, 1948). Such 
systems might include the so-called reticular 
activating or limbic systems. The defect could 
presumably lie in some humoral substance 
which normally subserves the functioning of 
such systems, or is itself directly energizing 
in its normal state (Kety, 1959a, 1959b). 

It is possible, however, to examine the 
results from another pointof view. Thefact that 
the nondrug schizophrenic subjects closed the 
gaps of the test figures 1.47 millimeters on the 
average, whereas the mean closure score of the 
contro] subjects was 3.5 millimeters more than 
the standard to be copied, might indicate, 
paradoxically, that the performance of the 
Schizophrenic subjects was superior to that 
of the normal subjects. The patients’ scores 
(with four noted exceptions) clustered around 
zero, which indicates perfectly accurate per- 
formance, whereas control subjects’ scores 
clustered around a point which deviated 3.5 
millimeters from perfect accuracy. 

A similar and perhaps related finding has 
been noted with respect to the Miiller-Lyer 
illusion, In a study by Siegel (1954), groups 
of schizophrenic and normal subjects were 
required to adjust the “wings” in the Müller- 
Lyer figure to make equal the two horizontal 
lines which the wings intersected. The task 
was repeated at 2-day intervals. The schizo- 
phrenic subjects did better than normal sub- 
Jects on all six occasions for one standard 
setting and on five of six occasions for another. 
Moreover, both groups showed characteristic 
learning curves, in that the deviations from 
the standard became less with practice, but 
the overall degree of improvement was actually 
somewhat greater for the patients than for the 
control subjects. 

Brunswik (1956) emphasized that the 
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subject’s attitude (Einstellung), ie, the 
manner of directing attention toward a certain 
variable, is of great importance in determining 
the degree of illusion perceived. Citing an 
experiment by Marianne Müller, he pointed 
out that when the subject's attitude was 
natural, relaxed, or '"whole-perceiving," the 
ilusion effect was greater than when the 
attitude or set was “part-isolating.” 

It is therefore not unreasonable to infer 
that, in Siegel's (1954) study, the schizo- 
phrenic subjects had more of a "'part-isolating"" 
set and paid less attention to the context 
provided by the wings of the figure and to the 
areas which they subtended. This could then 
“explain” why their performance was more 
“accurate” than control subjects’, i.e, why 
they were less subject to the illusion. 

Similarly, in the present experiment, it is 
not improbable that schizophrenic subjects 
paid less attention to the whole figure as such, 
but focused primarily on the part of the 
drawing which they were making at any given 
time. Such a set was more clearly shown in 
the performance of more severely ill patients, 
whose drawings tended to be disjointed and 
loosely organized, the separate parts often 
failing to be properly aligned with one another. 
Failure to maintain a set in the way that 
normal subjects do (the tendency to develop 
segmental sets) has been described as a funda- 
mental formal characteristic of schizophrenia 
by Shakow (1950), and the present findings 
are consistent with such an hypothesis. 

The question arises as to why the schizo- 
phrenic maintains this “part-isolating” 
attitude and how general a phenomenon it is 
with respect to his perception of part-whole 
relationships under different circumstances. 
There may be an analogue here to the concrete- 
abstract relationships found among schizo- 
phrenic subjects in tasks involving conceptual 
thinking. Roseman's (1952) test of perceptual 
closure involved fitting jigsaw pieces together 
to form patterns which represented concepts 
such as geometrical shapes. With such a task, 
it is difficult to say whether performance 
deficit was due to inability to form the con- 
cepts or inability to obtain perceptual closure. 
Tn our experiment, the concepts Boy and House 
were given, but the schizophrenic subjects 
seemed to keep the concepts less “in mind,” 
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so that the failure to obtain closure may be a 
function of how schizophrenic subjects deal with 
and are guided by concepts even when they 
can form them or have them provided. The 
functional similarity between keeping a con- 
cept “in mind” and maintaining a mental set 
is apparent. 

It should be pointed out that a theory of 
brain arousal-deficit and a set-deficit theory 
are not necessarily incompatible, but may 
represent two levels of analysis of a common 
phenomenon. 


SUMMARY AND CONCLUSIONS 


Two test figures, each containing eight gaps 
of 20 millimeters, were copied by 36 schizo- 
phrenic and 37 control subjects. A control 
test of eight 20 millimeter gaps between four 
ticktacktoe figures was also administered. 
Mean closure scores were lower for the patients 
than for control subjects on the test figures, 
but no significant difference between the 
groups occurred on the control figure. The 
results were interpreted to be compatible with 
two hypotheses of schizophrenia, not 
necessarily mutually exclusive: Schizophrenia 
reflects a deficit in energizing or arousal 
systems of the brain; it reflects an inability 
to maintain a major set in the same way that 
normal subjects do. 
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three symptom clusters and suggested 
that these clusters could be organized 
conceptually along a developmental hierarchy. 
On theoretical grounds, these investigators 
suggested that symptoms indicative of “self- 
deprivation and turning against the self” 
represent an introjection of social standards 
in keeping with a relatively adequate level of 
social maturity. In contrast, symptom clusters 
indicative of either an “avoidance of others” 
or “self-indulgence and turning against others” 
were conceptualized as representing develop- 
mentally lower levels of psychological function. 
A test of this hypothesis was carried out by 
Zigler and Phillips (1960). Since these in- 
vestigators felt it premature to attempt the 
construction of a comprehensive index of 
maturity, they employed social competence as 
an approximation of such an index. Defining 
social competence in terms of six simple and 
objective biographical variables, they found 
that the symptom clusters noted here were 
related to premorbid social competence in the 
predicted direction. It thus appears that a 
developmental or maturational framework 
has considerable heuristic value for our under- 
standing of psychopathology. In the present 
study, this framework was again utilized. 
Developmental theorists of both psycho- 
analytic (Freud, 1952; Hartmann, 1952; Kris, 
1950; Rapaport, 1951) and nonpsychoanalytic 
persuasion (Lewin, 1936; Piaget, 1951; Werner, 
1948) have suggested that primitive, develop- 
mentally early behavior is marked by imme- 
diate, direct, and unmodulated responses to 
external stimuli and internal need states. In 
contrast, higher levels of maturation are 
characterized by the appearance of indirect, 
ideational, conceptual, and symbolic or verbal 


p= and Rabinovitch (1958) isolated 
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behavior patterns. Thus developmental psy- 
chology suggests that a shift in emphasis from 
action to thought is an expression of the 
maturational sequence. 

Use of such an action-thought parameter 
appears to allow for the derivation of certain 
predictions concerning the relationship be- 
tween symptomatology and maturity as 
measured by premorbid social competence. 
More mature individuals who are manifesting 
inappropriate or symptomatic forms of, 
behavior should do so with a relatively greater 
emphasis on ideational, symbolic, or verbal 
forms and, correspondingly, should show fewer 
symptoms expressed in direct or overt action. 
Alternatively, less mature persons in patho- 
logical states should evidence symptoms in the 
sphere of direct or gross action rather than 
those of the ideational or verbal variety. The 
specific hypothesis tested in the first study can 
be stated as follows: Individuals who manifest 
symptoms in the ideational and verbal sphere 
have attained a higher level of premorbid social 
competence than those individuals who mani- 
fest symptoms characterized by overt action. 

This hypothesis gains some support from 
earlier studies which examined the relationship 
between maturational level, as defined by 
Rorschach perceptual indices, and the in- 
cidence of behaviors in the spheres of thought 
and action. These studies dealt with the 
contrasting developmental level of directly 
assaultive versus verbally assaultive individ- 
uals (Misch, 1954), of persons who make 
actual suicidal attempts versus those who only 
harbor suicidal ideas (Kruger, 1954), of 
sexually deviant individuals versus those 
preoccupied with fears of acting in a sexually 
deviant fashion (Kruger, 1954), and, finally, 
of overactive children versus their underactive 
counterparts (Hurwitz, 1954). These earlier 
studies have characteristically examined rela- 
tively small samples of individuals who mani” 
fested one of a pair of comparable symptoms 
in the realms of action or thought. In contrast, 
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the present study employed a relatively large 
psychiatric population and included an analy- 
sis of a wide range of psychiatric symptoms. 


Srupv I 
Subjects 


This study was based on the examination of the 
case histories of 793 patients, 480 males and 313 fe- 
males, admitted to Worcester State Hospital during a 
12-year period (1945-1957). The particular case his- 
tories employed were those of patients referred to the 
hospital psychology department for personality study 
and who were eventually diagnosed as suffering from 
a functional disorder. While this sample includes a 
wide variety of cases, extremely deteriorated or very 
agitated patients are seldom referred for psychological 
evaluation and are not adequately represented in this 
sample, 


Categorization of Symptoms 


A symptom (Phillips & Rabinovitch, 1958; Zigler 
& Phillips, 1960) here refers to the description of a 
patient’s behavior by a psychiatrist at the time of 
initial institutional contact or the description of be- 
havior presented by referring physicians as the primary 
reason for hospitalization. An examination of the 793 
case records resulted in the tabulation of 15 symptoms 
which could be clearly placed in the action category 
and 14 which could be clearly placed in the thought 
category. Symptoms included in the action category 
Were: accused of murder, assaultive, perversions (except 
homosexuality), homosexuality, compulsions, rape or 
incest, robbery, suicidal attempt, fire setting, emotional 
outbursts, irresponsible behavior, maniacal outbursts, 
eats a lot, all addictions (except drinking), and drinking. 
Symptoms included in the thought category were: 
Suspiciousness, phobias, fears own hostile impulses, 
bizarre ideas, obsessions, suicidal ideas, sexual preoc- 
cupation, threatens assault, depreciatory ideas against 
the self, feelings of sexual inadequacy, perplexed, 
depersonalization, feels perverted, and feels going 
crazy or mentally sick, Hallucinations present a special 
difficulty in classification, These experiences clearly do 
not occur in the realm of action, yet they cannot un- 
ambiguously be placed in the sphere of thought, Such 
pathological manifestations are Most accurately to be 
categorized as perceptual phenomena (Arieti, 1955). 
Consequently, this symptom was removed from further 
consideration in the analyses undertaken for this study. 


Social Competence Indices 


As in the earlier studies, the variables of age, intelli- 
gence, education, occupation, employment history, and 
marital status were used as social competence indices. 
Each variable was divided into three categories with 
each category conceptualized as Tepresenting a step 
along a social competence continuum. The rationale 
employed for ordering the categories of each variable 
was the same as that used in an earlier study (Zigler 
& Phillips, 1960). The categories of each variable and 
their order from low to high are presented below. 
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above. 
2. Intelligence—IQs obtained on a standard intelli- 
gence test of 84 or less, 85-115, and 116 and above, 
3. Education—none or some grades including un- 
graded or special classes; finished grade school, 
some high school, or finished high school; and, 
some college or more. i 
4. Occupation—The Dictionary of Deus 
: 


1. Age—24 and below, 25-44, and 45 years and 


Titles (United States Government Printing Office, 
1949) was employed to place each occupation 
into the categories of unskilled and semiskilled, 
skilled and service, clerical and sales, and profes- 
sional and managerial. 

5. Employment history—usually unemployed; sea- 
sonal, fluctuating, frequent shifts, part-time em- - 
ployment; and, regularly employed. 

6. Marital status—single; separated, divorced, 
remarried, and widowed; and, single continuous, 
marriage. 


Method 


Each of the 793 case histories was examined for 
symptoms manifested. The purpose of this initial 
analysis was to place individuals into a pure Action, 
pure Thought, or Mixed group. All individuals who 
manifested only symptoms of the action variety were 
placed in the Action group. All individuals who mani- 
fested only symptoms of the thought type were placed 
in the Thought group. Individuals whose symptoms. 
were equally divided in number between the action 
and thought categories were placed in the Mixed group. — 
This procedure resulted in 182, 247, and 136 individuals 
being assigned to the Action, Thought, and Mixed | 

| 
| 
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groups, respectively. The remaining individuals in the 

original sample were removed from further con- 
sideration, m 
The case history of each individual was examined in | 
order to assign him a score on each of the six social | 
competence indices. For any of the indices, placement | 
in the lowest category resulted in a score of 0 for that _ 
index. Similarly, assignment to the middle category — 

resulted in a score of 1 and to the highest category, 4 
score of 2. The overall social competence score for | 
each person was the mean of the scores obtained on the _ 
individual indices, This averaging procedure was — 
required since data were not available for every indi- 
vidual on all six variables. Thus the final social com- _ 
petence score for any patient could range from 0 (the l 

lowest category on every variable for which informa- 
tion was available) to 2 (the highest category on every | 
variable for which information was available). i 
j 


Resulis 


| 
The prediction to be tested was that in- | 
dividuals in the Thought group have higher | 
Social competence scores than individuals in j 
the Action group. Although not predicted, | 
the social competence scores of the Mixed 1 
group would be expected to fall between those 
of the other two. In order to test whether these 
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TABLE 1 


DISTRIBUTION OF SOCIAL COMPETENCE SCORES 
FOR THE SymPTOM GROUPS 


Action Mixed Thought 


Social 
Competence} Ob- Ex- Ob- Ex- Ob- Ex- 
Score served | pected | served | pected | served | pected 
Fre- Fre- Fre- Fre- | Fre- Fre- 

quency | quency | quency | quency | quency | quency 


38.0 | 28.0 | 28.4 | 38.0 | 51.6 
76.3 | 59.0 | 57.1 | 93.0 |103.6 
56.4 | 41.0 | 42.1 | 94.0 | 76.5 
11.3 | 8.0 | 8.4 | 22.0 | 15.3 


0-.50 | 52.0 
.51-1.00| 85.0 
1.01-1.50| 40.0 
1.51-2.00| 5.0 


relationships actually obtained, a social com- 
petence score X group contingency table was 
set up. The distribution of each group on the 
social competence variable is shown in Table 
1. The expected distribution of each group if 
there were no real differences among the groups 
in respect to social competence is also pre- 
sented in Table 1. The chi square resulting 
from this contingency table was highly signifi- 
cant (x? = 26.2; df = 6; p <.001). As can 
be seen from Table 1, the Action and Thought 
groups contribute what is virtually the entire 
deviation from expectancy, the deviation 
being in the predicted direction. The distribu- 
tion of the Mixed group almost matches ex- 
pectancy and falls midway between the dis- 
tributions of the Action and Thought groups. 

A further analysis was conducted to in- 
vestigate the relationship between symptom 
group membership and each of the social 
competence indices. Using the same procedure 
as above, index X group contingency tables 
were set up for each of the six individual 
indices. The chi squares associated with the 
education, intelligence, age, and occupation 
indices all had » values <.05. The squares 
associated with the employment history and 
marital status indices had ? values <.20. The 
relationships between group membership and 
each of the six indices were in the predicted 
direction, 


Discussion 

The findings of this study supported the 
hypothesis advanced. Within the psychiatric 
Population employed, the symptoms of the 
more socially competent individuals tend to 


Occur in the sphere of verbalization and idea- 
tion rather than in the sphere of action. In 
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contrast, less socially competent patients 
preponderantly demonstrate their sympto- 
matology in direct action, These findings may 
be related to the position advanced earlier 
that for every maturity level there exist effec- 
tive patterns of adaptation as well as patho- 
logical deviations from these patterns (Phillips, 
1960). Within this context, the three syn- 
dromes investigated in this study—Action, 
Mixed, and Thought—may be conceptually 
ordered and viewed as corresponding to differ- 
ent inappropriate resolutions, each of which 
is tied to successive stages of social maturation. 

The present findings further suggest that 
the substitution of pathological for normal 
behavior patterns does not constitute a re- 
gression in the usual sense. That is, the in- 
dividual in becoming symptomatic does not 
appear to change his habitual mode of re- 
sponse. The action oriented, less mature, and 
less socially competent person tends to remain 
action oriented in his new and inappropriate 
behavior pattern. Antithetically, the more 
mature and more sociall effective person, 
predisposed to ideational, indirect, symbolic, 
or more verbal forms of self-world interaction, 
remains so oriented in a period of psycho- 
pathology. 


Struny II 


The findings of the first study may also be 
related to the concept of vicariousness, orig- 
inally employed in the sensory-tonic theory 
of perception (Werner & Wapner, 1952). AI- 
though considerable theoretical and experi- 
mental work has been done in an effort to 
include this concept within a developmental 
framework (Hurwitz, 19054; Kruger, 1954; 
Misch, 1954; Wapner, 1954, 1956), the rela- 
tionship between the concepts of vicarious- 
ness and maturation remains ambiguous. As 
noted earlier, the developmental position 
maintains that increasing maturation is ac- 
companied by a corresponding increase in 
ideational and verbal forms of expression and 
a decrease in overt action. The concept of 
vicariousness suggests that a functional 
equivalence exists between action and thought, 
and that if behavior in either of these modes is 
blocked for whatever reason, it will be re- 
channeled into the alternative form of ex- 
pression. 
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This view that action and thought are 
behavioral equivalents appears to be in opposi- 
tion to the maturational thesis. It suggests 
that independent of maturation, the particular 
behavioral mode employed is dependent either 
on the particular external conditions under 
which the individual operates or on certain 
predispositions or individual preferences for 
particular types of behavior. Of the two 
theoretical positions just outlined, maturation 
and vicariousness, the findings of the first 
study would appear to support the former, 
le. an emphasis on action or thought in 
symptomatic behaviors can be most appro- 
priately conceptualized within a maturational 
framework. 

However, this is not to assert that the con- 
cept of vicariousness could not eventually be 
incorporated within this framework. A po- 
tential for either action or thought can exist 
at each of the maturational levels, although 
lower levels of maturation tend to be charac- 
terized by an emphasis on action and higher 
levels by an emphasison thought. On the basis 
of an extensive body of work (Piaget, 1950; 
Podell & Phillips, 1959; Werner, 1948) yu 
can be anticipated that the quality or charac- 
teristics of the final behavior, whether in 
action or thought, will be related to matura- 
tional level. Thus it would be predicted that 
the action behaviors of the maturationally high 
individual who is employing an atypical, action 
dominated behavioral orientation will be 
developmentally higher in character than the 
action behaviors of the maturationally low 
individual. Analogously, it would be predicted 
that the thought behaviors of the matura- 
tionally low individual who is employing an 
atypical, thought dominated behavioral orien- 
tation will be developmentally lower in charac- 
ter than the thought behaviors of the matura- 
tionally high individual. 

A major purpose of the second study was 
the testing of this proposition. This test would 
appear to require (g) a measure of the in- 
dividual’s maturity level, (b) a criterion whose 
presence indicates that the individual is em- 
ploying an atypical behavioral orientation, 
and (c) a set of broad behavioral referrents that 
not only mirror the individual’s orientation 
but also can be rationally ordered to a develop- 
mental continuum. A social competence index 
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similar to those employed in the first study was 
used as the measure of personal and social 
maturity. 

Developmental theory as well as empirical 
findings in the psychology of occupations 
(Roe, 1956) was employed to derive a criterion 
of atypicality. Since high social competence 
individuals should typically have occupations 
other than those falling in the labor categories, 
individuals of relatively high social compe- 
tence who do fall in the labor categories were 
classified as atypical. Inclusion in the labor 
categories would appear to indicate a domi- 
nance of “action” over “thought.” In addition, 
the requirements of the labor occupations, 
unskilled, semiskilled, and skilled, are such 
that they can be ordered to a developmental 
continuum. That is, what is required of the 
unskilled laborer is the gross, unmodulated 
activity that has often been associated with 
lower levels of development. Antithetically, 
the activity of the skilled laborer is of the 
highly articulated variety that has been 
ascribed to higher developmental levels. The 
activity of the semiskilled laborer would fall 
between these two extremes. 

Since low social competence individuals 
should typically have occupations falling in 
the labor categories, individuals of relatively 
low social competence who fall in occupational 
categories which emphasize conceptualization 
or the manipulation of symbols (professional 
and managerial, sales, and clerical) were 
classified as atypical. Although the thought 
occupations do not form as clear a develop- 
mental hierarchy as do the labor occupations, 
they too may be rationally ordered to such a 
continuum. The professional and managerial 
occupations would be at the top and the 
clerical occupations at the bottom of this 
hierarchy, with the sales occupations falling 
at an intermediary point. 

Employing these measures, the specific 
predictions to be tested in the second study 
may be stated as follows: Individuals who 
have exhibited a relatively high level of 
premorbid social competence but who have 
occupations in the labor categories more fre- 
quently have jobs which fall at the upper end 
of the unskilled-skilled continuum than do 
individuals who have exhibited low pre- 
morbid social competence, and individuals 
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who have exhibited a relatively low level of 
premorbid social competence but who have 
occupations in the categories of professional 
and managerial, sales, and clerical more 
frequently have jobs which fall at the lower 
end of the clerical-professional continuum 
than do individuals who have exhibited rela- 
tively high premorbid social competence. 

A second major purpose of the second study 
was to test further the proposition that for 
every maturity level there exists effective 
patterns of adaptation as well as pathological 
deviations from these patterns, and that both 
the normal and pathological aspects of the 
individual’s functioning reflect the maturity 
level he has attained (Phillips, 1960; Zigler 
& Phillips, 1960, 1961). This position is in 
essential agreement with the view that mental 
disorders are continuing processes in which 
the premorbid, initial, middle, and ultimate 
stages are interrelated (e.g., Zubin & Windle, 
1954). As noted earlier, this position suggests 
that the substitution of pathological for normal 
behavior patterns does not constitute a re- 
gression in the usual sense. That is, the in- 
dividual in becoming symptomatic does not 
appear to change his habitual mode of re- 
sponse. This view was supported by the 
findings of the first study in that the action 
oriented, less mature, and less socially com- 
petent person tends to remain action oriented 
in regard to his symptom manifestations, 
whereas, the more mature and more socially 
effective person, being predisposed to idea- 
tional, indirect, symbolic, or more verbal forms 
of self-world interaction, remains so oriented 
in a period of psychopathology. 

A further test of this position suggests itself. 
If the orientation of the individual remains 
constant in both the premorbid and patho- 
logical periods, then the more socially effective 
person who employs an atypical action orien- 
tation in his normal functioning should also 
manifest an atypical action oriented symptom 
pattern. Analogously, the less socially com- 
petent person who employs an atypical thought 
orientation in his normal functioning should 
also manifest an atypical thought oriented 
Symptom picture. Again employing the 
measures outlined above the following specific 
predictions were tested: Individuals who have 
exhibited a relatively high level of premorbid 
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social competence, but who have occupations 
in the labor categories, more frequently mani- 
fest symptoms of the action variety than do 
individuals of equally high premorbid social 
competence whose occupations fall in the 
professional and managerial, sales, and clerical 
categories, and individuals who have exhibited 
a low level of premorbid social competence 
but who have occupations in these latter 
categories more frequently manifest symptoms 
of the thought variety than do individuals of 
equally low premorbid social competence 
whose occupations fall in the labor categories. 


Method 


Since the range of occupations of the female segment 
of the hospital population described in the first study 
was so restricted, the case histories of only male pa- 
tients were employed in this study. The histories of 
464 of the 480 male patients were used. The 16 patients 
whose records were not employed were those whose 
occupations did not fall into the categories of either 
professional and managerial, sales, clerical, skilled, 
semiskilled, or unskilled laborer. 

A new social competence score was obtained for 
each of the patients employed in this study. The proce- 
dure for computing each patient's social competence 
score was the same as that employed in the first study, 
except that the occupation variable was not utilized 
in the computation because occupation was one of the 
independent variables. Therefore, in this study an 
individual's social competence score is based on his 
age, intelligence, education, employment history, and 
marital status. 

Employing the procedure used in the first study, 
each of the 464 case histories was examined for symp- 
toms manifested in order to place individuals into a 
pure Action or a pure Thought group. This procedure 
resulted in 123 and 102 individuals being assigned to the 
Action and Thought groups, respectively. The 
remainder of the 464 case histories were removed from 
further consideration. 


Results 


The first prediction to be tested was that 
individuals who have exhibited a relatively 
high level of premorbid social competence but 
who have occupations in the labor categories 
more frequently have jobs which fall at the 
upper end of the unskilled-skilled continuum 
than do individuals who have exhibited low 
premorbid social competence. An occupation 
(skilled, semiskilled, and unskilled) X social 
competence group (High and Low) contin- 
gency table was set up. The chi square test of 
independence was employed to determine 
whether inclusion in a specific occupational 
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category was independent of membership in 
either of the two social competence groups. 
The resulting x? of 96.19 (p <.001) gave no 
evidence of such independence. The majority 
of individuals in the High social competence 
group fell into the skilled and semiskilled 
categories, whereas the majority of individuals 
in the Low social competence group fell into 
the unskilled category. 

The second prediction to be tested was that 
individuals who have exhibited a relatively low 
level of premorbid social competence but who 
have occupations in the categories of pro- 
fessional and managerial, sales, and clerical, 
more frequently have jobs which fall at the 
lower end of the clerical-professional contin- 
uum than do individuals who have exhibited 
relatively high premorbid social competence. 
Using the same procedure as above, an occu- 
pation (professional and managerial, sales, 
and clerical) X social competence group (High 
and Low) contingency table was set up. The 
resulting x? of 11.26 was also highly signifi- 
cant (p <.005). The majority of individuals 
in the High group fell into the professional 
and managerial and sales groups, whereas the 
majority of individuals in the Low group fell 
into the clerical category. Thus the findings 
essentially confirm the two predictions derived 
from the hypothesis, Even in those cases where 
an individual utilizes a developmentally 
atypical behavioral orientation, the quality or 
characteristics of the behavioral mode em- 
ployed is related to the maturational level 
attained by that individual. 

The next prediction to be tested was that 
individuals who have exhibited a relatively 
high level of premorbid social competence but 
who have occupations in the labor categories 
more frequently manifest symptoms of the 
action variety than do individuals of equally 
high premorbid social competence whose 
occupations fall in the professional and 
managerial, sales, and clerical categories, 
Only those individuals who were in the High 
social competence group and who were also in 
either the Action or Thought symptom 
groups were used to test this prediction. The 
chi square test of independence was again 
used. A 2 X 2 occupation (professional, 
managerial, clerical, and sales; and, skilled, 
semiskilled, and unskilled) X symptom pat- 
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tern group (Action and Thought) contingency 
table was set up. The resulting x? was a 
significant one (x? = 7.26; p <.01). Individ- 
uals in the Thought group tended to have the 
conceptually oriented occupations while in- 
dividuals in the Action group tended to be in 
the laborer occupations. Since it was pre- 
Viously shown that the individual's occupa- 
tion is related to his social competence level, 
the social competence scores of the "Thought 
and Action groups were examined to insure 
their comparability. Mean social competence 
Scores for individuals in the Action and 
Thought groups were 1.23 and 1.32, respec- 
tively. The difference of .09 was not significant 
(t <1.00). 

The final prediction to be tested was that 
individuals who have exhibited a low level of 
premorbid social competence but who have 
occupations in the professional, managerial, 
sales, and clerical categories more frequently 
manifest symptoms of the thought variety 
than do individuals of equally low social 
competence whose occupations fall into the 
labor categories. Only those individuals who 
were in the Low social competence group and 
who were also in either the Action or Thought 
Symptom groups were used to test this pre- 
diction. Again, a 2 X 2 occupation X symp- 
tom pattern group contingency table was set 
up. The resulting x? was significant (x? = 
6.78; p <.01). Individuals in the Action group 
tended to be in the labor occupations; in- 
dividuals in the Thought group tended to 
have the conceptually oriented occupations. 
As in the High Action and Thought groups, 
the social competence scores of the Low Action 
and Thought groups were compared, This com- 
parison revealed identical mean social com- 
petence scores (.56) for the two groups. Thus 
the findings also confirm the two predictions 
derived from the hypothesis that individuals 
who display an atypical behavioral orientation 
in their normal functioning will also display 
an atypical but congruent symptom pattern. 
Discussion 

The findings of this second study lend 
support to the two hypotheses advanced. It 
was found that for individuals manifesting à 
developmentally atypical behavioral orienta- 
tion in their occupation choice, the quality or 
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characteristics of their occupations neverthe- 
less remain related to the developmental level 
attained. It was also found that individuals 
displaying an atypical behavioral orientation 
in their normal functioning also displayed an 
atypical but congruent functioning in their 
pathological manifestations. 

Validation of the first hypothesis would 
appear to clarify the nature of the relationship 
between maturation and vicarious functioning. 
As noted earlier, this relationship has hereto- 
fore remained an ambiguous one. The findings 
of studies involving deviant groups (Hurwitz, 
1954; Kruger, 1954; Misch, 1954) have been 
employed in an effort to incorporate the 
vicariousness concept within a developmental 
framework (Wapner, 1954, 1956; Werner, 
1954). These earlier findings appear completely 
congruent with the developmental thesis that 
the higher the level of maturation attained, 
the greater is the emphasis on ideational or 
verbal behavior patterns as opposed to action 
oriented ones. The only aspect of these earlier 
studies that appears related to the vicarious- 
ness concept is the finding that the ideational 
groups produced more human movement re- 
sponses on the Rorschach than did the acting 
out groups. The implication here is that such 
responses represent a substitutive activity, 
suggesting a type of vicarious functioning. 
However, since increased production of human 
movement responses is also characteristic of 
higher developmental levels (Litwin, 1957; 
Podell & Phillips, 1959), it would appear that 
all the findings are consistent with the de- 
velopmental thesis without recourse to the 
vicariousness concept. It is clear that if one 
wishes to demonstrate vicarious functioning 
in different groups, he must hold the develop- 
mental level of the groups constant. Only then 
can differences in functioning be ascribed to 
vicariousness rather than to differences in the 
maturational level attained. This was the 
procedure employed in the present study, and 
It reveals that vicarious functioning does 
indeed occur within developmental levels, 
but that the formal characteristics of that 
functioning mirror the developmental level 
of the individual manifesting it. 

This finding would appear to raise certain 
questions concerning the more classical studies 
of vicarious functioning (Goldman, 1953; 
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Korchin, Meltzoff, & Singer, 1951; Krus, 
Werner, & Wapner, 1953; Singer, Meltzoff, 
& Goldman, 1952). For instance, in an often 
quoted study by Korchin et al. (1951) it was 
found that a group of normal adult subjects 
who were artificially immobilized produced 
significantly more human movement responses 
than a control group of nonimmobilized sub- 
jects. The present study would suggest that 
these findings are likely only if subjects are 
employed whose developmental level is charac- 
terized by a potentially high production of 
human movement responses, If subjects of a 
developmentally low level are employed, the 
interaction between developmental level and 
ideational production should be such that 
while the amount of ideational material would 
increase, its formal quality would be of a low 
rather than high level. 

Some recent evidence indicates that this 
suggestion is a valid one. Litwin (1957), em- 
ploying groups of children 6, 10, and 14 years 
of age, found an increase in human movement 
responses on the Rorschach with increasing 
age but did not find an increase in such re- 
sponses as a result of motorically inhibiting 
her subjects. Spivack, Levine, Fuschillo, and 
Tavernier (1959), employing adolescent sub- 
jects ranging in age from 11 to 19 years, did 
find an increase in ideational production on the 
Rorschach following motoric inhibition. How- 
ever, there was no increase in human move- 
ment responses. Instead, an increase was found 
in the production of responses developmentally 
lower in nature, animal movement and in- 
animate object movement responses. 

Although the present study provides some 
clarification of the relationship between 
maturation and vicarious functioning, neither 
it nor the earlier studies explain why an in- 
dividual should vicariously employ a be- 
havioral orientation which, from the develop- 
mental viewpoint, is an atypical one. One 
suggestion, derived from earlier experimental 
studies, is that the individual functions in a 
vicarious or substitutive manner when the 
external environmental conditions are such 
that they block the individual’s typical 
behavior. Thus it may be hypothesized that 
for individuals who employ a developmentally 
atypical orientation in their occupation choice, 
considerable external pressures influence this 
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choice, while no such pressure is likely to be 
found in the cases of individuals who employ 
a developmentally typical orientation. 

Another hypothesis is suggested by the 
finding of the first study that the develop- 
mental level of patients whose symptoma- 
tology is divided equally between action and 
thought symptoms falls midway between the 
developmental levels of patients whose 
symptomatology falls exclusively in the realm 
of action or in the realm of thought. It may be 
hypothesized that individuals who display a 
shifting or an ability for substitution, i.e., 
vicariousness, in their behavioral orientation 
are those whose maturation is at that transi- 
tional phase characterized by not only the 
presence of considerable overt action, but also 
by the beginning of an increased reliance on 
thought. This hypothesis, while explaining how 
the individual could employ either of two 
developmentally divergent modes, casts little 
light on the factors which influence the in- 
dividual's choice of modes. A complete ex- 
planation would still appear to involve ex- 
trapersonal, environmental factors. Postulat- 
ing the importance of such factors appears to 
be in agreement with the views of Hartmann, 
Kris, and Lowenstein (1946) and Erikson 
(1950) that individuals who are in transitional 
phases of ego development are particularly 
susceptible to external influence. 

One of the two hypotheses advanced above 
explains vicarious functioning solely in terms 
of "blocking" external events, while the 
other relates vicariousness to a combination 
of external events and a particular transi- 
tional level of maturation attained by the 
individual. While further research must be 
conducted to validate either of these hypoth- 
eses, certain findings here make it dubious 
that external events alone are responsible for 
vicarious functioning. If, as this hypothesis 
Proposes, an individual is forced to choose a 
developmentally atypical occupation because 
of external factors (parents’ aspiration level, 
employment possibilities, etc.), it would not 
be expected that this atypical orientation 
would manifest itself in an area where these 
factors are inoperative or where their strength 
has been greatly reduced—namely, in the 
pathological manifestations of the individual. 
However, the results of the present study 
clearly indicate that the individual's atypical 
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behavioral orientation remains constant in 
both the normal azd the pathological phases 
of the individual's functioning. 

This finding, in conjunction with earlier 
ones (Zigler & Phillips, 1960, 1961), argues 
that the individual's functioning is not solely 
the product of environmental conditions, nor 
even of pervasive normal or pathological 
states. Instead, it mirrors a specialization or 
life style that extends to all areas of func- 
tioning as well as being intimately related to 
the maturational level attained. Although the 
authors have been impressed with the heuristic 
value of conceptualizing pathological be- 
haviors as regressive phenomena, this view 
holds that the individual's life style or basic 
behavioral orientation is at least an equally 
important determinant of his pathological 
manifestations, The possible interactions be- 
tween regression and the various behavioral 
orientations of action and thought would 
appear to be an inviting area of investigation. 


SUMMARY 


Two studies were reported. In the first 
study, the hypothesis tested was that in- 
dividuals who manifest symptoms in the 
ideational and verbal sphere have attained a 
higher level of premorbid social competence 
than those individuals who manifest symptoms 
characterized by overt action. The case 
histories of 793 patients were examined in 
order to establish three groups, Action, 
Thought, and Mixed. All persons who mani- 
fested symptoms only in action were placed 
in the Action group and those individuals with 
Symptoms only in thought were placed in the 
Thought group. Individuals whose symptoms 
were equally divided in number between the 
action and thought categories were placed in 
the Mixed group. 

The variables of age, intelligence, education, 
occupation, employment history, and marital 
Status were employed in assigning each in- 
dividual a social competence score. It was 
found that individuals in the "Thought group 
had significantly higher social competence 
Scores than individuals in the Action group. 
It was also found that the distribution of the 
Mixed group on the social competence meas- 
ure fell midway between the distributions of 
the Action and Thought groups. 

In the second study, two hypotheses were 
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tested: In those cases where an individual 
utilizes a developmentally atypical behavioral 
orientation, the quality or characteristics of 
the behavioral mode employed is related to 
the maturational level attained by that in- 
dividual, and individuals who display a de- 
velopmentally atypical behavioral orientation 
in their normal functioning also display an 
atypical but congruent symptom pattern. 

The case histories of 464 male patients whose 
occupations fell into the categories of either 
professional and managerial, sales, clerical, 
skilled, semiskilled, or unskilled laborer were 
examined in order to establish High and Low 
social competence groups and Action and 
Thought symptom groups. The variables of 
age, intelligence, education, employment his- 
tory, and marital status were employed in 
assigning each individual a social competence 
score. As in the first study, inclusion in the 
Action or Thought symptom group was deter- 
mined by the specific symptoms the patient 
manifested. Individuals in the High social 
competence group who had occupations falling 
into the labor categories (skilled, semiskilled, 
and unskilled) were treated as employing an 
atypical behavioral orientation. Analogously, 
individuals in the Low social competence 
group but with occupations which emphasize 
conceptualization or the manipulation of 
symbols (professional and managerial, sales, 
and clerical) were also viewed as manifesting 
an atypical behavioral orientation. 

The two predictions derived from the first 
hypothesis were confirmed. Employing all 
individuals in the High and Low social com- 
petence groups who had occupations in the 
labor categories, it was found that the majority 
of individuals in the High social competence 
group fell into the skilled and semiskilled 
categories, whereas the majority of individuals 
in the Low social competence group fell into 
the unskilled category. Employing all in- 
dividuals in the High and Low social compe- 
tence groups whose occupations were of the 
thought oriented variety, it was found that 
the majority of individuals in the High group 
fell into the professional and managerial and 
sales categories, whereas the majority of in- 
dividuals in the Low group fell into the clerical 
category, 

The two predictions derived from the second 
hypothesis were also confirmed. Employing 
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only those individuals in the High social 
competence group who also fell into either the 
Action or Thought symptom groups, it was 
found that individuals in the Thought group 
tended to have the conceptually oriented oc- 
cupations, while individuals in the Action 
group tended to be in the labor occupations. 
This observation -also held in that group of 
individuals who were in the Low social com- 
petence group and who also fell into either the 
Action or Thought symptom groups. The 
findings of both studies were discussed in re- 
lation to the concepts of maturation, vicarious- 
ness, and regression. 
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SOCIAL REINFORCEMENT WITH CHILDREN AS A FUNCTION 


OF CA, SEX OF £, 


HAROLD W. S 
University of 


HE purpose of this study is to investigate 

the effectiveness of social reinforcement 

in modifying children's performance in 

a simple game. The variables of primary con- 

cern are the sex of the adult providing the 

social reinforcement, and the sex and age of 
the child being studied. 

In two recent studies with preschool children 
(Gewirtz, 1954; Gewirtz & Baer, 1958), 
women were found to have a significantly 
greater effect on the performance of boys than 
of girls, and men were found to have a greater 
effect on the performance of girls than of boys. 
Two types of tasks were used: easel painting 
in the presence of a permissive adult and a 
simple game in which one of two alternative 
responses was socially reinforced by the adult. 
A significant interaction between sex of adult 
and sex of child was found for amount of at- 
tention seeking while painting and for the 
frequency with which the socially reinforced 
response was made. Although, as the authors 
indicate, other interpretations may account 
ior the results, the Oedipal theory of Freud 
provides perhaps the most meaningful focus 
about which to view the data. 

The results of these two studies lead one to 
Speculate further about the possible effects 
of social reinforcement by adults on the per- 
formance of children of different ages. Again, 
even though other views may lead to similar 
Predictions, Freudian theory provides the 
most integrated series of hypotheses for making 
Such predictions. 

During the early years of childhood the 
Principal caretaker of both boys and girls is 
usually the mother. Through her role in 
satisfying the child's basic needs, the mother 
becomes the primary love-object of the young 
Child and her presence and comments acquire 
the capacity for reinforcing behaviors not 
associated with primary drives. Assuming the 

‘This study was supported by Grant M-3519 from 
the National Institute of Mental Health. 

* The assistance of Richard Strate in the conduct 
of this study, and of Raymond Collier in the analysis 
of the data is gratefully acknowledged. 
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relationship with the mother generalizes to 
other women, social reinforcement provided 
by women should be more effective in modi- 
fying the performance of both young boys and 
girls (2-4 years) than social reinforcement 
provided by men. As the girl grows older and 
enters the Oedipal period a shift in object- 
choice from the mother to the father begins 
to occur. During this period the boy not only 
fails to change his love-object from the 
mother to the father, but comes to view the 
father negatively. Results such as those found 
in the Gewirtz studies are predicted for this 
stage (4-7 years); that is, women are pre- 
dicted to have a significantly greater effect on 
the performance of boys than of girls, and men 
are predicted to have a significantly greater 
effect on the performance of girls than of boys. 
Later, the Oedipal relationships decrease in 
strength and the pattern of object-choice 
changes. The girl re-establishes a close affec- 
tive relationship with the mother, and the boy 
shifts his object-choice from the mother to 
the father. During late childhood (7-11 years), 
women should be more effective reinforcing 
agents for girls’ behavior than for boys’, and 
men should now be more effective reinforcing 
agents for boys’ behavior than for girls’. 

The present study attempts to investigate 
the validity of such predictions by testing 
children at three age levels (3-4 years, 6-7 
years, and 9-10 years) in a simple game in 
which either a male or female experimenter 
(Z) makes supportive comments during the 
subject’s (S’s) performance, In order to reduce 
the problems resulting from having only one 
male and one female E, six Es of each sex 
were employed. The use of more than one E 
also makes it possible to determine whether 
there are significantly different effects asso- 
ciated with different Es. 

MeErtHOD 
Subjects 
The Ss were 252 boys and 252 girls selected on the 


basis of CA. One-third of the Ss of each sex were within 
the CA range from 3-0 to 5-0 years, one-third from 
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6-0 to 8-0 years, and one-third from 9-0 to 11-0 years. 
The children were attending preschools or elementary 
schools in Minneapolis and St. Paul, and all of the 
children of the appropriate CAs in each group or class 
visited were used as Ss.* The schools in general enrolled 
children of average socioeconomic and intellectual level 
and were selected on the basis of availability rather 
than any other criteria. 


Experimenters 


The Es were 12 persons who were willing to perform 
as Es and who were involved in some phase of the 
Institute of Child Development program. The female 
Es included two graduate students in psychology, 
three graduate students in child development, and a 
secretary. The male Es included a graduate student in 
psychology, a graduate student in child development, 
a postdoctoral research associate in child develop- 
ment, and undergraduate students in psychology, 
sociology, and industrial education. The age range for 
the Es was 19-25 years, with a mean CA of 22.8 years. 


Apparatus 


"The apparatus has been described in detail elsewhere 
(Stevenson & Fahel, 1961). It consisted essentially 
of a table of adjustable height with two bins, 8 X 10 
inches, and a short transverse upright panel to shield 
E's recording from S. The left bin contained approxi- 
mately 1,600 orange, blue, and green marbles. The 
right bin was covered by a plate with six 5$ inch 
holes placed randomly about the surface. Below the 
right bin was a mechanism whereby each marble was 
counted as it fell to the bottom of the bin. 


Procedure 


"The Ss were tested individually, Each S was obtained 
from the classroom by E and taken to the experimental 
room, which in practically all of the schools visited was 
a quiet, isolated room. The E engaged S in friendly 
conversation and seated S at the table. The Æ affirmed 
S's age and name, and said: 

We're going to play a game. Let me tell you about it. 
This is called the Marble Game. See the marbles? 
We have blue ones and orange ones and this color 
green ones and this color green ones. Let me show you 
how to play it. The marbles go in these holes. [E 
indicated the marbles in the left bin and the holes 
of the plate covering the right bin.] You can put any 
color marble in any hole. These are the marbles and 
these are the holes, Pick the marbles up one at a 
time and put them in the holes. Let me show you 
how to do it. [E demonstrated how the marbles 
could be dropped in each hole] O.K.? I'll tell you 
when to stop. Now you can play the game. 


* The writer wishes to thank Judith Rosen, Barbara 
Knight, Kristin Arnold, Stephanie Grossman, Evelyn 
Stern, Norma McCoy, Richard Strate, Norman Kass, 
Gerald Peterson, Peder Johnson, Mervyn Bergman, 
and Edward Dowd who served as Es, and the principals 
and teachers of the nursery schools and elementary 
schools visited for their cooperation in providing 
children for the study. 
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The first minute of the game was used to establish 
a baseline rate of response for each S. During this 
minute E played the role of an attentive, but non- 
reinforcing observer of S's performance. Care was 
taken not to smile or to nod approval of 5's behavior 
during this period. After the first minute E continued 
to be attentive to S’s performance and did not avoid 
smiling and nodding when delivering one of the state- 
ments of verbal support. Twice a minute for the next 
5 minutes Æ made a supportive statement about S's 
performance. The statements were made after approxi- 
mately 15” and 45” within each minute, and immedi- 
ately following one of S’s responses. Five statements 
were used: "You're doing very well,” "That's very 
good,” “You know how to play this game very well," 
"That's fine,” “You're really good at this game." 
These statements were made in a predetermined random 
order established separately for each S. Any attempts 
by S to engage E in conversation were not responded 
to by E. 

Three departures from this procedure were allowed. 
First, if S stopped responding for 30 seconds during the 
course of the game, £ said, “T'I tell you when to stop." 
If S failed to resume response or if S later stopped again 
for 30 seconds, the game was terminated. Second, if 
S picked up a handful of marbles, E said, *Put them 
in one at a time." If S did not heed this instruction, S 
was dropped from the experiment. Third, if S stopped 
responding during the first minute before a supportive 
comment was made, S was discarded. In addition to 
the last two criteria, 5s were also discarded if there were 
mechanical difficulties in the apparatus, or if any 
difficulties arose in the experimental procedure, such 
as someone's interrupting the game. A total of 35 Ss 
were eliminated from the experiment and replaced by 
other Ss on these bases. 

Each E tested seven boys and seven girls at each 
of the three age levels. Most of the Es tested children 
from several schools within an age level, This was 
desirable from the standpoint of increasing the random- 
ness of the sample of children tested by each Æ and was 
necessary because of the small number of children in 
some of the classes or groups visited. The children were 
obtained either alphabetically, according to seat order, 
or according to which children happened to be unoc- 
cupied or not engaged in group activities at the time 
E visited the class, There was no tendency for the first 
E visiting a group to obtain a different set of children 
from Es visiting the group later. When large groups 
were visited, several Es were available so that all 
children in a group were tested in a fairly uninterrupted 
sequence. 

Before beginning to test children, each Æ practiced 
a predito several times with an experienced Æ as 

e 5. 


Response Measures 


The number of marbles inserted during each minute 
of the game was recorded by E. The number inserted 
during the first minute when E made no supportive 
comments was used as an index of S’s base rate of 
response. The base rate was subtracted from the num- 
ber of marbles inserted during each subsequent minute 
for each S and the mean of these difference scores was 


FACTORS IN SOCIAL REINFORCEMENT WITH CHILDREN 


30 


20r 


a 


—— 


Or 


L Male E Female E 
CA 3-4 


Mean Difference Score/Minute 
[s] 
T 


Male E 
CASI 


149 


Boys 
LI Girls 


Male E Female E 
CA9-10 


Female E 


Fic. 1. The average difference score obtained for each minute of the game for each CA and for each sex of Ss 


and Es, 


used as an index of the effectiveness of the social rein- 
forcement in modifying S's rate of response. The two 
measures used in the major analyses of the results 
were, therefore, the base rate and the mean difference 
score, 


RESULTS 


The first prediction was that women would 
have a significantly greater effect than men on 
the performance of both boys and girls at the 
3-4 year level. The mean difference score for 
boys and girls tested by men was —.04, and for 
boys and girls tested by women it was 1.38 
(see Figure 1). This difference is significant 
at the .005 level (t = 2.97, df = 166). 

In order to determine whether this differ- 
ence might also be characteristic of the 
performance of Ss at the other age levels, the 
same comparison was made for the other ages. 
The mean difference score for Ss tested by 
women at the 6-7 year level was 1.29 and for 
Ss tested by men, .90. The difference is not 
Significant (¢ = 1.08). The mean difference 
Score for Ss tested by women at the 9-10 
year level was .80, and for Ss tested by men, 
1.55. The difference is in the opposite direction 
from that found for younger Ss, but is not 
Significant (¢ = 1.68, df = 166, p > .10). 


The first prediction is therefore confirmed. 
Social reinforcement in the present setting is 
more effective in modifying the rate of re- 
sponse of boys and girls at the 3-4 year level 
when it is provided by women than when it is 
provided by men. The difference is significant 
at the 3-4 year level and at none of the other 
CA levels. 

The second prediction was that for children 
of 6-7 years women would have a greater 
effect on the performance of boys than of 
girls, and men would have a greater effect on 
the performance of girls than of boys. The 
mean difference score for boys tested by women 
was 2.18, and for girls tested by women, .40. 
The difference is significant at less than the 
.025 level (¢ = 2.37, df = 82). The mean differ- 
ence score for girls tested by men was 1.15, 
and for boys tested by men, .64. The differ- 
ence is in the predicted direction, but is not 
statistically significant (/ = .66). 

Again, to determine whether this effect 
occurs only at the 6-7 year level, the same 
comparisons were made at the other age levels. 
The difference between the mean difference 
scores at the 3-4 year level for boys and girls 
tested by women was not significant (t = .50). 
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The difference between the mean difference 
scores for boys and girls tested by men was 
also not significant (¢ = .54). At the 9-10 year 
level, the difference between the mean differ- 
ence scores for boys and girls tested by women 
was not significant (t 1.27), nor was the 
difference between the mean difference scores 
for boys and girls tested by men (¢ = 1.13). 

The second prediction was thus only partly 
confirmed. At the 6-7 year level, and only at 
this CA level, women were more effective in 
modifying the behavior of boys than of girls. 
Social reinforcement provided by men did not 
have a significantly greater effect in modifying 
the behavior of girls than of boys at any of the 
CA levels. 

The third prediction was that at the 9-10 
year level boys tested by men would show a 
greater increment in response than would 
girls tested by men, and conversely, girls 
tested by women would show a greater 
increment in response than would boys tested 
by women. This prediction was not supported. 
The tendency was for the cross-sex effect sug- 
gested by the results at the 6-7 year level to 
be maintained at the 9-10 year level, but by 
the 9-10 year level the variability among Ss 
was so great that the differences were not 
statistically significant. 

The average difference scores obtained for 
all Ss tested by men showed a consistent in- 
crease across the three age levels from —.04 to 
-90 to 1.55, while the average difference scores 
obtained for Ss tested by women showed a 
decrease across the three age levels from 1.38 
to 1.29 to .80. The increase for the men from 
the 3-4 to the 9-10 year levels is highly signifi- 
cant (! = 4.17, df = 166, p <.001), but the 
corresponding decrease for the women is not 
significant (¢ = 1.23). Social reinforcement 
delivered by men therefore became increasingly 
effective as the Ss’ CA increased, but no signifi- 
cant changes in the general effectiveness of 
social reinforcement delivered by women as a 
function of Ss’ CA was found. 


Base Rate 


The question may be raised whether differ- 
ences in performance among the various sub- 
groups may not also have been manifest during 
the base rate period. The Es may have in- 
duced different levels of motivation during 
the initial stages of the task which would be 
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reflected in different base rates of response. | 
The data in Table 1, which presents the aver- ' 
age base rates for each of the subgroups, © 
indicate that this was not the case. Although, © 
as might be expected, the rate of response | 
increased with increasing CA, the values for | 
the subgroups within an age range are remark- . 
ably similar. None of the tendencies found in — 
the analysis of the difference scores is seen. 3 

An analysis of variance of the base rates | 
was performed to determine whether the 
apparent similarities in performance were 
indeed nonsignificant, as well as to determine - 
whether the mean rates of response of Ss. 
tested by different Es differed significantly, - 
This analysis is summarized in Table 2. Thei e 
was no significant difference between the base 
rates of boys and girls, nor between the base - 
rates of Ss tested by men and women. The. 
difference associated with CA is highly signifi- — 
cant, with each age level showing an increase © 
in rate of response. "s 

There is a highly significant difference in | 


TABLE 1 
Mean Base Rates FOR Eacu SUBGROUP 
Sex of E and S 34 6-7 9-10 
Male E 
Boys 12.4 26.0 36.0 
Girls 14.4 27.1 34.0 
Female E 
Boys 11.6 24.3 34.3 
Girls 12.3 26.1 35.7 
TABLE 2 
ANALYSIS OF VARIANCE OF BASE RATE SCORES 
Source df MS Error F 
Sex E (A) 1| 10.50 | D .64 
Sex S (B) 1 | 97.73 | BXD .01 
Age S (C) 2 137.50 | CXD 1265. 72*** 
Individual E (D) | 10 | 173.66 5,259098 
Du 5 | 235.05 T1252 
Dr 5 | 12:27 3.40** 
AXB 1| 24,90 | BXD E 
AXC 2]| 34.68 | CXD 2.08 
BXC 2 | 42.03 | BXCXD +50 
BXD 10 | 48.69 1.47 
CXD 20 | 16.70 .80 
AXBXC 2| 5110 | BXCXD „61 
BXCXD 20 83.20 2,5254 
BXCXDx 10 | 41.63 1.26 
BXCXDr 10 | 124.78 3,78***. 
Within 432 | 33.05 
**p «01 
*** p < .001 
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the average base rates of Ss tested by different 
Es. Significant differences occur among the 
base rates of Ss tested both by men and by 
women. Since the Ss tested by each E were 
determined at random, these differences in 
base rates cannot be attributed to charac- 
teristics of Ss. It must be concluded that the 
Es differed in ways which had significant 
effects on the rate with which Ss responded. 

Finally, there is a highly significant inter- 
action between Sex of S, Age of S, and In- 
dividual E. It is evident in the breakdown of 
this interaction according to sex of E that 
the major contribution to this interaction is 
derived from the women Es. The average base 
rates of Ss tested by men were rather evenly 
distributed within each age and sex grouping. 
For Ss tested by women, however, the dis- 
persion of average base rates was similar for 
boys and girls at the 3-4 year level, but 
increased more for girls than for boys at the 
6-7 and 9-10 year levels. 


Difference Scores 


An analysis of the changes in difference 
scores through the period in which social re- 
inforcement of Ss’ responses occurred in- 
dicates an increase in the average difference 
score from .35 for the second minute to 1.30 
for the sixth minute. This increase is highly 
significant (/ = 4.62, df = 503, p <.001). The 
difference scores increased from the second 
through the sixth minute in 11 of the 12 sub- 
groups. The only group in which a decrease 
through this period occurred was for girls at 
the 9-10 year level tested by women. Here 
the change was minimal; the average differ- 
ence score for the second minute was .14, and 
for the sixth minute, .01. 

Since the effect of social reinforcement was 
greatest during the sixth minute of the game, 
an analysis of variance was performed on the 
difference scores obtained for this period. 
Because of the lack of homogeneity of variance, 
a logarithmic transformation of the data was 
performed. The analysis is summarized in 
Table 3. 

There was no significant difference in the 
sixth-minute difference scores for boys and 
girls or for Ss tested by men and women. There 
Was a significant difference in the difference 
Scores of Ss tested by individual Es, a signifi- 
cant double interaction between Individual E 
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TABLE 3 
ANALYSIS OF VARIANCE OF DIFFERENCE SCORES 
FOR SIXTH MINUTE 
Source df | MS Error F 
Sex E (A) 1|.02| D B 
Sex S (B) 1|.06| BXD 2.14 
Age S (C) 2 |.04| CXD Br 
Individual E (D) 10 | .013 6.84*** 
Du 5 | .010 5.26999 
Dr 5 |.015 7.89*** 
AXB 1 |.02| BXD 4.28 
AXC 2 |.01| CXD 31 
BXC 2 |.005| BXCXD | 1.78 
BXD 10 | .003 1.47 
CXD 20 |.099 52. 10*** 
CX Du 10 | .005 2,63** 
CXDr 10 | .015 7.89*** 
AXBXC 2 |.00| BXCXD | 1.19 
BXCXD 20 | .008 4.4200" 
BXCXDm 10 | .004 2.10* 
BXCXDr 10 | .013 6.84*** 
Within 432 | .002 
* p< .05. 
p< 01. 
se p < 001. 


and Age of S, and a significant triple inter- 
action between Individual E, Age of S, and 
Sex of S. 

The tendency was for the dispersion of the 
average difference scores to increase with 
increasing CA. The range tended to be more 
restricted, however, for boys than for girls 
at the youngest age level and the increase in 
range from one age level to the other tended 
to be greater for boys than for girls. In general, 
the findings are similar to those obtained in 
the analysis of the base rates in that the dis- 
persion of average difference scores obtained 
for Ss tested by women tended to be greater 
than that obtained for Ss tested by men and 
in that the dispersion tended to be greater for 
girls than for boys at all ages. 

An inspection of the sixth-minute difference 
scores indicates that there was some con- 
sistency in the ordering of the average differ- 
ence scores of Ss tested by individual Es at the 
3-4 and 9-10 year levels. Although the number 
of Es is small, rank order correlations were 
computed to give an indication of the degree 
of consistency found. The correlation for boys 
at the 3-4 and 9-10 year levels tested by men 
was .81 (p <.05), and for boys tested by 
women, .60. The correlation for girls tested by 
men was —.71, and for girls tested by women, 
— .16. In other words, the ordering of Es tended 
to be similar at the early and late ages for 


152 Hangorp W. 


boys, while the ordering for girls indicated 
that women who produced high difference 
Scores at the younger age levels tended to 
produce low difference scores at the highest 
age levels and vice versa. 

The correlation between the number of 
marbles inserted during the first minute of the 
game and the difference score for the sixth 
minute was —.40. This r is highly significant 
(p <.001) and indicates a tendency for Ss 
with low base rates to have high increments in 
response and for Ss with high base rates to 
have lower increments in response. 


DISCUSSION 


The results at the 6-7 year level are similar 
to those reported by Gewirtz, Baer, and Roth 
(1958) for 4-6 year old children tested fol- 
lowing a 20-minute period of social isolation. 
Since the index of the effectiveness of social 
reinforcement differs in the two studies, it is 
possible to compare the results only in terms 
of direction and not in terms of the absolute 
changes in behavior. In both studies social 
reinforcement provided by women had a sig- 
nificantly greater effect on the behavior of boys 
than of girls, and there is a tendency, not 
significant in either study, for social reinforce- 
ment provided by men to have a greater effect 
on the behavior of girls than of boys. The 
major discrepancy between the results from 
the two experiments is that the cross-sex 
effect was reliable in the Gewirtz, Baer, and 
Roth study only following social isolation 
and was not reliable when Ss were tested 
without such isolation. The basis of this 
difference is not clear, and the present data 
indicate that deprivation immediately prior 
to testing is not a necessary condition for the 
effect to emerge. 

The results provide some support for the 
predictions derived from psychoanalytic 
theory. There is clear support of the prediction 
that women are more effective than men in 
modifying the performance of boys and girls 
at the 34 year level in this type of task. As 
discussed above, the results are also in line 
with the prediction that adults should have a 
greater effect on children of the opposite sex 
at the 6-7 year level. The results for Ss tested 
by men are in the appropriate direction, but 
are not reliable. The strong effect of women 
on boys’ performance at this age level, as 
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contrasted with the more moderate effect of 
men on girls’ performance, has some basis in 
the Freudian assumption that the Oedipal 
relationship is more intense with boys than 
with girls. The results do not support the 
prediction that the behavior of children at the 
9-10 year level would be modified to a greater 
degree by social reinforcement provided by 
adults of the same sex as S compared to adults 
of the opposite sex. The view that the 9-10 
year old children's close affective relationship 
with the like-sexed parent results in adults of 
this sex having a greater effect on the perform- 
ance of the child is not supported. 

The question arises as to whether other 
positions than the Freudian one may not 
provide a more satisfactory account for the 
results. Gewirtz and Baer (1958) have 
hypothesized that the effectiveness of social 
reinforcement is increased by social depriva- 
tion. A simple extension of this hypothesis 
may account for the cross-sex effect discussed 
above. It may be hypothesized that the effec- 
tiveness of social reinforcement provided by 
an adult in modifying children's performance 
is a function of the degree of social deprivation 
children have for contacts with members of 
the adult's sex. 

It must be assumed that during the early 
years of the child's life the parent functions 
more frequently as a caretaker than as a 
Source of social satisfaction for the child. As 
discussed earlier, the effectiveness of the 
mother as a reinforcing agent during the early 
years of the child'slife may be posited as being 
a result of her gaining in secondary reinforcing 
value because of her presence in the satisfac- 
tion of the child’s basic needs. The father has 
a relatively less important role at this time 
and only later does he begin to play a signifi- 
cant part in the child’s daily activities. The 
increased effectiveness of the male Es with 
increasing CA of the child was found in this 
study. As the child grows older and becomes 
capable of satisfying his own basic needs or of 
delaying gratification of his needs, the im- 
portance of both parents as social agents 
increases. 

Social forces begin to operate after the first 
years to direct boys’ activities towards those 
of other boys and men and to direct girls’ 
activities’ towards those of other girls and 
women. Masculine contacts and behavior 
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are reinforced for boys and not for girls, and 
feminine contacts and behavior are reinforced 
for girls and not for boys. The boy is relatively 
more deprived of contact with females than 
with males and the girl is relatively more 
deprived of contact with males. This is as- 
sumed to be maximal during the early school 
years, for it is perhaps at this time that there 
is the greatest emphasis placed upon the child’s 
adopting an appropriate sex role. Because of 
this deprivation, the effectiveness of women 
as reinforcing agents for boys and of men as 
reinforcing agents for girls is assumed to in- 
crease. The strength of the cross-sex depriva- 
tion is assumed to be greater for boys than for 
girls. Feminine activities and contacts are 
more likely to be discouraged for boys than 
masculine activities and contacts are for girls, 
Further, the boy has had a closer relationship 
and more frequent contact with the mother 
during the first years of life than the girl has 
had with the father, thus being forced to 
decrease or relinquish such relationships and 
contacts should have a greater effect on boys 
than on girls, 

This general argument leads to an odd 
paradox, for it would be predicted that during 
these years the more effective reinforcement 
of masculine behavior for boys would be 
derived from the mother and the more effective 
reinforcement of feminine behavior for girls 
would be derived from the father. It may be 
important, therefore, to distinguish between 
the model for identification and the source of 
reinforcement for adopting behavior con- 
gruent with this model. Although the father 
may provide the model for masculine behavior 
for the young boy, the degree to which the 
boy identifies with this model may be a func- 
tion of the degree to which the mother re- 
inforces masculine behavior. 

The results give several indications that 
there are differences between boys and girls 
in the degree to which the characteristics of 
individual adults influence their performance. 
The adults who were effective with younger 
girls tended to be less effective with older 
girls, while the adults who were effective with 
younger boys tended to remain effective with 
older boys. In view of the fact that the per- 
formance of 3-4 year old boys tested by differ- 
ent Es did not differ to so great a: degree as 
did the performance of the young girls, it may 
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be hypothesized that boys and girls differ- 
entiated the characteristics of the particular 
Es equally well, but that the performance of 
the young girls was influenced to a greater 
degree by the characteristics of particular Es 
than was the performance of young boys. The 
fact that the average difference scores obtained 
by the various groups of girls tended to differ 
to a greater degree at all age levels than did 
those obtained by the groups of boys indicates 
that the performance of girls in general was 
affected to a greater degree by particular 
characteristics of the individual Es than was 
the performance of boys. An examination of 
such factors as E's age, appearance, experience 
with children, scholastic status, or scores on 
the MMPI gave no indication of what charac- 
teristics of the Es produced these differences 
in base rates and difference scores. Further, 
there are no data which clarify why men in 
general tended to produce more homogeneous 
grouping of performance in children than did 
women. 


SUMMARY 


Six male and six female Es tested seven boys 
and seven girls at each of three age levels 
(3-4, 6-7, and 9-10 years) in a simple game. 
Following an initial minute during which a 
base line for performance was established, E 
made two of five standard supportive com- 
ments about S’s performance each minute for 
the next 5 minutes. The increment in response 
occurring after the first minute of the game 
was used in the analysis of the results. The 
average difference score was greater for Ss 
tested by women than by men at the 3-4 
year level, and was significantly greater for 
boys than for girls tested by women at the 
6-7 year level. The increment was greater at 
the 6-7 year level for girls than for boys tested 
by men, but the difference was not significant. 
At the 9-10 year level the differences in 
performance associated with sex of Æ and sex 
of S were not statistically significant. An 
analysis of the increment of response during 
the sixth minute of the game indicated signifi- 
cant differences in the scores obtained by Ss 
tested by different Zs, and significant inter- 
actions between individual Es and S's CA, 
and between individual Es, S’s CA, and S’s 
sex, The results were interpreted in terms of 
the hypothesis that the effectiveness of social 
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ACQUIESCENT RESPONSE SET AS A MEASURE OF ACQUIESCENCE: 


ROBERT J. FOSTER? 
University of Texas 


analysis of the effect of response set on 

psychological tests, he used the label 
“acquiescent response set” to designate the 
tendency to give affirmative answers to test 
items independent of item content. Most 
investigators have subsequently freely used 
the term “acquiescence” to refer to this 
response set and have explicitly or implicitly 
assumed that the degree of acquiescent re- 
sponse set reflects a personality dimension 
such as acquiescence, conformity, or sub- 
missiveness, 

Gage, Leavitt, and Stone (1957) and Leavitt, 
Hax, and Roche (1955) suggest that acquies- 
cent response set (here designated ARS) on 
the California F Scale reflects the “authori- 
tarian submission” component of the authori- 
tarian personality. The former claim that 
their data “‘... provides some support for 
the conceptualization of acquiescence [ARS] 
as akin to ‘yielding’ or ‘conformity’ to group 
Pressure studied by Asch and Crutchfield.” 
Others (Bass, 1956; Jackson, Messick, & 
Solley, 1957; Zuckerman, Norton, & Sprague, 
1958) have indicated their general agreement 
with this line of thought. The central purpose 
of this study is to test this assumption that 
ARS reflects an individual's tendency to 
acquiesce by examining the relationship be- 
tween ARS and behavioral measures of con- 
formity and compliance. 

The extent to which ARS is a highly gen- 
eralized or multidimensional tendency, as 
assumed in most of the literature, has been 
questioned (Bass, 1956; Hanley, 1959). 
Jackson and Messick (1958), especially, have 
Stressed that response set may be differentially 
elicited by certain content categories of formal 
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characteristics of the test item. The possible 
differential effect of overgeneralized as opposed 
to probabilistic wording of items (e.g. all 
men are conceited vs. some men are conceited) 
and the possible interaction between ARS and 
the set to give the most socially desirable 
answers were suggested by these authors as 
dimensions needing further research. 


METHOD 
ARS Measures 


An aphorism questionnaire similar to the one by 
Bass (1956) and Carey, Rogow, and Farrell (1957) and 
a difficult true-false test adapted from the Information- 
True Test used by Gage et al. (1957) were chosen as 
two measures of ARS which were dissimilar in content 
and response alternatives. The aphorism questionnaire 
had the advantage of proven reliability, of being a 
realistic (not artificial) task for the subjects, and of 
containing items somewhat similar to the California 
F Scale with which most recent research (cf. Christie, 
Havel, & Seidenberg, 1958) on ARS has been 
concerned. 

The final form of the aphorism questionnaire con- 
sisted of 72 heterogeneous items about human behavior. 
Each item had been classified by judges as being socially 
desirable (to answer “agree’’), neutral, or socially 
undesirable, and as being overgeneralized, prob- 
abilistic, or not worded in either fashion (“neither”), 
For social desirability, 43 students drawn from the 
same population as the subjects used in the main study 
but from a different sample were asked to indicate 
whether it would be best to agree or disagree to each 
statement in order to give the best impression of one- 
self, or whether it would make any difference how the 
statement was answered, Then, three graduate students 
in psychology classified these judgments about each 
statement into desirable items (over 50% "agree" 
votes), undesirable items (over 50% “disagree” votes), 
and neutral items (over 33% “no difference" votes with 
“agree” and “disagree” votes being about equally 
balanced). To determine the extent of overgeneraliza- 
tion about an item, five to seven graduate students in 
psychology were asked to rate the form (rather than 
the content per se) of each statement on a nine-point 
scale on which lower values reflected probabilistic 
statements, higher values indicated overgeneralized 
statements, and central values were classified as 
“neither.” From these judgments, the experimenter 
classified each item as overgeneralized, probabilistic, 
or neither. In this manner, there were then eight items 
for each of the nine possible combinations among these 
classifications. Each of the six subscales consequently 
consisted of 24 items of the same type on one dimension 
but counterbalanced on the other dimension (e.g., the 
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overgeneralized subscale had an equal number of items 
for each degree of social desirability). 

The true-false test consisted of 78 items, 46 critical 
(extremely difficult) items—which Gage et al. (1957) 
found to elicit an even division between true and false 
responses—intermixed with 32 noncritical (easier) 
items which served to make the task seem more realis- 
tic. Half of the critical items were false, half were true. 
None of the items contained qualifying words such as 
“sometimes,” “always,” “every,” etc. A subject's 
ARS was indicated by the frequency with which true 
responses were given to the critical questions. 


Measures of Behavioral Acquiescence 


Two measures of compliance were used: a request 
to subjects that they agree to participate in an addi- 
tional short psychological experiment and a request 
that they complete a short essay and return it before 
a designated date. 

An Asch-like conformity situation using an 
apparatus similar to that described by Crutchfield 
(1955) served as the chief measure of behavioral acquies- 
cence. Subjects were told they were participating in a 
communication experiment. Five subjects of the same 
sex in separate cubicles were asked to indicate their 
judgment about various stimuli by closing an electrical 
switch which would indicate their answers. Subjects 
were told to answer as their turn came and were under 
the impression they knew the other subjects’ answers 
from the illumination of lights on a panel before them 
which were supposedly controlled by the other subjects. 

The experimenter actually controlled these lights 
and simulated the answers that were supposedly given 
by other subjects. Noncritical trials (correct answers 
simulated) were interspersed with critical trials (incor- 
rect answers simulated). All subjects believed them- 
selves to be the last subject to answer on the critical 
trials and consequently were faced with making a 
choice between an apparently correct answer and the 
answer (incorrect) supposedly given by all other 
subjects. A detailed description of this procedure is 
found elsewhere (Crutchfield, 1955; Foster, 1960). 

The stimuli consisted of 35 cards, 16 of them critical, 
asking for judgments on a wide variety of tasks includ- 
ing the relative area of geometric figures, the number 
of dots in a given area, the logical sequence of num- 
bers, attitudes about war, human behavior and stand- 
ards at their university, the relative size of angles, 
and preferences for geometric designs and colors. 
Although it would tend to lower reliability, a hetero- 
geneous mixture of stimuli varying in content, level of 
difficulty, degree of factual anchorage, etc. was desired 
to make the task more representative and to remove 
any systematic influence of confounding variables. 

Each of the critical cards had been pretested in a 
control situation in which judgments were made with- 
out the knowledge of simulated group norms to insure 
that correct responses could be easily detected. In the 
case of attitude or opinion items, the response that was 
most frequently given under private conditions was 
considered as the actually “correct” response. All items 
were chosen because they elicited extreme skewness in 
response. Most attitude cards consisted of statements 
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selected by Blake, Helson, and Mouton (1957) from 
the Thurstone-Chave list of attitudes toward war, or 
aphorisms selected by the experimenter from the 
aphorism questionnaire. To control for ARS, the 
“correct” answer was “agree” on half of the critical 
attitude cards and “disagree” for the other half. Con- 
forming responses were scored according to a weighting — 
system previously established by judges; nonconformity 
responses were weighted zero. f. 

Immediately following the conformity measure, 
subjects were asked to write brief answers to four 
questions designed to find out whether they had sus- 
pected that the apparatus was “fixed” and the instruc- 
tions deceptive. Those subjects who were considered 
by judges to have suspected that the electrical wiring 
was “fixed” or that confederates were used were elimi- 
nated from the analysis. 


Subjects 


One hundred and fifty freshmen, 75 males and 75 
females, were selected from introductory psychology 
courses on the basis of their being available at the time 
testing was scheduled. Subjects participated in order 
to meet course requirements. 


Procedure 


The sequence of procedures to which the subjects 
were subjected, is listed below: 
First Session: 

1. One hundred and fifty subjects were asked to 
volunteer for an additional experiment 
(Compliance 1). 

2. Subjects took the Aphorism Questionnaire. 

3. Subjects took the true-false test. 

4, Subjects were requested to write a short essay 
and return it to the experimenter by a given date 
(Compliance 2). 

Second Session (about 5 weeks later): 

1. Fifty-five females and 57 males from the above 
subjects (selected on the basis of their availability 
at scheduled times) were subjected to the influence 
of simulated group norms in an Asch-like con- 
formity situation. i 

2. Subjects answered questions about their experi- 
ence and reaction to the conformity situation. 
(Seven females and 18 males suspected that the 
conformity situation was rigged; their data were 
discarded.) 


RESULTS AND DISCUSSION 

Reliability of the Instruments 

Since the psychological equality of “agree 
strongly” and “agree moderately” responses 
has not been established, each response cate’ 
gory was scored separately as well as in the 
traditional manner of ignoring the intensity of 
the response. 

The corrected split-half reliability of the 
various instruments used in this study are 
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TABLE 1 

Rewiasitity OF ARS AND Conrormiry MEASURES* 
All 
ARS measures Mb | SD» |Mates| FS | Sub- 
jects 
Information-True test 12.94 | 6.23 | .65 | .71 | .69 

Aphorism Questionnaire 
Strong agree responses 13.33 | 7.54] .90 | .83| .87 
Moderate agree responses | 24.81 | 6.88] .77 | .73| .75 
ARS-total 38.50 | 7.38 | .67| .88 | .77 
Conformity 11.73 | 6.97 | .77 | .81| .79 


* Coefficients are split-half Pearson product-moment correla- 
tions corrected for test length by the Spearman-Brown formula. 
N = 75 for both males and females and 150 for all subjects, except 
for the conformity scale where N = 39 for males and 48 for females. 

> Data are for all subjects combined. 


a 
TABLE 2 
INTERCORRELATIONS AMONG ARS SCORES ON 
SUBSCALES OF THE APHORISM QUESTIONNAIRE: 
THE EXTENT OF OVERGENERALIZATION^ 


Subscales 
Neither Probabilistic 
Males 
Strong agree responses 
Overgeneralized .70 71 
“Neither” .73 
Moderately agree responses 
Overgeneralized 153 .50 
“Neither” .51 
Females 
Strong agree responses 
Overgeneralized .61 57 
“Neither” .55 
Moderately agree responses 
Overgeneralized .46 .46 
“Neither” +35 


^ Coefficients are Pearson product-moment correlations. N = 
75. A coefficient of .27 is statistically significant from zero at the 
*01 level (one-tailed test). 


given in Table 1. All instruments appear to 
have obtained a satisfactory level of internal 
Consistency. The label “ARS-total” is used 
to designate the score derived from treating all 
agree responses as the same set, the traditional 
approach of measuring ARS. 

It was not possible to obtain an index of 
the reliability of the compliance measures 
Since only nine subjects complied with the 
request to complete and return the essay 
(Compliance 2). However, eight of these nine 
Subjects also complied with the request to 
Volunteer for an extra experiment (Com- 
Pliance 1) suggesting some common variance 
between the two situations. The point biserial 
Correlations between Compliance 1 and the 
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TABLE 3 
INTERCORRELATIONS AMONG ARS SCORES ON 
SUBSCALES OF THE APHORISM QUESTIONNAIRE: 
THE EXTENT OF SOCIAL DrsmasiLITYA 


Subscales 
pity | Neutral eget ie 
Males 

Strong agree responses 
Socially desirable -75 | .65 (.88)° | .37 (.50*) 
Neutral 4.69 49 (.67*) 
Socially undesirable 4 

Moderately agree responses 
Socially desirable +58 | .56 (1.13) | .26 (.44*) 
Neutral 42 +35 (.70) 
Socially undesirable 59 

Females 

Strong agree responses 
Socially desirable 73 | .79 (1.16) | .46 (1,00) 
Neutral -63 -45 (1.05) 
Socially undesirable -29 

Moderately agree responses 
Socially desirable +59 | .41 (1.02) | .09 (.14*) 
Neutral i «35 (.82) 
Socially undesirable 68 


^ Pearson product-moment correlations, N = 75. In the mat- 
rices .27 is statistically significant from zero at the .01 level (one- 


tailed test). 
b Split-half reliabilities corrected by the Spearman-Brown 


formula. 

° Coefficients in parentheses represent correlations corrected 
for attentuation. An asterisk indicates that this estimate is signifi- 
cantly different from 1.00 at the .005 level (one-tailed test). 


conformity measure is .16 (males) and —.11 
(females), a nonsignificant finding at variance 
with other reported results (cf. Blake & 
Mouton, 1959). This finding is not surprising, 
however, since specific factors such as one’s 
degree of interest in psychology or the sex of 
the examiner—which may have been non- 
relevant in the conformity situation—could 
have differentially motivated subjects to 
volunteer to participate in the extra experi- 
ment. 


Differential Effect 
on ARS 

The interrelationships among the ARS scores 
derived from each subscale of the Aphorism 
Questionnaire are given in Tables 2 and 3. 
Examination, in Table 2, of the correlations 
among scales composed of statements worded 
in an overgeneralized, probabilistic, or neutral 
(“neither”) manner, reveals that these item 
characteristics did not differentially affect an 
individual’s tendency to give acquiescent re- 
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sponses since all coefficients within each 
matrix are of approximately equal size. 

These findings indicate that research in- 
volving ARS probably need not be concerned 
with these item characteristics when a college 
population is used. However, they appear to 
be contradictory to the recent findings of 
Couch and Keniston (1960), who found that 
the "item tone" tended to differentially elicit 
agree or disagree responses. They describe 
the tone dimension in terms of being extreme 
and enthusiastic as opposed to guarded, quali- 
fied, and cautious, a dimension apparently 
similar (but not identical) to the overgeneral- 
ized probabilistic dimension in this study. 

Table 3 suggests a different conclusion with 
respect to social desirability as an item charac- 
teristic. Coefficients within each matrix in 
Table 3 vary considerably in size, suggesting 
a possible confounding of ARS with the item 
variable. Since, however, the variation in 
correlation size could be a function of differ- 
ences in the reliabilities for each of the sub- 
scales, split-half reliability correlations for 
each subscale were computed and examined in 
the following manner. 

If the degree of social desirability of the 
items differentially affected the subjects’ 
tendencies to give acquiescent Tesponses, the 
correlation between any two subscales should 
be significantly lower than that which would 
be predicted if the two subscales were meas- 
uring identical reliable variance. The pro- 
cedure used to test whether the subscales, 
when scored for ARS, measure in part, differ- 
ent things was as follows: The correlations 
between the subscales were corrected for 
attenuation. The resultant coefficients are 
shown in parentheses in Table 3. The upper 
confidence level (.005) for the corrected corre- 
lation was then established by computing 
the upper limits for the original correlation 
and then correcting it for attenuation (a 
conservative test since the larger standard 
error was used). If the established upper limits 
of confidence falls short of 1.00 (the theoreti- 
cally expected correlation if both scales 
measure identical variance), it would follow 
that the item characteristic (social desirability) 
differentially affected the subject’s tendency 
to give acquiescent responses (an inference 
statistically significant at the .005 level, one- 
tailed test). 
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Examination of Table 3 reveals that the 
upper confidence levels of three of the four 
corrected correlations between the socially 
desirable and the socially undesirable sub- 
scales were below unity. In addition, one 
corrected correlation between neutral and 
socially undesirable scales (strong agree males) 
was significantly lower than unity. The failure, 
among females, of the upper confidence level 
of the corrected correlation between the so- 
cially desirable and socially undesirable sub- 
scales to be below 1.00 was a function of the 
very low reliability of the latter subscale 
among female subjects. 

While the differential effect of the social 
desirability of item content is demonstrated, 
it must also be remembered that any uncor 
rected correlation of .27 or larger in Table 3. 
indicated a statistically significant tendency 
(01 level, one-tailed test) for ARS to be 
general across the subscales in spite of the 
small number of items in each subscale, 
Nevertheless, any purported measure of ARS 
should always be qualified as being limited 
to a particular type of stimulus situation until 
the generality of ARS is more firmly estab- 
lished. More research is needed which is 
directly concerned with the measurement 
aspects of ARS before studies involving the 
correlates of ARS can be meaningfully inter- 
preted. 


Relationship of ARS to Behavioral Ac 
quiescence | 
The relationship between the two measures 
of ARS and the two behavioral measures of f 
acquiescence is given in Table 4. Only one of 
the 64 correlations in the table (not all of 
which are independent) is statistically signifi- 
cant. The one significant correlation, that 
between conformity and the tendency for 
female subjects to give strongly agree responses 
to the overgeneralized aphorisms, is not 
significant for male subjects. In view of this. 
fact and the large number of coefficients 
computed, this coefficient probably represents 
a Type I error. Selected scatterplots fail 
to suggest any curvilinear relationships. : 
If one accepts the conformity measure âs 4 
an appropriate operational definition of ac 
quiescence, it would appear from the present 
data that the assumption in the literature that 
ARS is a measure of acquiescent tendencies 
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TABLE 4 
CORRELATIONS BETWEEN MEASURES OF ARS AND MEASURES OF BEHAVIORAL ACQUIESCENCE* 
Compliance 1 Conformity 
Male Female 
W = 75) W = 75) ates) Os) 
Information-True test — .09 =-22 zu 28 
Aphorism Questionnaire 
Strongly agree score > —.02 115 —.17 
Socially desirable — .04 LO) —.18 
Neutral —.09 -— 19 —.10 
Socially undesirable 13 .09 .08 
Overgeneralized —.01 zi —.24 
Neither —.05 —.13 14 
Probabilistic .02 -A7 .08 
Moderately agree score .01 .03 .01 
Socially desirable —.15 .07 .03 
Neutral —.08 -03 — .06 
Socially undesirable — .08 +15 —.22 
Overgeneralized — .03 .01 —.13 
Neither —.17 18 —.05 
Probabilistic —.14 .10 —.07 
ARS-total score —.01 —.04 —.19 


," Coefficients under Conformity are Pearson product-moment correlations; those under Compliance 1 are point biserial correla- 
tions. A product-moment correlation of 22 is significant at the .05 level when N = 75; 28 is required when N = 48; and 32, 


when N = 39. 


is invalid, and that the label “acquiescence” 


or “acquiescent component” in reference to 
ARS is inappropriate and misleading. Of 
course, it could be that “acquiescence” is an 
inappropriate label to apply to the conformity 
operation, but this seems to be the least likely 
alternative. Irrespective of labels, from this 
study, it seems fairly safe to conclude that 
individual differences which are observed in 
these two operations do not reflect the same 
personality dimension or behavioral tendency. 


SUMMARY AND CONCLUSIONS 


_ The consistent tendency of individuals to 
give affirmative responses to statements or to 
questions independent of the test item content 
is termed acquiescent response set (ARS). 
It has been frequently treated in the research 
literature as a measure of a generalized tend- 
ency to acquiesce even though studies have 
not established that ARS and behavioral 
acquiescence reflect similar response mecha- 
nisms. This study investigates the validity 
of ARS as a measure of behavioral 
acquiescence. 

A conformity situation of the sort devised 
by Asch, in which subjects were placed under 
social pressure to conform to simulated in- 
Correct answers of others, and a compliance 
Situation involving one’s willingness to agree 


to a request, were used as behavioral criteria 
for acquiescence. 

The present study also investigated the 
differential effects of two item characteristics, 
social desirability and probabilistic vs. over- 
generalized wording, on the individual's 
relative tendency to give acquiescent re- 
sponses. 

Two measures of ARS used in the investiga- 
tion were a true-false test containing extremely 
difficult items and an aphorism questionnaire 
that was counterbalanced for social desirability 
and for the extent to which the wording was 
probabilistic or overgeneralized (e.g., “Some 
men are self-centered” vs. “All men are self- 
centered”). On the aphorism questionnaire, 
subjects marked specified degrees of agreement 
or disagreement. Subjects were 150 college 
freshmen. 

Conclusions from the data are as follows: 

1. On the aphorism questionnaire the over- 
generalized-probabilistic dimension did not 
differentially affect the relative tendency of 
subjects to exhibit ARS. 

2. The extent to which answers to items were 
socially desirable tended to have a differential 
affect on an individual’s relative ARS score, 
i.e, subjects tended to be ordered differently 
on socially desirable items than on socially 
undesirable items. 
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3. With one exception, none of the measures 
of ARS correlated with the behavioral meas- 
ures of conforming or complying, regardless 
of the response alternative or the item charac- 
teristic used in obtaining the ARS scores. This 
one exception, when the large number of 
coefficients computed is considered, probably 
represents a Type I error. 

4, More research concerned directly with 
the measurement and the generality of ARS 
in different stimulus situation is needed before 
studies involving personality correlates of ARS 
can be meaningfully interpreted. 

5. The results contradict the assumption 
made in the literature that ARS is a measure 
of a generalized tendency to acquiesce and 
suggest that the use of the label of “ac- 
quiescence” or “acquiescent component" in 
reference to ARS is misleading and inappro- 
priate. 
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REACTION TIME IN SCHIZOPHRENIC AND NORMAL SUBJECTS 
IN RELATION TO THE SEQUENCE OF SERIES OF REGULAR 
PREPARATORY INTERVALS 


THEODORE P. ZAHN, DAVID ROSENTHAL, anp DAVID SHAKOW! 
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NE of the most important variables 
that differentially affects the reaction 
time (RT) of schizophrenic as com- 


spared to normal subjects is the length of the 


preparatory interval (PI). A classic finding has 
been that if PIs are presented in a regular 
fashion (that is, the PI is constant for a given 
series of trials), RT is an increasing function 
of the length of PI. If, however, the PIs of 
successive trials are mixed in an irregular 
manner, RT is a decreasing function of the 
length of the PI. These relationships have 
been found to be more pronounced for schizo- 
phrenics than for normals with actual crossover 
of the curves for the two conditions in the 
former group. This experiment, which has been 
replicated by different groups of investigators, 
was first performed by Huston, Shakow, and 
Riggs (1937), who interpreted the results as 
indicating that schizophrenic, in contrast to 
normal subjects, had difficulty in maintaining 
the high level of set (preparation to respond) 
required for optimal RT performance on the 
long PIs in thé regular procedure. Inability to 
maintain a set has been considered by Shakow 
(1950) to be one of the primary features of the 
schizophrenic disorder. 

It is«clear, then, that the effects of the PI 
depend not only upon its absolute length but 
also upon the context in which it occurs. A 
previous paper (Rosenthal, Lawlor, Zahn, & 
Shakow, 1960) advanced the suggestion that 
each of the two contexts mentioned above 
(regular and irregular) was governed by its 
own set of variables and that they could be 
fruitfully studied separately. The main con- 
cern of the present paper is with the effects of 
varying the context of the PIs in a regular 
procedure. More specifically, it is with the 
effects of varying the order of presentation 
of successive series of RT trials with PIs of 
varying lengths. 

1 The authors wish to acknowledge the assistance of 


Daniel Primac and Lois Raphling in the collection and 
analysis of the data. 


ExPERIMENT I 


The first study was originally undertaken 
to explore further previous findings of a sharp 
rise in RT between PIs of 4 and 7.5 seconds 
(Rodnick & Shakow, 1940; Rosenthal et al., 
1960; Tizard & Venables, 1956). These studies 
suggest a “critical value" of the PI beyond 
which there is a sharp decline in the ability of 
schizophrenic patients to maintain a set. It 
was our purpose to explore the inter- and 
intra-individual consistency of this critical 
value by comparing schizophrenic and normal 
subjects on RT at each PI from 1 to 10 seconds 
in 1 second increments, using the regular 
procedure. 

The PIs were presented in order of increasing 
length (ascending) and of decreasing length 
(descending). Order of presentation was, at 
the outset, thought to be of relatively minor 
importance because of the similar shapes of 
the curves obtained by Tizard and Venables 
(1956), who used different orders of PIs for 
different subjects, Rodnick and Shakow (1940), 
who used a deliberately mixed order, and 
Rosenthal et al. (1960), who used an ascending 
order only. 


Method 


Subjects. Seven chronic schizophrenic and 11 normal 
subjects were used, all white males who resided in the 
Clinical Center of the NIH. The patients had been rig- 
orously screened for organic involvement, physical ill- 
ness, mental deficiency, and for diagnosis. None of the 
patients had been receiving drug therapy for at least 
several months. They had a mean age of 34.0 years 
(range 29-39), a mean education of 11.1 years (range 
7-15), and an average length of hospitalization of 10.1 
years (range 5-17). 

The normal subjects were conscientious objectors 
who had volunteered to serve as control subjects at the 
Clinical Center in lieu of military training. Their mean 
age was 20.9 years (range 19-24) and mean education 
13.0 years (range 11-16). i 

Apparatus. The subjects were required to operate a 
telegraph key, the “body” of which was housed in an 
aluminum casing together with a small speaker which 
was the immediate source of the RT stimulus. The 
speaker was driven by an oscillator and an audio ampli- 
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fier which produced a 300 cycle tone with an intensity 
at the subject's ear of about 70 db. re 0.0002 dynes/cm?. 
The warning stimulus was a 15-watt white light enclosed 
in a box with a translucent front panel and placed di- 
rectly over and behind the key-speaker housing. Timing 
of the stimulus sequence was accomplished by four 
“silent” Hunter interval timers (Model 111C). RT was 
recorded to the nearest 5 milliseconds by a Standard 
Electric Timer. 

Procedure. 'Two methods of testing RT were used. In 
both the subjects task was to lift his finger from the 
telegraph key at the onset of the tone. A 0.2 second light 
flash was the warning stimulus, and the length of time 
between the light and tone was, of course, the PI. 

In Method NR (no ready), the subject was instructed 
to depress the key shortly after his preceding response. 
If the key was depressed within 5 seconds of the pre- 
ceding response, the light-tone sequence automatically 
began 8 seconds after the previous response, regardless 
of when, during the 5 second period, the key was de- 
pressed. If the subject failed to depress the key within 
the prescribed time or failed to keep it depressed 
throughout the cycle, the cycle was broken, and there 
was a 3-second delay before the warning light. 

In Method VR (verbal ready), the subject was in- 
structed that after the experimenter had recorded the 
previous RT and reset the timer, the experimenter 
would say “Ready,” and then when the subject was 
ready he was to depress the key. In this method, when 
the subject depressed the key, there was a 0.3-second 
ats the 0.2 second light flash which initiated 

e PI. 

Each subject was tested in eight sessions, four with 
NR and four with VR. Each session consisted of five 
series of 14 trials each, a different PI being used in each 
series. "There was a rest period of 1-2 minutes between 
series. For purposes of analysis only, the last 10 trials in 

each series were used, the first 4 trials being considered 
as practice for the particular interval. 

Ten different PIs were used, ranging from 1-10 sec- 
onds in 1 second increments. In half the sessions even 
PIs and in half odd PIs were used. In half the sessions, 
PI series were presented in an ascending order, the PI 
being increased from series to series (2, 4, 6, 8, 10); in 
the other half, a descending order (10, 8, 6, 4, 2) was 
employed. 

‘The sequence of conditions was constant for all sub- 
jects. Four sessions on Method NR were given first: 
ascending even, descending odd, descending even, and 
ascending odd, in that order. The sequence for Method 
VR, which followed, was ascending even, descending 
even, ascending odd, and descending odd. 


Results 


The results are summarized in Figure 1, 
which shows the mean RT at each PI for each 
method, order of presentation, and subject 
group. The usual difference between schizo- 
phrenics and normals in mean level of RT 
is apparent, but equally noteworthy are the 
differences between the slopes of the ascending 
and descending orders for the schizophrenic 
subjects. For the ascending orders, RT is an 
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increasing function of PI duration; but for 
the descending orders, this is not the case. The 
normals, on the other hand, have faster RTs 
for the short PIs than for the long ones for 
both orders with both methods. 

With respect to the original question that 
prompted this research, the group curves show 
no unambiguous "critical PI" beyond which 
schizophrenic subjects show a marked loss of 
ability to maintain a set. Furthermore, an 
intensive examination of the data of individual 
subjects showed that although there were 
sharp rises between two adjacent PIs in the 
ascending orders for most of them, the points 
at which these sharp rises occurred varied 
both inter- and intra-individually from session 
to session. 

Method VR produced faster RT in both 
groups (p < .10), but the difference between 
methods was not significantly greater for the 
patients than for the control subjects. 

The effects of the order of PI presentation 
were evaluated for each method separately by 
computing the linear slopes of the curves 
relating PI to RT for each subject.? The differ- 
ences between the two orders were analyzed 
by ¢ tests and summarized in Table 1. The 
slopes are significantly different for the patient 
group only for Method VR, and the differences 
between schizophrenics and normals are also 
significant only for this method. These differ- 
ences for the two methods combined approach 
significance (p < .10). 

In addition to the mean RT, the median 
RT and the mean of the three fastest RTs on 
each series of 10 trials were computed for 
each subject on all conditions. These two 
measures produced curves similar in shape to 
those produced by the means. Analyses of the 
slopes of the curves show that for each of these 
two measures, the difference between ascending 
and descending conditions is significant for 
both Method VR and for the two methods 
combined (p < .05 in each case). The agree- 
ment among these three indices shows that 
the differences in slopes for the two orders are 


* The 1- and 2-second PIs were not included in the 
slope calculations since some schizophrenic subjects had 
disproportionately long RTs to these intervals. Consid- 
erable experience with these shorter intervals indicates 
that this is probably the result of processes different 
from those occurring in the case of the longer Pls. Their 
inclusion would therefore seem to confuse the main 
point of the analysis. 
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p Tot. due merely to a few isolated extremely small NV in the first study, a second experiment 
4 deviant RTs. was carried out to attempt to replicate the 
results with respect to the order variable. 
Experiment II 


, The effect of the two orders of presentation Method 
in Experiment I was not predicted at the Subjects. Twelve schizophrenic and nine normal sub- 
Outset. Because of this, and because of the jects were used. None was taking tranquilizing drugs at 
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TABLE 1 
VALUES OF i FOR SIGNIFICANCE TESTS OF THE DIFFERENCES BETWEEN SLOPES FOR THE 
ASCENDING AND DESCENDING ORDERS 
Method NR Method VR Total 
Even Odd Combined Even Odd Combined Combined 

Schizophrenic «1 1.84 1.15 Whe 2.86* 3.19** 2.22 
Normal <1 «1 «1 —1.98 «1 —1.40 «1 
Schizophrenic vs. Normal* «1 1.74 «1 3510** | 52687" 3.36** 2.17 


* For these comparisons, due to marked inhomogeneity of variance, the formulas for ¢ and for estimating the df were those suggested 


by Dixon and Massey (1951, p. 105). 
*p < .05. 
**p < 0. 


the time of the study. However, most patients had been 
taking them until 1-2 weeks prior to the study and had 
been transferred to a placebo at that time. They were 
all white males, carefully screened for diagnosis, absence 
of organicity, and good general health. The 12 patients 
used here were selected from a larger number of subjects 
on the basis of the experimenter's ratings of their co- 
operation (Shakow & Huston, 1936). They were so 
rated on the first session of the experiment, and only 
those patients who were judged to be trying to give a 
good performance (rated À or B) were tested on all three 
occasions. In all cases, the cooperation level of these 12 
subjects remained acceptable in the subsequent sessions. 
"Their mean age was 47.0 years (range 37-58) and mean 
length of hospital stay was 17.1 years (range 3-30). The 
normal subjects were from the same population as in 
Experiment I. Three of the nine subjects had been sub- 
jects in Experiment I several months previously. 

Apparatus. Although a different apparatus was used 
for this experiment, it was similar in all essential re- 
spects to that used in Experiment I except that no warn- 
ing light was used. The output of an audio-generator 
was amplified to produce a tone of approximately the 
same characteristics as that used in Experiment I. RT 
was measured in milliseconds by a Hunter “Klock- 
counter," 

Procedure. Method VR was used, modified from that 
of Experiment I by the omission of the warning light. 
This was done because the light flash appeared to dis- 
tract some of the more disturbed patients. Hence, in the 
present experiment, the PI was initiated solely by the 
subject's depressing the key. 

Each subject was tested on three occasions from 1 to 
3 days apart within the span of one week. On each occa- 
sion, he was given 10 successive trials on each of five 
different PIs: 1, 2,4, 7, and 15 seconds. One-minute rest 
periods were given between each series of 10 trials. In 
the first session, the PIs were presented in descending 
order (Descending 1), a series of trials on the 15-second 
PI being given first, followed by a series on the 7-second 
PI, etc. The second day, an ascending series (Ascending) 
was given, and in the third session the descending order 
(Descending 2) was used again. At least 10 practice 
trials were given before the first session, and at least 6 
more were given before starting the experimental trials 
on subsequent days. 

The first trial of each series was considered a practice 

trial and was omitted from the calculations. 


Results 


Curves showing the relationship between 
the length of the PI and RT for each of the 
three conditions are shown in Figure 2. For 
the normals, as in Experiment I, the three 
curves are essentially parallel and have small 
slopes. For the patients, on the other hand, 
quite different results obtain for each of the 
three conditions. 

The data were analyzed as before, slopes 
being computed for each individual. The 
logarithm of the PI was used in computing 
the slopes because the function relating RT 
with log PI for the ascending condition was 
much more nearly linear than the RT-PI 
function. Analyses by / tests show the differ- 
ence between the slopes for the Ascending and 
Descending 2 conditions for the patients to be 
significant (p < .01) and for this difference 
to be greater for the patients than for the 
controls (p < .01). Thus these results are 
confirmatory of those of Experiment I. 

The Descending 1 condition, on the other 
hand, produced results at variance with the 
other descending conditions in the two experi- 
ments, the minimum RT being at the 4-second 
PI. Comparisons of the slope of this curve with 
the slopes for the other two conditions produce 
no significant differences mainly because of the 
large variance for this first condition, which is 
3.6 times that of the Ascending condition and 
5.6 times that of the Descending 2 condition. 
In any case, the overall slope of this curve 
(from the 2-second PI to the 15-second PI) 
is not too meaningful because of its nonmono- 
tonic nature. 

Many of the schizophrenic subjects seemed 
to have trouble with the long PIs on De- 
scending 1, presumably because they occurred 
before the sequence of movements involved 
in the RT task had been well learned or because 
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they had not yet adjusted to the new testing 
Situation. Apparently, the 10 or more practice 
trials were not sufficient, even though at least 
4 of them were with a moderately long (8- 
Seconds) PI. The decrease in RT from the 
15-second PI to the 4-second PI for this first 
Condition may be the result of practice or 
gradual adjustment to the test situation. 


In this experiment, as in Experiment I, 
the schizophrenics have slower RT than the 
normals under all conditions. The question may 
be raised as to whether the present results may 
not be attributable, at least in part, to an 
intrinsic slowness in RT or to a fundamental 
psychomotor deficit rather than primarily to 
psychological factors (e.g., maintenance of set) 
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associated with schizophrenia. In other words, 
are the differences between the curves for the 
two groups a function of RT level per se as 
well as the more obvious difference in 
psychiatric condition? As an approach to 
answering this question, a subgroup of the 
patients in Experiment II was compared with 


a subgroup of the normals, matched on RT 
performance under optimal conditions—the 
2-second PI in the Ascending series. The six 
fastest patients had a mean RT of 177.8 
milliseconds for this PI, and the seven slowest 
normals had a mean of 179.4 milliseconds. 
Comparing these groups as to the slopes of the 
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» Ascending and Descending 2 curves yields 


results comparable to those for the total 
groups, as shown in Figure 3. As before, the 
difference between the slopes was significantly 
greater for the patients than for the normals 
(p < .001). This analysis, then, demonstrates 
that even schizophrenics who, under optimal 
conditions, are capable of RT in the normal 
range show the same PI and order effects as 
do a more unselected group.’ 


Discussion 


The results of these two experiments agree 
with respect to both the ascending and 
descending PI series. For the ascending series, 
they show that the RT of schizophrenic pa- 
tients is disproportionately impaired when 
the PI is relatively long. Experiment II shows 
that even patients who are capable of RT in 
the normal range under optimal conditions 
exhibit a gross deficit in performance under 
nonoptimal (long PI) conditions. The results 
for the ascending series are similar to those 
of several earlier studies which have led to the 
inference that schizophrenic subjects suffer a 
disproportionate loss of set during PIs of more 
than a few seconds. 

For the descending series, however, an 
additional phenomenon is demonstrated. Not 
only is the performance of the patients poor 
under nonoptimal conditions, but it remains 
poor when, subsequently, the conditions are 
Shifted to optimal ones. The patients seem 
to be unable to recover from the effects of series 
of long PIs when the PIs are shortened. In 
contrast, the performance of the normal sub- 
jects is virtually identical under the two condi- 
tions. They show no tendency toward impaired 
performance under optimal conditions as a 
consequence of preceding poor conditions. 

A related phenomenon has been demon- 
strated in a recently completed study from this 
laboratory. Here, with an irregular procedure, 
it was found that when a single long PI trial 
was followed by a single short PI trial, RT on 
the latter trial was  disproportionately 
lengthened in schizophrenic as compared with 
normal subjects. Although this procedure is a 
more complicated one because differential 


? That the age difference between patients and con- 
trols does not contribute appreciably to the results is 
indicated by low, nonsignificant rank order correlations 
between age and RT (.23) and age and slope difference 
Score (.13) for the schizophrenic group. 
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expectancies are operating, it may be con- 
sidered in the same terms as the present study. 
Thus either a single or a series of long PIs 
produce effects in patients that persist and 
impair performance under subsequent, more 
optimal conditions. Similarly, in a study by 
Rodnick, cited by Shakow (1950) a distraction 
stress impaired the performance of schizo- 
phrenic subjects on both stress trials and 
subsequent nonstress trials, whereas normal 
subjects were impaired only on stress trials. 
These studies suggest that the often noted 
inability of schizophrenic patients to adapt 
quickly to change may be, at least in part, a 
function of their inability to recover from an 
inadequate set or pattern of responding which 
has been induced by poor preceding external 
conditions. 

One possible explanation of the results of 
the descending series is that RT may have 
remained poor at the shorter PIs because of 
reactive inhibition, fatigue, or lowered coopera- 
tion, occurring primarily in the latter portions 
of a session. Such factors could conceivably 
account for at least part of the increase in RT 
on the longer PIs in the ascending series. If 
this hypothesis were true, one should expect a 
gradual increase in RT over successive trials, 
even if all series included only short PIs. To 
test this possibility, six of the seven patients 
in Experiment I were given 14 trials on each 
of the following PIs: 1, 3, 2, 4, 3, and 3 in that 
order, using Method VR. The mean slope for 
the latter five PIs was essentially zero. Also, a 
group of schizophrenic subjects showed no 
decrement on the second of two 25-trial blocks 
with an irregular procedure. In these two 
studies, then, despite the fact that more RT 
trials and at least an equal amount of 
«pI time" and total time preceded the later 
trials, there was no tendency for merely 
extended performance on the RT task to have 
a decremental effect on the later trials. This 
suggests that the lack of improvement on the 
shorter PIs in the descending series of the 
present experiments cannot be accounted for 
merely by lateness in the session. It appears, 
rather, to relate specifically to the fact that 
long PIs preceded them. : 

What, then, is the important psychological 
process involved in the response of schizo- 
phrenic subjects to a series of long PIs? One 
possibility, suggested by Shakow (1950), is 


168 


that repeated and unsuccessful attempts to 
maintain a set for relatively long periods lead 
to a reduction of task oriented behavior or 
"withdrawal" on the part of schizophrenic 
patients. This process presumably involves 
both decreased alertness to external stimula- 
tion (inability to establish a “major set”), 
especially to task relevant stimuli (“ready” 
signal, stimulus), and increased attention to 
internal and irrelevant external stimuli (intru- 
sion of “minor sets”). A reaction like with- 
drawal under unfavorable conditions could 
presumably persist during subsequent, more 
favorable external conditions, perhaps in 
association with perseveration of the minor 
sets, 

Another factor which may significantly 
affect the performance of subjects in the pre- 
sent type of situation is the general level of 
“arousal” as reflected in psychophysiological 
variables. It may be, for example, that the 
patients’ arousal level is decreased dispropor- 
tionately by a series of long PIs, or it may 
recover slowly from such a series, leading to 
prolonged slow performance. However, some 
studies have found higher physiological base- 
line levels or greater increases in level during 
experimental tasks for schizophrenic than for 
normal subjects (e.g., Malmo, Shagass, & 
Smith, 1951). High values for such indicators 
of arousal have been found to be accompanied 
by reduced responsivity to external stimuli, 
increased nonspecific responsivity, and per- 
formance deficits (Silverman, Cohen, & 
Shmavonian, 1959). Thus the effect of series 
of long PIs on the RT of schizophrenic subjects 
may be reflecting the effect of a stressful condi- 
tion on an organism already in a state of 
“hyperarousal,” physiologically speaking. It 
may seem paradoxical that the “withdrawal” 
hypothesis and the “‘hyperarousal” hypothesis 
are not necessarily incompatible, but schizo- 
phrenia may very well be a reflection of the 
coexistence of such ordinarily disjunctive 
states. 


SUMMARY 


The effect of the preparatory interval (PI) 
on simple reaction time (RT) in schizophrenic 
and normal subjects was investigated in two 
experiments using a regular procedure. When 
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successive series of PIs were presented in order | 
of increasing length (ascending), RT was an 
increasing function of PI length for both 
groups, but the slope of the curve was signifi- 
cantly greater for the schizophrenic subjects. 
When the reverse (descending) order of 
presentation was used, the RTs of the patients | 
tended to be the same for all PIs and relatively 
slow, whereas the curve for the normal subjects 
was virtually identical to that for the ascending 
order. : 

The data indicate, in general, a greater | 
susceptibility to the effects of context of PIs 
on the part of schizophrenic subjects. More 
specifically, these subjects show an inability 
to recover from an inadequate set or response 
pattern which has been produced by unfavor- 
able external conditions. The data cannot be 
accounted for by the concepts of fatigue or 
reactive inhibition. However, series of long 
PIs could conceivably lead to a “withdrawal” 
reaction, characterized by the perseveration 
of minor sets, or to a general change in the 
level of arousal in schizophrenic subjects. 
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THEMATIC SEXUAL RESPONSES AS RELATED TO 
SEXUAL DRIVE AND GUILT" 


ALAN H. LEIMAN anp SEYMOUR EPSTEIN 
University of Massachusetts 


a direct relationship was reported between 

sexual responses to a thematic appercep- 
tion test and sexual drive as measured by rate 
of orgasm. No relationship was found when 
deprivation (time since last orgasm relative to 
rate) was used as an alternate measure of drive. 
It was assumed that each person has a rela- 
tively stable rate of sexual outlet which can 
be used as a baseline to determine whether he 
is in a deprived or satiated state. The fact 
that thematic responses were related to rate 
but not deprivation presents a problem. A 
possible explanation is that thematic sexual 
responses and rate are both influenced by guilt 
to about the same degree, whereas deprivation 
is relatively independent of guilt. This hy- 
pothesis is supported by the consideration 
that deprivation is determined largely by the 
accident of when a subject happens to par- 
ticipate in an experiment. If deprivation 
is relatively independent of guilt whereas 
thematic sexual responses are not, it follows 
that the addition of a measure of guilt is 
necessary if a relationship between the two is 
to be demonstrated. One purpose of the present 
study was to repeat the previous study with 
the addition of a measure of guilt. 

A second purpose was to test an extension 
of Mille's model of displacement (Miller, 
1948, 1951) to projective techniques (Epstein 
& Smith, 1956). According to this model (see 
Figure 1) highly drive relevant cues favor 
inhibition while weaker cues favor expression. 
It follows that drive should most effectively 
be measured by weak stimulus cues and guilt 
by strong ones. To determine whether this is 
So, a test of thematic apperception with a built- 
in dimension of sex-relevance was constructed. 


T a study by Epstein and Smith (1957), 


1 This study is part of a project on the measurement 
of drive and conflict of which Epstein is principal inves- 
tigator and which is being supported by Grant M-1293 
from the National Institute of Mental Health, United 
States Public Health Service. 

This paper was presented, in part, at the 1960 East- 
ern Psychological Association Convention in New York. 


METHOD 


All subjects were unmarried male students enrolled 
in a general course in psychology at the University of 
Massachusetts. They were tested as a group with a spe- 
cially constructed thematic test, and then given a ques- 
tionnaire designed to obtain information on drive and 
guilt over sex. 


Thematic Apperception Materials 


The thematic test consisted of six specially designed 
and selected pictures arranged along a dimension of 
sexual relevance. The pictures were presented in order 
of increasing relevance in order to reduce the influence 
of one picture upon the next. Buffer pictures were inter- 
spersed among the critical pictures to disguise the di- 
mension. The pictures in order of presentation were: 

1. A man sitting on a park bench at night (low 
relevance). 

2. Two figures of undetermined sex walking along 
a dirt road. They are approaching a gate (low rele- 
vance). 

3. A young boy sitting before a violin (TAT Card 
1; buffer) (Murray, 1943). 

4. A young man sitting at a desk and writing a let- 
ter (low relevance). 

5. The dimly illumined figure of a man. whose 
shadow can also be seen. He is walking up a stairway 
(Symonds Card A6; low relevance) (Symonds, 1949). 

6. A night scene of a young man helping a young 
woman into a car. They are dressed as if going to a 
party or dance (high relevance). 

7. A group of men lying on the ground (TAT Card 
9BM; buffer). 

8, A man lying on a bed, embracing and being 
kissed by a woman who is leaning over him (high rel- 
evance). 

The subjects were told they were participating in a 
test of creative imagination. Each picture was exposed 
for 20 seconds, followed by a 4-minute period for writ- 


Two thematic scores were obtained, Need Sex (n 
Sex) and Sexual Guilt. 

Need Sex. Murray (1943) defines n Sex as follows: 
“To seek and enjoy the company of the opposite sex. To 
have sexual relations. To fall in love, to get married” 
(p. 10). In the present study a five-point scale was used, 
First, basic weights were assigned as follows: 

1. The slightest reference to marriage or romance 

(e.g., “They are on a date"). 

3. Reference to physical sexual contact other than 
sexual intercourse (e.g, “They are petting in the 

» 

5. Actual or implied sexual intercourse (e.g. 

“They are going to have relations”). 

These basic weights were then modified by taking 
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into consideration intensity, importance to plot, fre- ` 


quency, and duration of need. Frequently, no change 
from the basic weight was made. When it was, it was 
usually within one unit. In the present study, the first 
author scored the stories without knowledge of the 
drive- or guilt-group into which each subject was placed. 
In order to obtain an interscorer reliability coefficient, 
the second author scored 20 randomly selected records, 
without previous mutual training with the first author. 
A coefficient of .96 was obtained, which compares favor- 
ably with similar scores which yielded interscorer relia- 
bilities of .89 (Epstein & Smith, 1957) and .95 (Epstein 
& Smith, 1956). 

Sexual Guill. It had been intended to investigate 
three levels of guilt over sex, the most overt level con- 
sisting of direct admission of guilt or regret by the hero, 
the second level of disapproval or punishment by some- 
one else, and the third of accidental misfortune follow- 
ing, but not a direct consequence of, sexual behavior 
(e.g., “After he leaves the girl, he gets hit by a car on 
the way home"). Responses to each picture were given 
a weighted guilt score in the same manner as with the 
score for n Sex. However, as an insufficient number of 
guilt responses was obtained in each level, they were 
combined into one overall guilt score by multiplying the 
weight for direct guilt by 5, the weight for the middle 
score by 3, and the weight for the least direct guilt by 1. 
The score was discarded when it was found that the 
interscorer reliability coefficient for the two authors 
scoring 20 randomly selected records was .14. No at- 
tempt was made to improve the reliability through fur- 
ther definition as it was apparent that the difficulty lay 
in the dearth and vagueness of the guilt responses them- 
selves. There were very few clear-cut cases of a hero’s 
directly indicating that he regretted his sexual behavior. 
Our impression was that sexual guilt was manifested by 
avoidance of sexual themes rather than by guilt themes. 
"The pictures, furthermore, were designed to measure 
sexual, not guilt, responses. 


Questionnaire on Behavioral Sexual Drive 
and Guilt over Sex 


The instructions on the questionnaire were essen- 
tially the same as in a previous study (Epstein & Smith, 
1957). In order to minimize defensiveness, subjects 
were assured anonymity,? were told they need not an- 
Swer any questions if they did not wish to, and honesty 
was appealed to in the interest of science. 

The questionnaire contained two checklists, one re- 
quiring an estimate of average frequency of orgasm per 
week during the previous 2 months, i.e., from “one or 
less per week" to “eight or more per week,” and one for 
time since last orgasm, i.e., from “one day ago or less” 
to “more than 2 weeks ago." The final section consisted 
of the following 12 statements on attitude toward sex, 
which was used to measure sexual guilt and inhibition: 
sex education should be part of the high school curricu- 
lum; people have too much sex on their minds; sex 
should be reserved only for marriage; there is nothing 

wrong or unhealthy about having sexual orgasms fairly 


? Thematic responses were matched with question- 
naires by having subjects place their materials in indi- 
vidual manila envelopes. 
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net response 


STRENGTH OF RESPONSE TENDENCY 


d c b a 
*— — — STIMULUS-RELEVANCE 


Fic. 1. Net increment in response tendency to a stim- 
ulus dimension as a function of drive and inhibition. 


often; I avoid sexy shows when I can; it is wrong to in- 
dulge in sex strictly for pleasure; I enjoy most of my 
sexual orgasms (considering all sources combined) with- 
out regret; I feel guilty about my sexual behavior; I find 
discussions about sex slightly annoying; I feel that Iam 
well adjusted sexually; when I have sexual thoughts or 
daydreams, I feel slightly ashamed of myself; sexual 
thoughts come to mind sufficiently often to interfere 
with my work. 

Tn responding to the guilt questionnaire, the subjects 
rated each item on a continuum from +3 to —3 (agree 
strongly to disagree strongly). A preliminary score was 
obtained by adding weights after having reversed the 
signs of items indicating an absence of guilt. Each item 
was then correlated with the total score and only those 
items retained for which there was a positive relation- 
ship. On this basis, Item 1 was eliminated. 

Despite the limitations of using self-report informa- 
tion as a criterion measure, there is no one who has as 
much opportunity to observe a person's behavior and 
know his feelings as the person himself. Furthermore, 
the conditions were such as to minimize defensiveness, 
and finally high reliability is not required as the only 
assumption is that the groups selected by the scales do 
in fact differ in central tendency on the variable in 
question. 


Analysis of the Data 


To investigate the influence of drive, guilt, and stim- 
ulus-relevance independently and in interaction with 
each other upon thematic responses, two separate anal- 
yses of variance were performed, one with rate and the 
other with deprivation as the measure of drive. 

For the analysis with rate, subjects were divided 
into three equal groups of 22 each. The lowest group 
consisted of those who reported a rate of 0-1, the middle 
group of 2-3, and the high group of 4 or more orgasms 
per week. Independently of rate, the group was divided 
in half as nearly as possible on self-reported guilt. In an 
attempt to have the measures of rate and guilt relatively 
independent, subjects were selectively eliminated so 


THEMATIC SEXUAL RESPONSES, DRIVE, AND GUILT 


TABLE 1 


ANALYSIS OF VARIANCE OF THEMATIC SEXUAL 
"RESPONSES WITH RATE AS THE MEASURE 


dr or DRIVE 
Source SS df | MS F 

Rate 12.45| 2| 6.26| 6.66** 
Guilt 1.33} 1| 1.33) 1.41 
Rate X Guilt .00| 2 .00 
Between subjects 45.82) 49 94 
Stimulus-relevance 108.05} 1 |108.05|216.10*** 
Stimulus-relevance X .94| 2 47 

Rate 
Stimulus-relevance X 1.33} 1| 1.33) 2.67 

Guilt 
Stimulus-relevance X .34 2 .16 

Rate X Guilt 
Residual error 24.40| 49 .50 


** Significant at .01 level. 
*** Significant at .001 level. 


that groups divided according to rate had the same 
means and standard deviations on guilt.* 

Finally, stimulus-relevance was taken into account 
by dividing the pictures into those of low (Pictures 1, 
2,4, and 5) and high (Pictures 6 and 8) relevance as de- 
termined by frequency of eliciting a sexual response. 

Th summary, the analysis of variance using rate as 
the measure of drive involved three levels of drive, two 
levels of guilt, and two levels of stimulus-relevance as 
independent variables, and thematic sexual response as 
the dependent variable. 

The analysis using deprivation as the measure of 
drive was identical with that for rate except that two in 
place of three levels of drive were investigated because 
of the added restriction of having to hold rate constant 
in relation to deprivation. Within each level of rate, 
subjects were divided as nearly in half as possible on 
time since last orgasm. 


RESULTS 


Influence of Drive upon Thematic Sexual 
Responses 


„In Table 1 it can be seen that there are 
Significant differences (.01 level) in thematic 
Sexual responses for the groups divided ac- 
Cording to rate. The mean thematic sex scores 
in ascending order for rate are 5.38, 7.20, and 


3 The balancing of the subjects resulted in cells of 
Unequal proportions of subjects. Consequently, additiv- 
ity of sums of squares could no longer be assumed. 
Snedecor (1946, pp. 299-301) reports a modification of 
the usual analysis of variance which circumvents this 
difficulty for completely randomized designs. A logical 
extension for use with designs with repeated measure- 
ments for each subject was used by applying the Snede- 
cor Corrections independently to the between-subjects 
and within-subjects cells variance. The authors are in- 
Ped to Jerome Myers for his aid in this aspect of 
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Fic. 2. Thematic sexual response as a function of 
rate of orgasm. 


8.94, indicating a positive relationship (see 
Figure 2). 

Table 2 fails to reveal a relationship for 
deprivation and thematic sexual responses. 


Influence of Guilt upon Thematic Sexual 
Responses 

There is a significant relationship (.01 level) 
between self-reported guilt and thematic sexual 
responses when subjects are selected according 
to deprivation (see Table 2), but not when 
subjects are selected according to rate (see 
Table 1). With deprivation held constant 
subjects of low guilt obtained a mean thematic 
sexual score of 4.24 while for subjects of high 
guilt the mean was 3.13. 

The difference in findings for subjects 
selected on rate and deprivation can be ac- 
counted for by assuming that rate is more 
influenced by guilt than is deprivation. It 
would then follow that groups with high and 
low guilt scores matched on rate are less 
reliably different in guilt than are groups 
matched on deprivation. In order to test this 
assumption, the relationship of the two 
measures of drive to guilt was investigated over 
the entire pool of subjects. For the low-guilt 
group 7 subjects were of low rate, 9 of medium 
rate, and 17 of high rate; for the high-guilt 

oup 15 were of low rate, 13 of medium rate, 
and 5 of high rate. A chi square of 8.36 with 
2 df was obtained, indicating a significant 
(.02 level) inverse relationship between rate 
and self-reported guilt. When the analysis was 
in terms of deprivation, in the low-guilt group 
16 subjects were of low deprivation and 17 of 
high deprivation; in the high-guilt group 22 
subjects were of low deprivation and 11 of 
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TABLE 2 
ANALYSIS OF VARIANCE OF THEMATIC SEXUAL 
RESPONSES WITH DEPRIVATION AS THE 
MEASURE or DRIVE 


Source SS | df | MS F 
Deprivation .A7| 1| .47 
Guilt 0.43| 1 |10.43| 9.07** 
Deprivation X Guilt 4.74| 1| 4.74 4.13* 
Between subjects 54.28| 47 | 1.15| 
Stimulus-relevance 93.84) 1 |03.841170.62*** 
Stimulus-relevance X .00 1 .00 
Deprivation 
Stimulus-relevance X 1.19) 1| 1.19| 2.16 
Guilt 
Stimulus-relevance X -36| 1| .36 
Deprivation X Guilt 
Residual error 5.62} 47 | .55 


“Significant at .05 level. 
** Significant at .01 level. 
*** Significant at .001 level. 


high deprivation. The corresponding chi 
square of 2.24, 1 df, is not significant. The 
results support the assumption that guilt 
and rate but not guilt and deprivation are 
related. 


Interaction of Drive and Guilt om Thematic 
Sexual Responses 


A significant interaction of drive and guilt 
(.05 level) was found for the analysis with 
deprivation (see Table 2), but not rate, as the 
measure of drive (see Table 1). In Figure 3 
it can be seen that an increase in deprivation 
is associated with an increase in thematic 
sexual responses for subjects of low self- 
reported guilt and a decrease for subjects of 
high self-reported guilt. 


Stimulus Relevance 


For the analyses of variance with both rate 
and deprivation as measures of drive, the only 
significant source of variance that included 
stimulus-relevance was the main effect of 
stimulus-relevance itself (see Tables 1 and 2). 
This simply indicates that the pictures divided 
according to presumed low and high relevance 
were in fact of low and high relevance as 
determined by average sexual response elicited. 

In order to investigate further the influence 
of stimulus characteristics, we analyzed the 
data for each picture separately by comparing 
the number of subjects above and below the 
pooled median. The reason for analyzing the 
data in relation to a cutting point was that 
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Frc. 3. Thematic sexual response as a function of 
deprivation and guilt. 
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all the pictures of low relevance had a pre- 
dominant number of zero scores, while the 
pictures of high relevance were markedly 
skewed in the opposite direction. Following a 
procedure described by Sutcliffe (1957), we 
were able to evaluate by chi square the inter- 
action as well as main effects of drive and guilt 
upon thematic sexual responses. | 
Inspection of the data with deprivation as | 
the measure of drive made it apparent that 
all pictures demonstrated the same interaction 
effect, as is illustrated in Figure 3. For low- 
guilt subjects there was an increase in thematic | 
sexual responses with increasing drive and 
for high-guilt subjects a decrease. A difference | 
in pictures, however, was evident for the | 
relationships of self-reported guilt and the- 
matic sexual responses. In general, the higher 
the stimulus-relevance, as measured by fre- 
quency and intensity of sexual response, the 
more effectively did the pictures separate 
subjects according to self-reported guilt. For 
Picture 8, the picture of highest relevance, 
significance was reached at the .01 level. The 
finding that guilt was best measured by 
relatively weak sexual responses to pictures of 
high relevance raises the question of why a 
Significant interaction of guilt by stimulus- 
relevance was not found in the analyses of 
variance (see Tables 1 and 2). The answer is 
that Picture 5, which was grouped with the 
picture of low stimulus-relevance, functioned 
with respect to guilt like a picture of high 
stimulus-relevance. Picture 5 does not draw a 
high number of sexual responses but tends to 
elicit strong ones. Considering only thematic 
Sexual responses actually produced, the 
average strength of response to Picture 5 was 
higher than to one of the pictures classified as 
of high stimulus-relevance (see Table 3). It 
may be that in so far as the effect of guilt upon. 
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TABLE 3 


SEXUAL RESPONSE CHARACTERISTICS OF 
INDIVIDUAL PICTURES 


£ Picture 
Thematic Sexual 
Response 


1 2 4 5 6 8 


Percentage responses 44% | 48% | 26% | 33% |100% 95% 
Mean response .85| .T8| .45| .92 2.05 | 3.67 
Range 0-5 |o |o |0-5 |1-5 | OS 
SD 1.18| 1.02| .87| 1.66| 1.16 | 1.36 
Mean response consider- | 1.90| 1.61| 1. 76| 2.90| 2.05 | 3.94 


ing only responses 
with sex imagery 


thematic sexual responses is concerned what 
is important is the strength rather than the 
number of thematic sexual responses that a 
picture elicits. 

For the analysis with rate as the measure 
of drive, two of the pictures classified as of low 
relevance (Pictures 1 and 5) significantly (.01 
level) differentiated the groups with respect 
to rate, but not guilt, while the picture of 
highest relevance differentiated (.01 level) 
the groups according to guilt, but not rate. 
In no case was the interaction between guilt 
and rate significant, which is consistent with 
the findings from the analysis of variance. The 
fact that stimulus-relevance was found to 
influence the effect of both rate and guilt on 
thematic sexual responses when individual 
pictures were analyzed but not in the analysis 
of variance (see Table 1) indicates that the 
pooling of pictures into groups of low and high 
relevance resulted in a cancelling-out process. 
This has already been described in relation to 
the measurement of guilt, where Picture 5 
functioned as a picture of high rather than low 
relevance. The two pictures of low relevance 
that differentiated the groups according to 
rate (Pictures 1 and 5) were examined in 
relation to the two pictures of low relevance 
that failed to differentiate the groups (Pictures 
2 and 4). In Table 3 it can be seen that the 
pictures do not differ in frequency of eliciting 
sexual responses. Picture 1 does not appear to 
be distinctive in any way other than that it 
was presented first. This suggests that order 
or sequence effects may be of importance in 
projective tests. Picture 5 elicited the full 
range of responses and had the highest stand- 
ard deviation of any of the pictures in the 
study. Thus, it had the highest potential for 
discriminating among individuals, and may 
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simply be a good “item” in a psychometric 
sense. 


Measurement of Conflict by Thematic Responses 


Assuming that pictures of low relevance 
best measure drive, and pictures of high 
relevance best measure inhibition (see Figure 
1), it follows that people who overrespond to 
stimuli of low relevance and underrespond to 
pictures of high relevance are in a state of 
opposing drives, or in conflict. Taking into 
account the findings on the individual pic- 
tures, we selected subjects on their response to 
Picture 1 as the best measure of drive and 
Picture 8 as the best measure of guilt. Nine 
subjects were found to follow the conflict 
pattern of producing greater than a median 
response to Picture 1 and less than a median 
response to Picture 8. If a combination of high 
drive, as measured by rate or deprivation, and 
high self-reported guilt is used as the criterion 
of conflict, there was not even a tendency for 
this group to behave as if in conflict. If, on the 
other hand, high self-reported guilt is taken as 
the sole criterion of conflict, then six of the 
nine subjects who followed the conflict pattern 
in thematic responses are in conflict, indicating 
a weak tendency in the predicted direction. 
However, Picture 8 alone differentiated the 
groups according to self-reported guilt, so that 
no advantage is gained by taking into account 
performance on Picture 1. 


Discussion 


Influence of Drive and Guilt upon Thematic 
Sexual Responses 


Tn accordance with hypothesis and previous 
work (Epstein & Smith, 1957) it was found 
that thematic sexual responses were directly 
related to sexual outlet but not to deprivation. 
On the other hand, and again in accordance 
with hypothesis, thematic sexual responses 
were found to be related to deprivation when 
self-reported guilt was taken into account. The 
discrepant findings for the two measures of 
drive emphasize the importance of taking into 
account levels of response in accounting for 
projective behavior. Apparently, thematic 
sexual responses and rate of sexual outlet, both 
of which refer to responses, are influenced to a 
similar degree by guilt, so that a measure of 
self-reported guilt adds nothing. On the other 
hand, deprivation in relation to rate can be 
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thought of as an index of momentary drive 
which is less influenced by guilt than either of 
the response measures. Accordingly, a measure 
of self-reported guilt is necessary to bring out 
the relationship between deprivation and 
thematic sexual responses. 

Of considerable theoretical interest is the 
manner in which self-reported guilt interacted 
with deprivation to influence thematic sexual 
responses, When self-reported guilt was low, a 
direct relationship was observed between 
deprivation and thematic sexual responses, but 
when self-reported guilt was high, an inverse 
relationship was noted. According to Miller 
(1948, 1951), as verified by Brown (1942, 
1948), an increase in drive raises the entire 
gradient of approach, thereby increasing net 
strength of approach response. Miller assumes 
that the avoidance gradient is unaffected by 
the increase in height of the approach gradient. 
The conditions of Brown’s experiments meet 
this assumption, as Brown investigated ap- 
proach and avoidance gradients independently. 
However, the findings in the present study 
suggest that the assumption does not generally 
hold for sexual conflict in human beings where 
the guilt reactions can be conditioned to the 
drive itself, so that an increase in drive can 
produce a decrease in strength of approach by 
raising the avoidance gradient more than the 
approach gradient. 

Of practical interest, the finding of an inter- 
action of deprivation and guilt on thematic 
sexual responses suggests that the interpreter 
of tests like the Thematic Apperception Test 
(TAT) must be on guard against confusing 
low drive states with states of high drive and 
high guilt, and must also consider that either 
an increase or a decrease in drive-related re- 
sponses, depending on the state of guilt, may 
indicate an increase in drive. 


Influence of Stimulus-Relevance 


A possible approach to assessing the effects 
of drive and inhibition independently is sug- 
gested by the model presented at the beginning 
of the paper (see Figure 1). Following the 
assumption that the gradient of inhibition, as 
a function of stimulus cues, is steeper than the 
gradient of expression, we hypothesized that 
pictures of low relevance provide the best 
measures of drive and pictures of high rele- 
vance provide the best measure of guilt. The 
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hypothesis was supported when individual 
pictures were analyzed, but not when they 
were pooled into groups of low and high 
relevance. Apparently other features of the 
pictures must be considered if the effects of 
stimulus-relevance are not to be masked. For 
one, there is reason to suspect that order and 
sequence effects are important in influencing 
responses to pictures of low relevance. Drive 
may exert its directive function by providing 
cues that are not otherwise available. When 
cues are offered by instructions, external stim- 
uli, or other pictures in the test, the influence 
of drive may be overshadowed by the effects 
of set. This may account for the general finding 
that multiple-choice TATs are less effective 
than the usual TAT. 

There was reason to believe that other 
features of the pictures, in addition to stimulus 
relevance, were important in determining how 
well groups were differentiated. Accordingly, 
it will be necessary in future work to analyze 
the response characteristics of the pictures 
used, so that ultimately theselection of thematic 
pictures and the interpretation of responses 
to them will be on a more orderly basis than is 
presently the case. The approach that Murray 
recommends in his TAT manual—treating 
responses to all pictures alike by adding them— 
would seem to mask important effects. Pro- 
jective tests, by stressing the ambiguity that 
all projective stimuli have in common, have 
failed to take sufficiently into account the dis- 
tinctive characteristics of the stimuli. Despite 
ambiguity, the major source of variance in 
responses to a series of stimuli is generally 
contributed by differences between the 
stimuli. 


Implications for the Measurement of Con- 
flict 


If pictures of low relevance measure drive 
and pictures of high relevance measure in- 
hibition it would seem to follow that people 
who overrespond to the former, and under- 
respond to the latter should be in a state of 
strong, opposing drives, or in conflict. It might 
also be expected that such people would be 
relatively fixated or rigid in behavior related 
to the drive in question, as their responses 
should vary minimally as a function of chang- 
ing stimulation. Unfortunately, an adequate 
sample of subjects who overresponded to 
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pictures of low relevance and underresponded 
to pictures of high relevance could not be 
obtained, nor was a satisfactory criterion of 
conflict available. Accordingly, the negative 
results on the measurement of conflict cannot 
be taken too seriously, and further work is 
indicated to investigate what on logical grounds 
appears to be a promising approach. 


SUMMARY 


The purpose of the present investigation 
was to determine the influence of sexual drive 
and sexual guilt upon thematic sexual re- 
sponses. Sixty-six male subjects took a the- 
matic test with a built-in dimension of sexual 
relevance, after which they anonymously filled 
out a questionnaire from which were derived 
two indices of sexual drive, one based on rate 
of orgasm and the other on deprivation (time 
since last orgasm relative to rate), and an index 
of guilt. The major findings are as follows: 

1. Across levels of guilt, a direct relationship 
was found between sexual drive as measured by 
rate of orgasm, but not by deprivation, and 
thematic sexual responses. 

2. Self-reported guilt and thematic sexual 
responses were significantly inversely related 
when deprivation was held constant. 

3. A significant interaction was found for 
deprivation by guilt, but not for rate by guilt, 
with thematic sexual responses as the de- 
pendent variable. When guilt was relatively low, 
there was a direct relationship between depri- 
vation and thematic sexual responses, but 
when guilt was high, the relationship was 
reversed. The finding of an interaction of guilt 
with deprivation but not rate could be ex- 
plained by the verified assumption that guilt 
and rate are significantly related. 

4, Analysis of individual pictures indicated 
that pictures of relatively low sexual relevance 
best measured drive, and pictures of relatively 
high sexual relevance best measured guilt. 
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Responses to thematic tests can profitably 
be viewed in terms of an interaction of expres- 
sive and inhibitory tendencies. Direct pre- 
dictions can be made to significant behavior, 
such as rate of orgasm, when the behavior is 
based upon a similar net effect of drive and 
inhibition. Where different levels of behavior 
are involved, it is necessary to take into ac- 
count the relative role played by expressive 
and inhibitory tendencies. The findings in this 
study suggest that approach (expressive) 
tendencies can be measured by the production 
of drive-related responses to stimuli of low 
relevance, and avoidance (inhibition) by the 
failure to produce appropriately strong re- 
sponses to stimuli of high relevance. 
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T role does the family play in 
the development and course of a 
schizophrenic disorder? To answer 
this question, clinical and experimental in- 
vestigators have employed a wide variety of 
research methods. Examples of these include 
case history analysis (Wahl, 1956), follow-up 
studies of children who subsequently became 
schizophrenic (O'Neal & Robins, 1958), 
psychotherapy with concomitant observations 
(Bowen, Dysinger, & Basamania, 1959), depth 
and structured interviewing (Kohn & Clausen, 
1956; Tietze, 1949), psychodiagnostic appraisal 
(Singer & Sugarman, 1955), and scaling 
procedures primarily involving the measure- 
ment of child rearing attitudes of parents of 
schizophrenic and normal patients (Mark, 
1953). 

Tn general, the reports of these investigators 
have stressed emotional instability in the 
home, deviant patterns of parent-child and 
husband-wife relationships and evidence of 
pathological child rearing attitudes and be- 
haviors on the part of the parents (with pri- 
mary emphasis on the mother), including overt 
and covert rejection, domination, hostility, 
oversolicitousness, and the like. These reports, 
however, are not entirely consistent. Several 
studies have reported a failure to differentiate 
the attitudes toward child rearing held by 
mothers of schizophrenic and normal males 
(Freeman, Simmons, & Bergen, 1959; Zucker- 
man, Oltean, & Monashkin, 1958). 

Many difficulties, inherent in the method of 
retrospective reconstruction, have been offered 
to account for these disparate findings (Gar- 
mezy, Farina, & Rodnick, 1960). Of these, 
considerable emphasis has been placed upon 


1'This study is part of a research program supported 
by National Institute of Mental Health, United States 
Public Health Service, Grant M-629. An abbreviated 
version of this paper was read at the Annual Meeting 
of the American Psychological Association, Cincinnati, 
Ohio, September 1959. 

?Now at the Center for Personality Research, 
University of Minnesota. 


the heightened defensiveness of parents when 
confronted with the reality of a son's hospitali- 
zation for a mental disorder. 

A more basic question concerning the 
validity of the parent-respondent report has 
been raised by Ausubel (1958). He has sug- 
gested that the child’s perception of parental 
attitudes and behavior is the more relevant 
consideration "because the personality de- 
velopment of children is influenced more 
approximately by the attitudes they perceive 
than by the objective properties of the latter 
as reported by parents or observers” (p. 356). 

Nevertheless, there has been relative neglect 
of the schizophrenic patient as both perceiver 
and respondent in evaluating the significant - 
child rearing attitudes of his parents. Heilbrun 
(1960) instructed matched groups of normal 
and schizophrenic daughters to respond to the 
Parent Attitude Research Instrument (PART) 
(Schaefer & Bell, 1958) “as their mothers - 
would.” He found that the patients, in com- 
parison with normal subjects, ascribed more 
deviant attitudes to their mothers on 12 of the 
23 PARI scales; the normals reported greater 
deviance on only 1 scale. On the other hand, 
the mothers themselves did not differ signifi- 
cantly from each other in their responses to 
these same attitudinal statements. Harris 
(1955) had groups of poor and good premorbid 
schizophrenic patients (Phillips, 1953) and 
normal control subjects answer a series of 
child rearing statements as they believed their 
mothers would have responded during their 
childhood. He found that the order of the 
magnitude of deviance of maternal child rear- 
ing as described by the three groups of subjects 
was Poors, Goods, and Normals. 

Studies conducted under the aegis of the 
Duke Schizophrenia Project have repeatedly 
found differences between good and poof 
premorbid schizophrenic groups (Rodnick & 
Garmezy, 1957). Since more recent studies 
have suggested that the structure of family 
organization may also differ for the two groups 
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(Garmezy & Rodnick, 1959), the current study 
was aimed at investigating the perceptions of 
the maternal and paternal child rearing 
patterns of comparable groups of schizophrenic 
and normal subjects. 


METHOD 

Subjects 

Forty-five male patients drawn from Veterans 
Administration and state hospital populations served as 
subjects for the study. Thirty subjects comprised the 
two groups of schizophrenic patients, 15 of these 
having been rated as Good Premorbids and 15 as Poor 
Premorbids on the basis of the Phillips (1953) scale. 
Goods were defined as those scoring below 15 on this 
scale; in general, these subjects were married and had 
made adequate social and heterosexual adjustments 
prior to the onset of the disease. To be rated as a Poor, 
the subject had to score 15 or above on the Phillips 
scale, a value which usually reflected social isolation in 
childhood and adolescence and limited sexual expres- 
sion in adulthood. Two of the investigators independ- 
ently rated the case records and achieved a satisfactory 
degree of interrater reliability (rho = +.84). The nor- 
mal control group consisted of 15 patients from GMS 
wards of the same Veterans Administration installation 
from which many of the schizophrenic subjects had been 
selected, These subjects had had no history of psychi- 
atric illness; their hospitalization was a result of non- 
psychosomatic ailments including fractures (3), removal 
of cysts (5), plastic surgery (2), tuberculosis (2), cardiac 
failure, appendectomy, and hernia repair. All subjects 
were between 18 and 45 years of age and had lived with 
their biological parents until at least age 18. The groups 
did not differ significantly in mean age or level of 
education. For Poors, Goods, and Normals the respec- 
tive ages were 29.6, 33.5, and 27.7 years; mean educa- 
tion levels were, respectively, 10.7, 9.9, and 11.0 years 
of schooling. Socioeconomic status, as measured by the 
occupational status of the fathers of many patients, 
was comparable. The two schizophrenic groups did not 
differ significantly in length of hospitalization (total 
time in months: Poors, 4.67; Goods, 5.28). All subjects 
met the additional criteria of freedom from evidence of 
mental deficiency and organic dysfunction. 


Procedure 


Each subject was instructed to think back to when 
he was 13 or 14 years old and to try to remember his 
mother and father as they were at that time. The 
experimenter then presented the subject with 75 state- 
ments describing various child rearing attitudes and 
Practices and asked the subject to indicate whether 
his mother (or father) would have agreed or disagreed 
with each item at the time he was “growing up.” The 
items, which were printed individually on 5 X 8 un- 
lined index cards, were presented in sequence to the 
Subject who read them aloud after the experimenter 
had done so initially. The subject then indicated his 
Agree or Disagree response. In order to maintain the 
instructional set, the experimenter continued to stress, 


throughout the presentation of items, the need to 
respond as the subject's parent would have answered 
during the subject's adolescence. A standard explana- 
tion was used to describe each item further if the subject 
inquired as to the meaning of a statement. The order 
of presentation of mother-father and father-mother 
scales was counterbalanced within each group. 

The statements used were similar in form and 
content to those originally devised by Shoben (1949) 
and subsequently used by Mark (1953) and Harris 
(1955). Fifty-three of the 75 items were those which 
Mark found significantly differentiated mothers of 
schizophrenics from normals, The remaining items were 
constructed by the senior author. All items were rated 
by five senior clinicians into Dominance, Overprotec- 
tion, and Ignoring subcategories on the basis of Shoben’s 
original definitions. An item was assigned to a given 
category if at least four or five judges agreed on its 
placement. Items not meeting this criterion were placed 
in a Miscellaneous category and served as filler state- 
ments. The number of items in the Dominance, Over- 
protection, Ignoring, and Miscellaneous categories were 
27, 25, 11, and 12, respectively.* All items were worded 
such that Agree or Yes responses were indicative of 
deviance. 

RESULTS 

For each subject the proportion of Yes 
responses given to each of the three categories 
of statements was computed for the mother 
and father condition separately. The mean 
and sigma values of these proportions for each 
group are presented in Table 1. Inspection of 
the table reveals that the order of magnitude 
of deviance in all instances is Poors, Goods, 


and Normals. 

‘An overall analysis of the data was made 
by a Lindquist (1956) Type VI analysis of 
variance in which there were two within- 
subject variables (scale categories and mother- 
father ratings) and one between-subject vari- 
able (patient groups). The results of the 


3 A copy of the scale is available upon request. 

4In addition to the overall Type VI design, six 
simple randomized designs were also employed to test 
the significance of differences in the proportion of Yes 
answers that Goods, Poors, and Normals ascribed to 
"Mother and Father on each of the three scales. The 
results of these analyses were essentially confirmatory. 
The rank order, from high to low proportion of Yes 
answers for every scale and for each parent, was Poors, 
Goods, and Normals with both schizophrenic groups 
differing significantly from the reports of control 
subjects. In addition, for all mother comparisons, the 
Poors reported significantly greater deviance than did 
Goods; for father comparisons, however, these two 
groups of schizophrenics differed only on Overprotec- 
tion (Poors > Goods). Failure to obtain differences for 
Dominance and Ignoring was a function of the rise in 
reported deviance by Goods when rating their fathers. 
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TABLE 1 


MEANS AND SIGMAS OF PROPORTIONS OF YES ANSWERS FOR Poors, GOODS, AND NORMALS ON 
Camp REARING ÅTTITUDE SCALES 


Dominance Overprotection Ignoring Total 
Mother Father Mother Father Mother Father 
M |Sigma 
M |Signa| M |Sigma| M (|Signa| M |Sigma| M |Sigma| M Sigma 
Poors .66 | .16 | .68 | .15 | .51 | .14 | .39 | .11 | .48 | .13 | .48 | .19 | .53 | .18 
Goods .50 | .15 | .64 | .15 | .38 | .13 | .29 | .13 | .35 | .16 | .42 | .18 | .43 | .19 
Normals 128 | .16 | .30 | .14 | .22 | .12 | .18 | .08 | .22 | .17 | .14 | .12 | .22 .14 
analysi i ntained in Table 2, m: TABLE 2 
YSIS, bong AS ^ ge B x in Table 2 may Summary TABLE: Type VI ANALYSIS 
be summarized as follows: XIVISANOE 
1. The main effect for Groups is highly significant; 
1 tests reveal that the order of the groups, ranked for ase ad hts 2 z ? 
amount of deviance reported, is Poors, Goods, and 3 i nm 
i differi ignificantly fi jetween subjects “4 x 
Rios a E ering Sigolicanty Fom Grodps (GPN) 2 | 4.4530 | 2.2265 |45.25 |<.001 
2. The main effect of Scales is also significant, sug- Beton! dac [4-008 100492 
gesting that, regardless of group, the frequency of within subjects ms | 5.5888 
ascription of Dominating attitudes and behaviors to ^ Parents (M-F) 1 | .0034| .0034 | .24 |>.20 
parents is significantly greater than for Overprotection ^ Scales (DOI) 2 | 1.7976 | .8988 |47.81 |<.001 
and Ignoring (Rejecting) attitudes (5 = <.005). Parents X Scales 2 2204 | .1102 | 9.10 |<.001 
3. The significant Parent X Scale interaction indi- ^ Parents X Groups 2 | .0778 | .0389 | 2.74 |<.10,>.05 
cates that both normal and schizophrenic patients Scales X Groups 4 | 1948 | .0487 | 2.59 |<.05,>.25 
ascribe different types of attitudes to their mothers and ri X Scales X lc E. 
fathers. In general, for all three groups of subjects EEN í fo Hu nd iat 
combined, fathers tend to be seen as more dominant Errow (W) 20 | 3.1922 
and mothers as more overprotective. In terms of rejec- — Erron(W) 42 | .5969| .0142 
tion, both parents are perceived similarly when data ^ Errom:(W) 84 | 1.5781] .0188 
for all subjects are combined. Errors(W) 84 | 1.0172] .0121 
4. The significant Groups X Scales interaction is a 
joint function of the tendency of normal subjects to Total 269 
ascribe to their parents all three types of attitudes 


with comparable frequency, whereas the two schizo- 
phrenic groups tend to place greater emphasis on 
Dominance and Ignoring behaviors. 

5. The Parent X Groups interaction approaches 
significance and arises from the assignment of higher 
scores to mothers by Poors and Normals, whereas Goods 
tend to assign higher (ie., more deviant) scores to 
their fathers. 


Comparison within Families 

In order to contrast the attitudinal patterns 
reported within the families of the three groups 
of subjects, a further analysis was made, using 
as a measure the proportion of Yes responses 
assigned to mother minus the proportion of 
Yes responses assigned to father for each 
category. Thus, negative values were indica- 
tive of relatively greater deviance in fathers, 
whereas positive scores reflected more deviant 
behaviors in mothers. The mean and sigma 


values for these mother minus father differ- 
ences are shown in Table 3. 

Individual, simple-randomized designs, one 
for each scale category, were used to test inter- 
group differences. The F ratios for the Ignor- 
ing and Overprotection categories were not 
significant. For the former, however, fathers of 
Goods were seen as most rejecting; for the 
latter, mothers of Poors were perceived as most 
deviant. Among the groups, however, the 
greatest differences occurred on the Dominance 
variable. The good premorbid schizophrenic 
group perceived their fathers to be significantly 
more dominating in comparison with their 
mothers than did the Poors and Normals 
(F = 3.66; 2, 42 df; p = <.05); Poors and 
Normals did not differ significantly from each 
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TABLE 3 
MEAN AND SIGMA VALUES OF MOTHER MINUS 
FATHER DIFFERENCES FOR Poors, GOODS, 
AND NORMALS ON CHILD REARING 
ATTITUDE SCALES 


Dominance oaod Ignoring 

M |Sigma| M [Sigma M [Sigma 
Poors —.02 | .15 | .12 | .11 | .00| .24 
Goods —.14 | .13 | .09 | .13 |-.07 | -21 
Normals —.02| .12 | .03 | .08 | .08 | .12 


other. Goods, however, differed significantly 
from both (p = <.005). 


Discussion 


Two aspects of the current study confirm 
previous research findings of the Duke Schizo- 
phrenia Project (Garmezy & Rodnick, 1959; 
Rodnick & Garmezy, 1957). First, earlier 
studies have revealed marked differences be- 
tween good and poor premorbid schizophrenic 
patients; studies of performance in tasks 
involving verbal learning, perceptual dis- 
crimination, concept formation, and judgment 
have indicated the order of magnitude of 
deficit to be Poors, Goods, and Normals. This 
order has been maintained in the present study 
of the extent of deviance in child rearing 
attitudes ascribed to parents by comparable 
groups of subjects. Second, the results of the 
present study parallel, in striking fashion, the 
data obtained by Farina (1960), who used a 
situational test to study more directly the 
patterns of dominance and conflict exhibited 
by the biological parents of hospitalized good 
and poor premorbid schizophrenic and normal 
patients. Farina found evidence of marked 
paternal dominance in the interactions of 
parents of Goods and a more accentuated 
vying for dominance (resulting in heightened 
conflict) in the poor premorbid group. Normal 
parents, on the other hand, showed little 
evidence of marked conflict or attempts to 
dominate each other. Thus, Farina's data are 
consistent with the findings of the current 
study. In addition, the obtained results also 
support data derived from recent studies of 
Dunham (1959) and Kreinik (1959). The 
former, employing a visual discrimination 
procedure similar to Dunn's (1954), found 
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maximum deficits by Poors on mother-son 
censure scenes and a comparable (but lesser) 
decline in accuracy of discrimination by Goods 
for father-censure pictorial stimuli. Kreinik 
found selective failures by Poors to improve. 
in a conceptual task when first exposed to 
stimulus contents depicting maternal censure 
and a comparable effect upon Goods tested 
initially on a father-censure scene. 

These findings of a greater tendency toward 
ascription of maternal dominance (with a 
concomitant sensitivity by patients to relevant 
stimulus cues) in the families of Poors and of 
greater paternal dominance in the case of 
Goods would appear to be congruent with 
the more masculine identification achieved by. 
the latter. A review of the content of the 
Phillips (1953) Premorbid Scale suggests that 
to be scored as a good premorbid a schizo- 
phrenic patient must have manifested the 
following behaviors: friendship patterns in 
childhood and adolescence, dating behavior 
at puberty, a capacity for achieving and main- 
taining a meaningful and stable heterosexual 
relation in adulthood, together with a pattern 
of stable interpersonal relationships prior to 
hospitalization. By contrast, poor premorbid 
schizophrenics have usually shown marked 
asociality and asexuality from childhood 
through adulthood. 

Since the father of the Goods acts in a more 
assertive manner, participates more readily 
in family decisions and, in general, maintains 
a more masculine role within the family, he 
may serve as a more effective model for male 
identification for his son. Such a hypothesis 
would be consistent with other studies drawn 
from the area of normal developmental 
processes in childhood and adolescence (e.g., 
Altucker, 1956; Mussen & Distler, 1959). 

Though fathers of Goods may provide 
models for masculine identification, the strong 
element of punishment inherent in their child 
rearing practices may induce in their sons a 
greater proneness to anxiety in situations in- 
volving failure to fulfill paternal expectations. 
Tt is of some interest to note that the events 
(Philips, 1953) which tend to be identified 
as precipitants to the psychosis in Goods often 
are related to threats to the patient's self- 
esteem and his assumed role model (eg., 
separation from wife, economic loss, etc.), 
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whereas the precipitant for Poors is often 
abstract, highly symbolic and not readily 
identifiable as a stressor. 

Speculation on factors influencing the 
development of maladaptive behavior pat- 
terns in the Poors centers on the mother’s role. 
Levy's (1943) well-known research into ma- 
ternal overprotection has shown that the 
consequences of marked dominance in mothers 
is excessive obedience, docility, and sub- 
missiveness in their male children, accom- 
panied by an increased estrangement from the 
peer culture. Levy reports that the combina- 
tion of overprotectiveness and dominance in 
some of his mothers markedly reinforced the 
dependency solicitations of their male children. 
Such combinations of attitudes are not unlike 
those frequently reported as characteristic of 
mothers of schizophrenics. This pattern, in 
combination with the heightened conflict 
observed in the interactions of parents of 
Poors (Farina, 1960), possesses some of the 
elements of the “double-bind” (Bateson, 
Jackson, Haley, & Weakland, 1956) and may 
serve as one type of antecedent for the social 
withdrawal and gradual immobilization of 
the male child who subsequently develops a 
more chronic form of schizophrenia. 


Alternative Explanations 


Two alternative explanations must be 
considered in accounting for the obtained re- 
sults: variations between groups in social 
class status and differential adherence to an 
acquiescence response set. 

Social class. The studies of social class mem- 
bership and child rearing practices suggest 
that middle class families tend toward greater 
maternal permissiveness and paternal author- 
ity; maternal dominance tends to be more 
characteristic of lower social class member- 
ship (Bronfenbrenner, 1958; Kohn & Clausen, 
1956). To evaluate whether variations in social 
class membership could serve to explain the 
obtained differences in child rearing attitudes, 
ratings of father’s occupation (as a rough 
measure of socioeconomic status) were made 
for the two schizophrenic groups. There were 
only minor differences between the two groups; 
the largest proportion of fathers in both groups 
had been farmers and semiskilled factory 
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workers. Thus, social class variations cannot 
serve to explain the differing reports of the 
good and poor premorbid schizophrenic pa- 
tients. 

Response sets. Since deviance, as expressed 
to all items of the child rearing attitude scale, 
was based upon a positive response, an alter- 
native hypothesis involving the operation of 
an acquiescence response set (Cronbach, 1946) 
in the two groups of schizophrenic subjects 
also had to be considered in evaluating the 
findings of the study. It has frequently been 
observed that schizophrenic patients tend to 
be overtly more compliant and less apt to 
manifest directly disagreement or aggression. 
If such a set operated in the present study, it 
would have been revealed by a greater tend- 
ency toward positive assertions (ie, re- 
sponding “yes” more frequently) to attitudi- 
nal statements. Thus, an increase in the 
frequency of Yes responses by Poors could 
indicate either pathology in family relation- 
ships, a generalized response set, or a combi- 
nation of both. 

To test the acquiescence response set 
hypothesis, the data for both the poor and 
good premorbid schizophrenic groups in- 
dividually were subjected to a treatments X 
treatments X subjects analysis of variance. 
For Poors the main effect for Scales was highly 
significant (F = 27.9; 2 and 28 df; p < .001), 
indicating that these patients differed in the 
proportion of Yes responses they assigned to 
the three categories of items (mother and 
father combined). For Goods there was a 
comparably significant main effect for Scales 
(F = 20.96; 2 and 28 df; p < .001), and also 
for Parents (F = 4.88; 1 and 28 df; p < 05). - 
Furthermore, the Scale X Parent interaction 
was highly significant (F = 6.52; 2 and 28 df; 
$ < .005). This interaction resulted from 
heightened ascriptions of D and I behaviors 
to Father relative to Mother and a reverse 
pattern for O items. Clearly, the Goods ap- 
peared to differentiate both between parents 
and scales. Poors, too, differed in the propor- 
tion of affirmative responses they assigned 
within scales. 

Although these analyses do not refute defi- 
nitively a response set formulation, they do 
require a modification of such an over- 


PARENTAL ATTITUDES AS REPORTED BY PATIENTS 


simplified explanation of the data. If schizo- 
phrenic patients do manifest an acquiescence 
set, it is apparently one that tends to be 
initiated selectively by specific verbal cues 
such as those referring to statements of 
dominance and rejection when these are paired 


with specific parental figures. 


SUMMARY 


Forty-five poor and good premorbid (i.e., 
socially inadequate vs. adequate) schizo- 
phrenic and nonpsychiatric hospitalized normal 
control patients were asked to recall their 
early adolescence and respond to a 75-item 
child rearing attitude scale (comprising Over- 
protection, Dominating, and Ignoring state- 
ments) as they believed each parent would 
have responded to the items at that time. 
Results indicate that the subject’s level of 
social maturity and extent of attitudinal 
deviance ascribed to parents are related. Poors 
reveal maternal dominance, whereas Goods 
ascribe heightened paternal dominance in their 
responses. 

The results are discussed in terms of other 
studies of variations in premorbid adequacy 
in schizophrenia, the nature of identification 
processes in the schizophrenic disorder, and 
the potential influence of acquiescence Ie- 
sponse sets. 
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ARBITRARINESS OF FRUSTRATION AND AGGRESSION! 


JOHN J. KREGARMAN ann PHILIP WORCHEL 
University of Texas 


N important modification in frus- 
Bos mem theory (Dollard, 
Doob, Miller, Mowrer, & Sears, 
1939) was proposed by Pastore (1952), who 
contended that whether aggression appears 
following frustration depends on the arbi- 
trariness of the frustrating agent. When 
subjects perceived frustration as reasonable or 
justifiable, they reported fewer aggressive 
responses on à questionnaire than when they 
perceived it as arbitrary or unreasonable. 
Rothaus and Worchel (1960) pointed out that 
the reduction of aggression under nonarbitrary 
frustration may be due not to lowered instiga- 
tion to aggress but to response inhibition 
aroused by the social nature of the situation 
described in the questionnaire. If this hy- 
pothesis is valid, then Pastore’s findings can 
be directly incorporated in frustration- 
aggression theory. By using à projective 
version of the Pastore questionnaire, Rothaus 
and Worchel partially confirmed the hypothesis 
of response inhibition. Their results, however, 
still showed a significantly greater number of 
aggressive responses in the arbitrary than in 
the nonarbitrary situations, a difference that 
could have been due to the failure to remove 
all inhibition by the projective technique. 
The question still remains, therefore, 
whether the reduction of aggression under non- 
arbitrary frustration is due to the reduction in 
drive level or to response inhibition. Tf it is due 
to response inhibition, greater displacement 
of aggression towards innocent targets Or 
towards oneself (Dollard et al., 1939) should 
occur in the nonarbitrary than in the arbitrary 
frustration. The present study was planned to 
test this hypothesis. Furthermore, since the 
foregoing studies all dealt with hypothetical 
situations, the reduction of aggressive responses 
reported under nonarbitrary frustration may 
also be attributable to social desirability. One 
might be more likely to give responses in the 


1 This research was supported in whole or in part 
by the United States Air Force under Contract No. 
AF 49(638)-460 by the AF Office of Scientific 
of the Air Research and Development Command. Refer 
to AFOSR Document No. TN 60-1375. 


socially expected direction to reasonable situa- 
tions described in questionnaires. To minimize 
the possibility, the present study induces arbi- 
trary and nonarbitrary frustrations experi- 
mentally. 

Tn addition, the present study seeks to iso- 
late the influence of two different conceptions 
of “arbitrariness” that have been confounded 
in previous studies (Cohen, 1955; Pastore, 
1952; Rothaus & Worchel, 1960). In one case, 
arbitrariness has been conceived as expectancy 
of frustration. Under arbitrary conditions, the 
subjects’ expectations are violated. Thus, Hor- 
witz (1956) finds evidence that when members 
of a group are led to believe that they have 
equal weight in determining a decision, hos- 
tility is evidenced towards a “leader”? who as- 
sumes more than bis legitimate weight in the 
final decision. Pastore (1950) suggested that 
the aggression aroused in the experiment by 
Sears, Hovland, and Miller (1940) was due to 
the “transgression by the experimenter of the 
terms of the experiment, under which the sub- 
jects agreed to participate” (p. 276). In other 
cases, arbitrariness has been equated with 
reasonableness (Allison & Hunt, 1959; Pastore, 
1952; Rothaus & Worchel, 1960). Reasonable- 
ness refers to some dimension of justifiableness 
(Pastore, 1952), that is, the imposed frustra- 
tion is seen as necessary or justifiable under the 
conditions of the situation. A frustration may 
be unexpected but perceived as reasonable or 
be expected and still judged unreasonable. The 
present experiment is an attempt, therefore, 
not only to test the hypothesis of response in- 
hibition under nonarbitrary frustration, but 
also to test the effects of expectation and rea- 
sonableness on the arousal of aggression. 


METHOD 


Subjects 
The subjects were 76 male students selected from 


the undergraduate psychology courses to fila 2 X 2 
i ign (i cy X reasonableness). They 


were requested to report for one of eight experimental 


id any social o 
peychology class, subjects for each experimental session 
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came from different sections. No experimental session 
contained less than 7 or more than 13 subjects. 


Experimental Variables 


Each experimental session comprised an independent 
replication of the experiment. Frustration was imposed 
by the use of an "insult-failure" technique in the 
administration of an "intelligence" test. Previous 
studies have demonstrated that the technique was 
quite successful in arousing hostility (Worchel, 1957, 
1958). 

Expectancy of frustration was aroused by describing 
the nature of the frustration, that is, verbal comments 
by the examiner during the administration of the test. 
Reasonableness of frustration was introduced by stating 
a reason for the frustration, that is, to study the effects 
of pressure and distraction. It was found, however, 
that while expectancy does not necessarily involve 
reasonableness, the introduction of reasonableness does 
arouse an expectancy of frustration if we adhere to the 
foregoing definitions. In order to attempt to vary each 
dimension independently, reasonableness was not 
only defined as “a reason for the frustration” but also 
justifiable in the sense that the test was understood as 
being administered in the same way to all students 
(used in the condition of high reasonableness and low 
expectancy). In other words, the frustration is not 
arbitrarily directed only towards the present group of 
subjects. The instructions, therefore, for each of the 
four conditions were: 

High Reasonableness-High Expectancy (HR-HE). 

You will be given a test which has been given to a 

large group of college students. This is part of a study 

of how people function under pressure. Here the 
examiner will attempt to distract you, urge you to 
work very rapidly, and tell you what to do while 
you are working. Your scores will be entered in your 
academic files. Do as well as you can. Do not turn this 
page until you are given the signal to begin. 

High Reasonableness-Low Expectancy (HR-LE). 

You will be given a test which has been given to a 

large group of college students. While the test 

administration may seem strange, the way this test 
is given to you is the same way it has been given to 
all students. Your scores will be entered in your 
academic files. Do as well as you can. Do not turn 
this page until you are given the signal to begin. 
Low Reasonableness-High Expectancy (LR-HE). 

You will be given a test which has been given to a 

large group of college students. The examiner will 

try to keep your mind from wandering, urge you to 
work rapidly, and, if necessary, tell you what to do 
while you are working. Your scores will be entered 
in your academic files. Do as well as you can. Do not 
turn this page until you are told to begin.? 

Low Reasonableness-Low Expectancy (LR-LE). 

You will be given a test which has been given to a 


2 Though some indication of a reason is given for 
the frustration, that is, to keep your mind from wander- 
ing, the reasonableness is judged to be less than in the 
condition of high reasonableness (HR-HE), that is 
where subjects are informed that it is a study of how 
people function under pressure. 
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large group of college students. It has been found 
that this test is a fairly accurate predictor of how 
well people do in college. Your scores will be entered 
in your academic files for future references. Do as 
well as you can. Do not turn this page until you are 
given the signal to begin. 

Approximately one quarter of the members of each 
experimental group were given HR-HE instructions, 
one quarter HR-LE, etc. Of the final sample, an equal 
number of subjects had received each condition, 

The strength of each subject’s tendency to express 
aggression toward the examiner, express aggression 
toward himself, and express aggression toward mildly 
negative stereotypes was obtained by the use of a test 
of Insight and Social Sensitivity and a Sentence Com- 
pletion test. The former test consisted of statements 
about the examiner and the subject’s performance on 
the test. Each statement was followed by a six-point 
scale in which the subject was asked to express the 
extent of his agreement with the statement. A high 
Score indicates agreement with a negatively phrased 
statement or disagreement with a positively phrased 
one. Two scores were obtained from this instrument: 
direct aggression toward the examiner, and self- 
aggression. 

The Sentence Completion test consisted of eight 
stems selected from Zimmer (1959) that were reported 
to measure the tendency of a subject to project aggres- 
sion. These stems require some sort of reaction to a 
moderately negative stereotype. The number of 
responses to these stems classified by Zimmer as indicat- 
ing displacement was then taken as a measure of the 
subject’s inclination to attack such stereotypes. 


Procedure 


The examiner's assistant passed out the University 
of Texas Intelligence Test which had been stacked 
alternately on the basis of the instructions for reason- 
ableness and expectancy. The examiner introduced 
himself as a professor of psychology. He informed 
the subjects they would be given intelligence tests, 
attitude scales, and other tests. The results would 
go on their permanent records and be used for voca- 
tional guidance, recommendations for scholarships, 
admission to graduate school, and so forth. Subjects 
were then told to read the instructions on the first page 
of the test. As previously outlined, differences in these 
general instructions defined the independent variables 
of this study. During the administration, the experi- 
menter would talk frequently to distract the subjects. 
He told them to “skip the ones you don’t know,” 
“you will be penalized for guessing,” “you are working 
too slowly,” “can’t understand why you can’t finish 
each test in the time allotted,” etc. Time limits for 
each test were so short that very few subjects were able 
to finish any of the subtests even though they had 
been informed that most subjects should be able to do 
so if they worked rapidly. 

At the end of the session, the purpose of the experi- 
ment was explained, as well as the fact that the Univer- 
sity of Texas Intelligence Test was not an intelligence 
test. 
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RESULTS AND DISCUSSION 


As outlined, the experiment consisted of a 
2 X 2 factorial design with eight replications. 
As subjects within any given replication cannot 
be thought of as independent, sessions were 
treated as an independent variable. Therefore, 
a three-way analysis of variance was computed 
for each of the three dependent variables em- 
ploying the mean scores for each cell. In such 
an analysis Sessions X Reasonableness is the 
proper error term for the Reasonableness di- 
mension, Session X Expectancy of Frustration 
for the expectancy of frustration dimension, 
and Sessions X Expectancy of Frustration X 
Reasonableness for the three two-way interac- 
tions. As McNemar (1955) points out, this 
model allows no test for either the Sessions 
variable or the three-way interaction. 

Table 1 contains the grand means on each 
dependent variable in the four experimental 
conditions. 

As can be seen in this table, less aggression 
was expressed towards the examiner when 
there was high expectation of frustration than 
when there was low expectation of frustration. 
As all theoretical positions predicted this re- 
sult a one-tailed test is appropriate to test its 
significance. Table 2 shows that this difference, 
obtained on the basis of expectation of frustra- 
tion, is significant at about the .03 level. Rea- 
sonableness did not show any significant differ- 
ences. Perhaps the instructions were not 
really effective in inducing two sufficiently ex- 
treme degrees of reasonableness and therefore 
the present experiment is not conclusive as to 
the effects of reasonableness on the direct ex- 
pression of aggression. It does show; however, 
that expectancy is an influential factor in 
affecting the direct expression of aggression. In 
Such items (Pastore, 1952) as “You are waiting 
on the right corner for a bus, and the driver in- 
tentionally passes you by,” and “Your date 
phones at the last minute and breaks the ap- 
pointment without an adequate explanation," 
the factor of expectancy is very Strong. 
Though Pastore (1952) considered this possi- 
bility, he still felt that it is the degree of justi- 
fiableness of the action of the frustrating agent 
that arouses aggression. The present results 
do not confirm this interpretation. 

In other contexts, Bateson (1941) and Berko- 
witz (1960) argue for the recognition of the 
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TABLE 1 


MEANS oF THE DEPENDENT VARIABLES FOR EACH 
oF THE FouR EXPERIMENTAL CONDITIONS 


High Expectation | Low Expectation 


Variable 


High Low High Low 
Reason- | Reason- | Reason- | Reason- 
ableness | ableness | ableness | ableness 
Direct aggression 13.48 | 14.60 | 16.33 | 16.52 
Self-aggression 20.64 | 18.91 | 19.17 | 15.16 
Other-aggression 4.13 | 4.54 | 4.72 | 4.36 
TABLE 2 


ANALYSIS OF VARIANCE OF DIRECT AGGRESSION 
AND SELF-AGGRESSION SCORES 


Direct " 
ts Self-aggression 
Source apne 

MS F MS F 
Reasonableness (R) 1 | 3.39| .21 165.90 8.63* 
Expectancy (E; 1 45.36,5.40* 54.5019. 84** 
Sessions (S) 7|1.35 — | 5.07) — 
RXE 1 | 1.72| .13 |10.42| 1.53 
SXR 7 116.43/1.26 | 7.64| 1.12 
SXE 7 | 8.41} .65 | 2.75] -40 
SXRXE 7 13.03, — | 6.80} — 


* Significant beyond the .05 level. 
** Significant beyond the .01 level. 


importance of expectation of frustration in ex- 
plaining the frustration-aggression sequences. 
Bateson suggests that the Balinese might be 
infinitely willing to suffer interruption because 
their childhood experiences have led them to 

ect frustration continually. Berkowitz has 
demonstrated that aggressive reactions to frus- 
tration are indeed more intense when frustra- 
tion is unexpected. 

Evidence for a response inhibition interpre- 
tation of the effects of nonarbitrariness on the 
reduction of aggression is obtained from the 
analysis of the results of self-aggression. Table 
2 shows that the F ratios for both reasonable- 
ness and expectancy are significant at the .05 
and .01 levels, respectively, (two-tailed test). 
Though the subjects under high expectancy 
expressed less aggression toward the frustrating 
agent, they showed significantly higher aggres- 
sion toward the self. This finding is in line 
with the implication that response inhibition 
would result in greater displacement toward 
the self or other objects. Also, the subjects pre- 
sented with a justification for frustration (high- 
reasonableness) expressed significantly greater 
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aggression toward the self than those under the 
low-justification condition. This result is some- 
what surprising in view of the fact that there 
was no significant difference between the two 
groups (high and lowreasonableness) in aggres- 
sion toward the frustrating agent. Perhaps with 
high reasonableness the self is really perceived 
as the frustrating agent. In other words, when 
the subjects were told that they would undergo 
a test of their ability to function under pres- 
sure, failure under such conditions might be 
conceived as a reflection of their own inability. 
In the Pastore (1952) questionnaire, such items 
as ‘You're a private in the army and you apply 
for a promotion which is denied to you, and the 
promotion is given to a private whose qualifica- 
tions are superior to yours," or “Your instruc- 
tor gives a difficult examination for which you 
are poorly prepared," (nonarbitrary frustra- 
tions), really emphasize the self as the ap- 
propriate obstacle to goal attainment. Under 
these conditions, aggressive tendencies toward 
the self are likely to be high, and as would be 
predicted by the present results, Pastore found 
less aggression expressed in these items than 
in the arbitrary set of questions. 
There were no significant differences in pro- 
jective aggression. 


SUMMARY AND CONCLUSIONS 


"The present study was conducted to test 
the differential implications of two theoretical 
positions, namely, reduced level of drive or 
response inhibition, in accounting for the re- 
duction of aggression under nonarbitrary frus- 
tration, and to isolate the influence of ex- 
pectancy and reasonableness on the expression 
of aggression. 

Seventy-six male undergraduate students 
were given a bogus intelligence test under frus- 
trating conditions. Following the administra- 
tion of the test, measures were obtained of 
each subject's tendency to attack the frus- 
trator, himself, and slightly negative stereo- 
types. Subjects were tested in eight groups, 
each of which had from 7 to 13 subjects. An 
equal number in each group received one of 
four different general instructions to the in- 

telligence test. These instructions differed in two 
dimensions: reasonableness and expectancy. 
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To one fourth of the subjects, the frustra- 
tion was presented as expected and reason- 
able; to another fourth, as expected and un- | 
reasonable; to another fourth, as unexpected 
and reasonable; and to the remaining subjects, 
as unexpected and unreasonable. 

The results indicated that in the present 
situation: 

1. The expectation of a frustration reduces 
the tendency to express aggression toward the 
frustrator. 

2. The reasonableness of a frustration does 
not seem to significantly affect the amount of 
aggression expressed toward the frustrator, 
but this may be due to the failure to suffi- 
ciently vary this dimension. 

3. Both expectation of frustration and 
reasonableness of frustration increase the tend- 
ency to express aggression towards the self. 

Of the two hypotheses, reduced level of 
drive vs. response inhibition in the reduction 
of aggression under nonarbitrary frustrations, 
the latter is more tenable. Furthermore, the | 
results suggest that under reasonable or justi- 
fiable frustrations, the subject himself may be 
perceived as the direct agent of frustration. 
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CRITIQUE AND NOTES 


POSTDECISION AFFINITY FOR INCOMPATIBLE INFORMATION 


SIDNEY ROSEN? 
Marquetie University 


N recent years Festinger (1958) has advanced a 
theory of engaging simplicity and imposing 
scope: the theory of cognitive dissonance. A der- 
ivation from this theory suggests that once a 
person has decided in favor of a particular course 
of action as against some alternative course, he 
tends to seek out information consistent with his 
choice and to avoid information that runs counter 
to his choice, particularly on a matter of im- 
portance. 

An attempt was made by Mills, Aronson, and 
Robinson (1959) to test this proposition. They 
hypothesized that the more important the decision 
the greater the preference for positive information 
about the chosen alternative than for positive in- 
formation about the rejected alternative, and that 
the converse would hold for negative information. 
Much of their procedure was adopted in the present 
experiment, 

In order to test their hypotheses, Mills et al. 
(1959) asked college students to choose between 
taking an objective or an essay examination, with 
the understanding that the choice, once made, was 
“irrevocable.” They were told that their instructor 
was cooperating with an Educational Practices 
Committee of the Psychology Department, which 
was interested in finding out whether students 
would do better on examinations of their choice. 
To vary decision importance some subjects were 
told on the questionnaire that the exam would 
count 70% of their course grade, others that it 
would only count 5%. Subjects were also asked to 
rate, on a six-point scale, how difficult it had been 
to make the choice which they had just recorded; 
the purpose intended was to establish whether 
on importance had been successfully manipu- 
ated. 

After the questionnaires were collected, the stu- 
dents were presented with a contrived, annotated 
listing of six articles, three on objective and three 
on essay exams, and asked to indicate which refer- 
ences they preferred to read by ranking all six in 
order of preference. The list of article descriptions 
for the Positive Information condition contained 

` phrases such as “in favor of,” “more,” “better,” as 
opposed to such phrases as “against,” “ess,” 
“poorer,” which appeared in the Negative list, 


1 The author is indebted to John I. Johnson, Jr. 
and George Zimny for suggesting important improve- 
ments in the experimental design. 
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Decision importance did not appear to influence 
the tendency to select prochoice articles. Dis- 
sonance reduction theory received support only in | 
the Positive Information condition. The authors 
attributed their success with positive but not with 
negative information to two interacting tendencies: 
the seeking of supportive information and avoid- 
ance of discrepant information, and the seeking of 
more information about the chosen alternative. -+ 

While this explanation seems plausible, there isa 
possibility that their Negative condition failed to 
work partly because people are disinclined or un- 
accustomed to choosing among unfavorably de- 
scribed pieces of information. A negative element 
can be stated without necessarily employing un- 
favorably toned wordings like “poorer.” For ex- 
ample, one might present information suggestin| 
the advisability of abandoning a particular alterna: 
tive in favor of another alternative. If after a person 
has finally chosen Alternative A and rejected B, he ` 
expresses a preference for information that advises 
people who like B to switch to A, as against in- 
formation which advises people who like A to 
switch to B, then it could be inferred that his 
preference was indicative of a dissonance reducing ' 
tendency. 

The present experiment is concerned with the 
effects of this modified version of negative informa- | 
tion, and is also an attempt to replicate the positive ~ 
information condition of Mills et al. (1959). By 
incorporating both kinds of information into the 
same experimental condition, it becomes possible 
to examine their interrelationship within the same 
respondents, 

The experiment proper was supplemented by a 
preliminary exploration of individual differences in 
the tendency to reduce cognitive dissonance. One 
possibility that immediately suggested itself follows 
from the assumption that to retain dissonant | 
cognitions is to risk being wrong. If some indi- 
viduals set unusually high store by the avoidance 
of such risks, they could be expected to try to avoid — 
risks in other situations as well. It therefore seemed 
appropriate to study the relation of dissonance re- 
ducing behavior to cognitive variables known to be 
inversely linked to risk avoidance. Two such 
variables are decision certainty and that kind of 
cognitive style known as width of categorizing. Both 
variables have been used in the study of individual 
differences (Kogan & Wallach, 1960; Pettigrew, 
1958; Wallach & Kogan, 1959). For example; 
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Pettigrew (1958) and Wallach and Kogan (1959) 
have demonstrated that females use narrower 
ranges to define the limits of a given category. 
Wallach and Kogan (1959) have shown that this 
sex-linked conservatism is partly dependent on the 
degree of decision certainty, although females 
generally expressed less certainty about their 
opinions than did males. The hypothesis was there- 
fore entertained that the dissonance reducing 
tendency is inversely related to decision certainty 
and to category width. 


PROCEDURE 


The entire class of 77 male and 43 female intro- 
ductory psychology students at Marquette University 
were asked during their first day of class to choose 
between taking an objective or an essay examination. 
‘The instructions were essentially the same as those of 
Mills et al. (1959). Decision importance was varied in 
the same manner. To check on the perceived differential 
importance of the two decision making situations, the 
subjects were likewise asked to rate their decision 
making difficulty. The questionnaires were then col- 
lected. Subjects were then presented with an annotated 
listing of three fictitious pro-objective and three 
fictitious pro-essay articles, and asked to indicate 
which references they preferred to read, by ranking all 
six articles. Negative information was introduced by 
having two of the six references read as follows: “These 
authors present some evidence that students who 
prefer essay exams generally do a lot better on objective 
tests,” and “the results of this study suggest that 
students who like objective tests would probably do 
much better to switch to essay.” Article descriptions 
were counterbalanced by using two lists differing in 
the substitution of the word “objective” for the word 
“essay,” and vice versa. 

Eleven days later these students were instructed to 


‘complete an abbreviated version of Pettigrew's Cate- 


gory Width (C-W) Scale, and were also asked to indicate 
by means of five-point rating scales how certain they 
were of their judgment as to the range of each category 


- of information. They were told that the investigator 


hoped to find out how well their accuracy in making 
estimates correlated with their scores on the Scholastic 
Aptitude Test. Category-width scores were computed 
in the manner described by Pettigrew (1958), and 
certainty scores were obtained by summing individual 


\ ratings, 


RESULTS 
Tt is clear that the sum of ranks assigned to any 


| three of six articles has a possible theoretical range 
| Of 6-15, and that if the articles favoring each kind 
' of exam are equally preferred, then the summed 
| ranks accorded either kind should be 10.50. Conse- 


quently, average summed ranks for the three pro- 
decision articles smaller than 10.50 would indicate 
postdecision selectivity in favor of the decision. 
The over-all mean summed ranks for the three 
Prodecision articles was 9.41, which when com- 
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TABLE 1 


MEANS FOR THE SUM OF Ranks oF THE THREE 
ARTICLES ABOUT THE CHOSEN EXAMINATION 


j Im Mean 
Choice por | -s is 
sce ral A 
Objective High | 49| 9.26 | 3.82 | .001 
Low 45 | 9.53 |2.39 | .05 
Essay High | 21| 9.09 |3.44 | .01 
j Low 20 | 9.85 | 1.07 | ns 
Combined — |135| 9.41 | 5.20) .001 


Note.—t tests are based on the difference between each mean 
and 10,50. 


pared by t test with the hypothetical mean of 10.50 
is significant in the appropriate direction at the 
.001 level. The trend was consistent across con- 
ditions (see Table 1). In fact, two-thirds of all 
subjects yielded prodecision sums of 10 or less. 

Decision importance again failed to contribute 
significantly to dissonance reduction, despite the 
prior report by the subjects of having greater 
difficulty in reaching a decision in the High Im- 
portance condition (p « .02). It may be that the 
High Importance condition was trivial, for only a 
third of the subjects used that half of the scale 
which indicated greater difficulty. 

A comparison between the ranks accorded to the 
two articles that recommended reversal of choice 
indicated that the article advising change from the 
chosen to the nonchosen type of exam was preferred 
by a significant majority (67%) in both choice 
groups to the article advising a switch from the 
nonchosen to the chosen type (p < .001, chi square 
test). Furthermore, compared to the other subjects, 
those who preferred the former article to the latter 
expressed a significantly lower preference (p < .02; 
chi square test), but still a clear preference, for the 
remaining prochoice articles as against the remain- 
ing antichoice articles. 

Concerning individual differences in risk taking, 
male subjects were again found to use wider cate- 
gories (P < .001, ¢ test, two-tailed) on the C-W 
Scale and to express more certainty regarding their 
judgments (^ < .01, £ test, two-tailed) than female 
subjects. No relationship was found between these 
two risk Sevens measures, The sex difference on 
dissonance reduction, though not significant, was 
consistent. Consequently, MET scores on the cw 
and the certainty scaleswere compared, separately 
for each sex, with dissonance reducing Scores. An 
inverse relationship, significant only in the case of 
the males (p < -05, chi square test, two-tailed), 
was found between the dissonance reducing 
tendency (preference for prodecision articles) and 
category Width. Dissonance reduction showed nO 
relationship to decision certainty. 
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DISCUSSION : 

Evidence is again provided to indicate that the 
effects of dissonance reducing tendencies are wide- 
spread. Empirical support is also found for an 
opposing, though somewhat weaker, tendency the 
existence of which was suspected by Mills et al. 
(1959), namely, that people seek more information 
about their choice, even if the information is ad- 
verse, It remains to be determined what immediate 
goal this opposing tendency serves, whether the 
tendency indicates a post-mortem curiosity, or 
misery seeking its own company, or a positive 
interest in learning how to choose more wisely in 
the future. 

The inference that the dissonance reducing 
tendency entails risk avoidance receives some cor- 
relational support in relation to width of categoriz- 
ing, but no support in relation to decision certainty. 
This partial failure may have resulted from the 
fact that all three measures necessarily yielded a 
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restricted range of scores, particularly in the case 
of female subjects. Nevertheless, this line of in- 
vestigation appears sufficiently encouraging to 
merit further consideration. 
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A COMPARISON OF ACTUAL AND PAPER-AND-PENCIL VERSIONS 
OF THE WATER JAR TEST OF RIGIDITY? 


| LAWRENCE S. WRIGHTSMAN, Jr. AND ALFRED A. BAUMEISTER? 


LrHOUGH the water jar Einstellung test has 
been used widely, it has been questioned as 

a valid measure of rigidity. Levitt (1956), for ex- 

ample, found few studies in which the water jar 

test demonstrated either construct validity or con- 
current validity. But, as Chown (1959) points out, 
the limitations of the water jar test may lie in the 
version employed. Since the traditional paper-and- 
pencil version merely involves the drawing of 
arrows from one figure to the others, the continued 
use of the longer, or "rigid," method of solution 
_ when the shorter one is available may actually be 
more efficient. In a real-life situation, however, it 
may be more advantageous to change one's mode 
of attack because a new and shorter solution would 
take less time and effort. 
The study of Luchins and Luchins (1950), ap- 
parently the only one employing actual water jars, 
- found that concretizing the problems did not elimi- 
nate the Einstellung phenomenon. Although the 
study lacked a control group, the authors compared 
the results with those of similar subjects in previous 
studies using the traditional method and concluded 
that the use of actual jars somewhat reduced the 
. amount of rigidity. Luchins and Luchins' recent 
review (1959) presents no additional studies using 
actual jars. According to the first hypothesis of the 

Study, then, the traditional paper-and-pencil 
. version of the water jar test is expected to produce 
, more set-maintaining responses, and hence more 
rigidity—as it is operationally defined—than a 
Version requiring overt behaviors. 

Beyond the issue of the validity of the particular 
Version, the water jar test as a predictor of person- 
ality rigidity is of an unknown usefulness. Accord- 
Ing to the relevant past research, measures of be- 
 havioral rigidity appear to be little related to 
. tigidity as a general personality trait, but the lack 
_ of relationship could be due to inadequacies in the 
Paper-and-pencil method used to measure be- 
havioral rigidity. It is therefore hypothesized that 
. à more realistic version of the water jar test should 
yield higher relationships with personality rigidity 
than should a paper-and-pencil version. 

J An earlier draft of this paper was presented at the 
Midwestern Psychological ^ Association meetings, 
St. Louis, April 29, 1960. 

* The authors wish to acknowledge the suggestions 
9f C. J. Bartlett and R. C. Norris on the design and 
analysis of the data of this study. 
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George Peabody College for Teachers 


METHOD 


The subjects were 76 undergraduates enrolled in a 
course in general psychology. Each was randomly 
assigned to work on a paper-and-pencil version of the 
water jar test or to work with actual containers, The 
two experimenters alternated in supervising the two 
conditions. 

The 38 subjects in the Paper-and-Pencil condition 
(P&P) were individually administered a form of the 
water jar test in which all of the problems were pre- 
sented on the same sheet of paper. The examples, 
problems, and instructions were identical to those 
used by Luchins (1942) in his initial study, with the 
exception that his extinction problem was not used and 
the size of the largest jar in one problem was slightly 
reduced to satisfy the available sizes of milk cartons 
used in the Actual Water Jar (AWJ) condition. The 
reasons for omitting the extinction problem are similar 
to those listed by Rokeach (1949). Each subject was 
insi ed to draw arrows indicating which jar he 
would fill first and where he would pour. The two 
examples and the eight problems all could be solved 
by a “long” formula which involved filling Jar B and 
from it filling Jar A once and Jar C twice. The last 
four problems could also be solved by shorter formulas 
involving the use of only Jars A and C. If the subject 
employed some idiosyncratic solution in one of the 
first four problems, the experimenter pointed out to 
him how it could also be solved by the set-establishing 
solution. With encouragement, all subjects were 
eventually able to complete the problems, the time 
taking from 2 to 30 minutes. All subjects were used in 
the analysis of the data. 

The 38 subjects in the AWJ condition were admin- 
istered the same problems. For each problem, the 
subject was given milk cartons, labeled with the 
amounts to be used. Cartons from half-gallon to half- 
pint sizes were used with measuring units changed to 
cubic inches. The experimenter demonstrated the two 
examples by filling the jars with water from a faucet 
and pouring from one jar to another or into a sink 
until the necessary amount was obtained. The subject 
was furnished a sheet listing the amounts to be obtained 
for each problem and was permitted to do computations 
prior to filling the jars. As the subject filled and poured, 
the imenter recorded the type of solution used. 
Rigidity in the water jar test was then measured by 
determining the number of problems on which the 
subject used the long or set-maintaining formula when 
a shorter formula was available. The subjects’ responses 
were dichotomized on the basis of the presence or 
absence of at least one short solution in the four critical 


problems. 


‘After solving the water jar problems, each subject 
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individually completed the following three inventories: 

1. The California F Scale (Adorno, Frenkel- 
Brunswik, Levinson, & Sanford, 1950), which has 
been found to be related to behavioral rigidity 
(French, 1955). The version used was the 34-item 
Form 60. 

2. The Wesley Rigidity Scale (1953), a set of 50 
items similar to those of personality inventories, 
interspersed with 17 filler statements. Scores on the 
scale have been found to be related to the length of 
time subjects persevere in a concept-formation 
task before shifting to a more appropriate set. 

3. The Ri (Rigidity) Scale by Rehfisch (1958), a 
set of 39 items also similar to those of personality 
inventories. The scale was constructed by selecting 
items which differentiated between samples of rigid 
and flexible subjects, as classified by staff psycholo- 
gists at the Institute of Personality Assessment and 
Research. The split-half reliability of the scale, for a 
sample of 60 Air Force captains, was .72. 

All measures were scored so that higher scores 
indicated greater amounts of rigidity. 


RESULTS 


Table 1 presents the frequency of long solutions 
in each condition. More than 55% of the subjects 
always used the longer formula in the four critical 
problems, but there was a significantly greatfr use 
of the longer formula in the P&P condition. A x? 
analysis of the scores on the basis of the dichotomi- 
zation of responses in the two conditions produces a 
value of 5.4, significant at the .02 level. The first 
hypothesis and Luchins and Luchins’ previous 
findings (1950) are then confirmed, in that less 
rigidity was manifested when subjects used actual 
jars than when the test was done with paper and 
pencil. 

Table 2 presents the results relevant to the 
second hypothesis. Subjects’ scores were again 
dichotomized on the basis of the presence of at least 
one short solution, and biserial correlations were 
computed between water jar rigidity and scores on 
each of the inventory measures. The second hy- 


TABLE 1 
NUMBER or SUBJECTS EMPLOYING SET-MAINTAINING 
SoLUTIONS 
Number of Long Actual Paper-and- 
luti th 
Sea Problems | Condition Camdin . Total 
Four 17 27 44 
Three 6 2 8 
Two 3 1 4 
One 6 1 7 
None 6 7 13 
38 38 76 


Note.—x? for four long solutions vs. other classifications = 
5.4 (p < .02). 
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TABLE 2 
BisERIAL CORRELATIONS BETWEEN RIGIDITY AS 
MEASURED By Two VERSIONS OF THE WATER JAR 
Test AND INVENTORY MEASURES OF RIGIDITY 


Actual —— Paper-and- 
Water Jar 'encil 
Version Version 
Inventory Measure (N = 38) (N = 38) 
F Scale .02 .48* 
Wesley Rigidity Scale .28* —.24 
ERAS Ri (Rigidity) .18 .09 
ie 


* Significant at.05 level using s transformation (Walker & Lev, 
1953, pp. 269-270). 


pothesis was not fully confirmed, as the correla- 
tions with inventory measures were not always 
higher in the AWJ condition. In the case of the two 
inventories expressly designed to measure person- 
ality rigidity, the actual water jar situation pro- 
duced higher correlations, but significantly so only 
in the case of the Wesley scale. The reverse was 
true for the F Scale, for which only the P&P cor- 
relation is significantly different from zero. 

Percentile scores on the ACE Psychological 
Examination, given at freshman orientation, were 
available for 26 AW] subjects and 25 P&P sub- J 
jects. Biserial correlations between Linguistic 
percentile score on the ACE and rigidity as meas- 
ured by the water jar problems were —.70 for the 
AWJ condition and —.65 for the P&P condition. 


Discussion 


Why is it that less rigidity is shown when sub- 
jects use actual jars rather than when they draw 
arrows on paper? The reason may lie in the effort- 
fulness of the task. To those subjects in the P&P 
condition, it is hardly worth while to change, but 
to those in the AW] condition, a new method o 
solution is selected because of the saving in 
time and the physical effort required. 

Or, the reason may lie in some uncontrolled 
factor such as the length of time intervening be 
tween problems or the conspicuousness of previous } 
solutions. Although the average time taken to com- 
plete the set of problems was similar in the tw? 
conditions, there was a longer time interval be 
tween working problems in the AWJ condition, 
because of the replacement of jars and the empty: 
ing of water. Such a delay may have disrupted the 
subject’s set. 

The correlation of .48 between P&P rigidity and 
the F Scale is difficult to interpret, especially be | 
cause of the nonsignificance of the correspon 
correlation in the AW] condition. A possible e% | 
planation may be a general factor of acquiescenc 
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A tendency for subjects in the P&P condition to 
blindly follow the method demonstrated in the 
examples may be related to a tendency to agree 
with the statements of the F Scale. The unique 
aspects of the task in the AWJ condition, dis- 
cussed above, may remove the effect of acquies- 
cence from the water jar results in that condition. 

Despite the finding that the two versions produce 
different numbers of rigid responses, there seems 
little justification for advocating the use of either 
as a predictor of personality rigidity. The modest 
correlations between various measures of rigidity 
imply the absence of a unitary concept, and it may 
be that the water jar test is measuring little more 
than problem solving ability. 


SUMMARY 


Seventy-six subjects were individually adminis- 
tered either the traditional paper-and-pencil 
version of the water jar test or a more real-life 
version which used actual jars. A hypothesis that 
the traditional version would elicit more set- 
maintaining responses, and hence more opera- 
tionally-defined rigidity, was confirmed. However, 
when rigidity measures from each version were 
correlated with subjects’ scores on inventory meas- 
ures of rigidity, neither version produced con- 
sistently higher relationships. The modest corre- 
lations indicate the lack of a unitary concept. 
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THE COMPLEMENTARY NEED HYPOTHESIS IN NEWLYWEDS AND 
MIDDLE-AGED MARRIED COUPLES 


BERNARD I. MURSTEIN 
Interfaith Counseling Center, Portland, Oregon 


NE of the perplexing questions with regard to 

marriage choice is: Do opposites attract, or 
do people tend to choose mates with personal- 
ity patterns similar to their own? Earlier re- 
search, while far from definitive, has tended to 
support the belief that choice occurs on the basis 
of similarity. On the other hand, however, an 
hypothesis of “complementary needs” has been 
supported by Winch and his colleagues (Winch, 
Ktsanes, & Ktsanes, 1954). They define com- 
plementariness as follows: 


When two persons, A and B, are interacting, we 
consider the resulting gratifications of both to be com- 
plementary if one of the following conditions is satisfied: 
(1) the need or needs of A which are being gratified are 
different in kind from the need or needs being gratified 
in B; or (2) the need or needs in A which are being 
gratified are very different in intensity from the same 
needs in B which are also being gratified (p. 243). 

Their results, statistically favorable to their 
hypothesis (Winch, 1958), have not been dupli- 
cated in a study by Bowerman and Day (1956). 
Winch (1958) has criticized the latter results on the 
grounds that engaged and "steady" couples were 
used instead of married ones and that the Edwards 
Personal Preference Schedule (PPS), which he 
feels has no demonstrable validity, was used rather 
than the Winch need-interview. Despite these 
criticisms, the relative novelty of the PPS war- 
rants further use rather than a premature inter- 
ment in the psychological test graveyard. 

The present study used the Edwards PPS, a 
marital adjustment scale devised by Wallace 
(Locke & Wallace, 1959), and the Bass Famous 
Sayings Test (FST) (1958). The subjects con- 
sisted of two groups: newlyweds and a group 
married 10 years or more (nonnewlyweds). The 
newlywed group was thought to more nearly 
meet the requirements of mate-selection on the 
basis of complementary needs than did the sub- 
jects used in the study by Bowermen and Day 
(1956) since approximately one-third to one-half 
of engaged couples do not marry each other 
(Burgess & Wallin, 1953). The nonnewlyweds 
were used to study the effect of marriage on the 
development of complementary needs. This group 
could be viewed as a select group since by Winch’s 
hypothesis, persons who do not exhibit com- 

plementary need patterns would presumably be 
likely to terminate their marriages before 10 
years had passed. 
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The hypotheses stated in the null form were: 

1. There is no significant difference in the] 
correlations between newlywed couples and f 
randomly paired newlyweds. 

2. There is no significant difference in the f 
correlations between nonnewlywed couples and 
randomly paired nonnewlyweds. 


METHOD 


A total of 68 married couples was obtained from 
sundry sources. Using names obtained from the Regis- 
trar’s Office, approximately 45 telephone calls were 
made to married students at the University of Port- 
land, and 20 couples from this group agreed to par- | 
ticipate in the study. Other subjects were obtained | 
from married couples clubs’ at Temple Beth Israel 
(10 couples), The Unitarian Church (two groups 
totaling 26 couples), and the YMCA (12 couples). For 
all groups, other than the University of Portland | 
group, full participation of the groups contacted was f 
obtained. The newlywed group included those couples 
married less than 2 years who had no children. This n 
requirement served the purpose of making it more 
likely that the need patterns studied were those opera- 
tive in drawing the couples into marriage rather than 
those developed as a consequence of the marriage. 

The nonnewlywed group was defined as persons 
married longer than 2 years, with no restriction made 
as to the presence of children. None of the subjects | 
had ever been in psychotherapy or marital counseling 
as determined by a brief questionnaire. The groups 
are characterized in Table 1. 

The PPS results in a score for each of 15 needs, 
which were taken from the work of Henry Murray: | 
The FST presents the subject with 100 proverbs and 
maxims and asks him to check his agreement or dis: | 
agreement with each. From this list, four scores at 
derived: Social Acquiescence, Conventional Mores, f 
Fear of Failure, and Hostility. Both the PPS and FST 
have low but minimally satisfactory reliabilities and 
largely unproven validities. The Wallace scale i$ 
essentially a shortened form of the commonly us 
Terman and Burgess and Wallin marital adjustment 
scales. The scale incorporates the most discriminating 
items from each of these scales. It has been found to be 
valid in differentiating well-adjusted from maladjusted 
couples (Locke & Wallace, 1959). | 

The tests were administered to the groups afte! || 
reassurance that no names were to be put on the answet 
sheets and only code numbers were to be used. The 
couples were encouraged to be frank and honest ™ 
their answers and no subject verbalized a disinclination | 
to participate, The husbands and wives were separated | 
during the administration and no discussion was 9” 
lowed until the papers had been handed in. 
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RESULTS 


The correlation between the 20 men and their 
spouses for the 20 scores (1 Wallace-Marital 
Adjustment, 4 FST, 15 PPS) for the newlywed 
groups are listed in Table 2. 


TABLE 1 


ACTUARIAL DATA FOR NONNEWLYWED AND 
NEWLYWED GROUPS 
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In order to compare these correlations with 
those of men and women in general, use was made 
of a table of random numbers. Each man was 
randomly paired with a woman of the newlywed 
group. This method allows the use of a control 
group identical to the experimental group in 
socioeconomic level, education, and age. The 
correlations for the control group are also shown 
in Table 2, along with ¢ values for the differences 
between r's for each need (by Fisher's z trans- 


Nonne e SR ee formation). There are two significant / values 
bs) in Table 2 (p < .05); one in accord with the 
Mean age 36 yrs. 5mos. — 22 yrs, 8 mos, complementary need ; hypothesis, | fbeifiorher 

SD 6 yrs. 11 mos |. yrs: 3 moai significant at the .001 level in favor of a homo- 
Average education 334 yrs. college 2 yrs. college gamous theory of needs. The average r for newly- 

SD 1yr. 7mos, — 9mos. weds was .24; for random couples it was .04. A 
Average length of time 11 yrs. 5 mos. 1 yr. 2 mos. test of the difference between the mean of the 

married - 

SD 6 yrs. 8mos | 634 mos correlations for the newlyweds and randomly 
Average number of children 2.33 0.00 paired newlyweds did not appear feasible since 
Wife working Yes 10 (20.8%) Yes 9 (45%) the continuum of 7 does not constitute interval 
Tapen sccGmaiS E data. By using 7? as the unit of measurement, 

Catholic 3 3 however, a ratio scale is achieved, thus justifying 

Protestant 28 6 the use of the ¢ test. Accordingly, this transforma- 

Jewish 4 2 tion was undertaken and the resulting ¢ was 
aes z 3 found to be 1.64 (p > .05). 

below 4,000 1 10 Since the pattern of correlations does not favor 

4,000-6,000 20 8 either theoretical viewpoint, the null hypothesis 

6,000-8,000 12 1 is accepted for Hypothesis 1. 

ES 2 j Turning to the nonnewlywed group, it may be 

TABLE 2 


CORRELATIONS OF NEWLYWEDS AND RANDOM NEWLYWEDS WITH | TESTS FOR SIGNIFICANT 
DIFFERENCE BETWEEN GROUPS 
(N = 20 couples) 


Test Trait Pearson r Newlyweds Poemon Binaon t 

Wallace Marital Adjustment Marital adjustment .66 .28 1.47 

Famous Sayings Test n Conventional mores .48 .23 .85 
Hostility .10 .18 "25 
Fear of failure .16 13 i p 
Social acquiescence .33 .28 1 Ho 

Personal Preference Schedule Achievement —.16 +36 4 
Deference —.19 -43 .18 
Order .20 .02 .52 
Exhibition .07 —.16 y es 
Autonomy 18 .29 i 
Affiliation 31 —.02 .99 
Intraception 31 —.04 1.07 
Succorance —.08 .05 .08 
Dominance .38 —.07 1.40 
Abasement .30 —.28 1.75 
Nurturance .01 = m d 
Change 55 A 80 
Endurance —.30 .38 2 Qn 
Heterosexuality 63 —.82 5.86 
Aggression 25 .18 302. 


* Only £ in support of “complementary need" hypothesis. 
* Significant at .05 level. 
** Significant at .01 level. 
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TABLE 3 
CORRELATIONS OF NONNEWLYWEDS AND RANDOM NONNEWLYWEDS AND ¢ TESTS FOR SIGNIFICANT 
DIFFERENCES BETWEEN GROUPS 
Test Trait Pessonr Nonnew Eon r Random" (alus 

Wallace Marital Adjustment Marital adjustment E .18 1.64 

Famous Sayings Test Conventional mores .06 .20 x11 
Hostility .07 —.31 1.95 
Fear of failure .36 .08 1.48 
Social acquiescence .14 —.12 1.30 
Achievement A3 17 .20 
Deference .39 —.12 2.65** 
Order 335 01 1.75 
Exhibition = raa =; SA e 

x Autonom : = ‘ 

Personal Preference. Schedule ERE ny oe ce PEUT 
Intraception .07 —.08 en 
Succorance .08 —.05 65 
Dominance —.42 23 EE 
Abasement .10 —.08 .90 
Nurturance .34 .02 1.67 
Change .30 .20 453 
Endurance .05 .04 1 05. 
Heterosexuality .50 .05 2.49 
Aggression .20 —.02 1.11 

Average z .30* Average r .04 


Rc I PEE E A Re OP s Nt 
a Only significant ¢ in support of “complementary need” hypothesis. 


* Significant at .05 level, 
** Significant at .01 level. 


seen from Table 3 that 5 of the 20 sets of correla- 
tions differ significantly; 1 favoring the comple- 
mentary need hypothesis, 4 opposing it. If all 
of the tests were independent, the probability of 
such a split due to chance would be less than .05. 
The same subjects appear in each of the correla- 
tions, however, so no probability statements are 
made regarding the ¢ tests. Since the average r's 
of .30 for the nonnewlywed couples and .04 for 
the random nonnewlyweds suggested a difference 
between the groups, the individual r’s were con- 
verted into 7? scores and tested for differences 
between means. In this case the resultant ¢ was 
441, p « .005. The distribution does not appear 
normal, however, and the variance of the non- 
newlywed group was significantly larger than 
that for the random newlyweds. Boneau (1960) 
has shown that under similar conditions approxi- 
mately 8% of means chosen at random result in 
significant differences at the .05 level. Accordingly, 
the significance level was doubled, i.e., the value 
significant at the .005 level was considered to be 
significant at the .01 level. From this conservative 
position, the results support the homogamy view- 
point for the nonnewlywed group, thus resulting 
in a rejection of the second hypothesis. 


Discussion 


The heterogeneity-homogamy dichotomy seems 
a gross simplification of the actual marital situa- 
tion. For example, the strength of a need is not an 


omnipresent, unvarying characteristic of a person 
but varies according to the individual concerned, 
where he is and who is present. Further, some 
persons may exhibit many needs while others 
manifest only a few. Yet another problem to be 
considered is that needs change as a function of 
time. 

The assumption seems far more plausible thus, 
that for adequate marital adjustment some needs 
require complementary components in the marital 
partner, while others necessitate homogamous | 
need patterns. A proper test of this view requires 
replicative studies on individual needs, not over: 
tests of the two theories. Should a pattern includ- 
ing both complementary and homogenous needs 
emerge as determiners of marital choice we wt 
be faced with the problem of constructing à 
theoretical framework to encompass the disparate 
findings. 


SUMMARY 


The Edwards Personal Preference Schedule, 
Famous Sayings Test, and a marriage adjustment 
scale devised by Wallace were administered to 48 
middle-aged married couples and 20 newlywed 
couples in an attempt to test homogamous àn| 
complementary need theories of marital choice. 

The data, while far from conclusive, favor 4 
homogamous theory of need pattern choice for 
nonnewlyweds. The evidence for newlyweds 38 
entirely inconclusive in the sense that neither 
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the homogamous nor heterogenous theory is 

supported. 
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ESTINGER (1954), discussing the problem of 
Ds formation and maintenance of groups, 
postulated that people choose to affiliate with 
others whose abilities are congruent with their 
own. Earlier work on the level of aspiration 
(Chapman & Volkmann, 1939; Festinger, 1942; 
Preston & Bayton, 1941) indicates that the stated 
achievements of groups judged superior or in- 
ferior to one’s own greatly influences the level 
at which one sets his level of aspiration. Back 
(1951) suggests friendship for the members, and 
their perceived power, as sources of attraction to 
groups (a component of group cohesiveness). 
Pepitone and Reichling (1955) and Pepitone and 
Kleiner (1957) found that members whose sup- 
posed skill or ability contributed to advancement 
of a group toward its goal or to reduction of a 
threat to the group, receive status, approval, and 
liking from the other group members. 

Most of the studies examine the roles of liking 
and ability in groups already in existence. There 
remains the problem of the importance of these 
variables in the actual process of choosing one's 
associates. French (1956), in a study bearing 
directly on this point, asked people to choose 
their partners for pairwise work on a concept 
formation task. Partners were to be chosen from 
a group of four (including the subject), three of 
whom were rated as highly liked by each other, 
and one who though ranked low in liking by the 
other three, was shown to be the best performer in 
a preliminary session with the concept task. High 
achievement motivation was associated with 
preponderance of choices for the able, relatively 
unliked person, while high affiliation motivation 
was associated with preponderance of choices of 
liked persons. 

The present experiment varies the subject’s 
own alleged ability in a task, the perceived ability 
of possible partners, the subject's degree of relative 
liking for the possible partners, and the value of 
prizes which might be won in pairwise competi- 
tions. Data were collected in such a way as to 

yield, for each subject's choices, an index of 
whether they were purely on the basis of liking, 


purely on the basis of perceived ability, or repre- 
sented a compromise between liking and per- 
ceived ability. 

Our results show that for all ability groups, — 
partners were chosen on the basis of “general | 
liking," rather than ability, when the value of the 
prize was slight. Subjects of low presumed ability 
chose their partners on the basis of liking (rather 
than ability) regardless of the value of the prize. 
Subjects of high presumed ability chose their 
partners increasingly on a pure ability basis as 
the value of the prize increased. Subjects of inter- 
mediate ability gave results intermediate between 
those of high and low presumed ability. Thus, 
the general trend in the data is that the presumed 
ability of others becomes increasingly important 
in their choice of partners for a competition as the 
value of the prize increases. But this trend holds 
most markedly when the subject is led to suppose 
that his own ability is either at the upper end or 
in the middle of the total range of ability. When 
the subject's ability is presumably low, the ability 
of a prospective partner does not figure at all in 
his being chosen as a partner. 

The hypothesis that people choose partners $0 
as to match their own ability level does not ac- 
count for the finding that when the chooser's 
ability is high or moderate, he chooses able 
partners only when the value of the prize is great. 


MzrHOD! 


The specific hypotheses tested were: 

1. As payoff increases from mild social reward to 
logarithmically increasing monetary reward, partner 
choice is determined less by “liking” of the potential 
partner and more by knowledge of the potential 
partner’s skill. $ 

2. As level of the subject’s own perceived skill 


*A fuller treatment of the Methods and Procedures 
has been deposited with the American Documentation 
Institute. Order Document No. 6827 from ADI Auxil- 
jary Publications Project, Photoduplication Service 
Library of Congress; Washington 25, D. C., remitting 
in advance $1.25 for microfilm or $1.25 for photo 
copies. Make checks payable to: Chief, Photoduplica~ 
tion Service, Library of Congress. 


198 


tte tials Xs 


CRITIQUE AND NOTES 


varies from high through medium to low, his choice 
of partner is determined less by "liking" for the po- 
tential partner and more by the potential partner's 
presumed skill. 

Accordingly, we manipulated (a) information given 
the subject about her own ability, (b) the ability of 
the potential partners, (c) the value of rewards to be 
gained in competition, and (d) the relation between 
the presumed ability of, and the subject's liking for, 
the potential partners. 

Twenty-four student nurse subjects ranked their 
classmates according to their preference to “do things 
with" them. For each subject, the first, sixth, twelfth, 
eighteenth, and twenty-third ranked names were 
chosen. Fictitious * auditory perception test" scores 
were assigned these names by the experimenters so 
that the ability scores showed a perfect negative corre- 
lation with the “liking” rankings of the five names. The 
24 subjects were divided into three equal groups; one 
group was assigned fictitious high self-scores, another 
was assigned medium self-scores, and the third was 
assigned low self-scores. 

Fictitious self-scores as well as scores for the five 
potential partners were given to each subject, who 
was then asked to rank them according to preference 
as a partner in competition against other partnerships 
at an auditory perception task. Subjects ranked the 
potential partners under each of four payoff conditions. 


Method of Treating Data 


Kendall's S can be taken as a measure of the de- 
gree of agreement between two sets of rankings. 
Perfect agreement in two rankings of the five names 
yields an S of +10, while a perfect inverse ranking 
yields an S of —10. S values based on the sociometric 
rankings versus the partnership preference rankings 
Were computed for each subject under each of the four 
payoff conditions: merely being declared the winners, 
which we have called the “Fame” payoff; sharing a 
$1.00 prize; sharing a $10 prize; and sharing a $100 
prize. Thus, the higher the positive S value, the greater 
the tendency to retain the original preferences as 
Partners despite low skill scores. The greater the 
negative S values, the greater the tendency to choose 
partners showing high scores at the task while abandon- 
Ing the original preferences. In the nonparametric 
statistical methods used, these S values were treated as 
ordinal scores, 


RESULTS 


Figure 1 shows the mean S scores for the H, 
M, and L Groups plotted against the log of the 
amount of the payoff in dollars. 

Since the Fame payoff is not monetary, its 
mean S scores are joined to the mean S scores of 
the $1.00 payoff by dotted lines. The S scores 
of the H Group show a striking shift from positive 
on the Fame payoff to slightly negative on the 
$1.00 Payoff to increasingly negative as the pay- 
off increases, This finding reflects an increasing 
tendency to choose partners according to the skill 
they are thought to possess at the task. The S 
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Fic. 1. Mean S values for Low, Medium, and High 
Self-Score Groups against log payoff. 


scores of the M Group also shift from positive to 
negative as payoff increases but become negative 
only on the $100 payoff. The slopes of the H and 
M Groups are quite similar in the monetary 
payoffs but differ in the shift from the Fame to 
the $1.00 payoff, with a much sharper shift in the 
H Group. The L Group shows no tendency to 
shift partner choice in response to increasing 
payoff. 

These changes with payoff are significant when 
all groups are combined (Friedman’s test).* 
Hypothesis 1 is upheld. However, when each 
group is considered separately only the H Group 
shows significant change with increased payoff 
(Friedman’s test). Were there more subjects in 
the M Group the strong trend shown would 
probably be significant. 

When payoff is disregarded and mean S scores 
of the H, M, and L Groups are compared, a 
significant difference is found (Kruskal-Wallis 
test). However, contrary to Hypothesis 2, the 
H Group showed the strongest tendency to select 
partners according to skill. The L Group, on the 
other hand, showed the least tendency to do so. 
Further analysis shows that it is only on the $100 
payoff that a significant difference exists between 
these groups (Kruskal-Wallis test), and then only 


2 All differences called significant in this paper have 
$ S05. 


200 


between the H and L Groups (Komolgorov- 
Smirnov test). 


Discussion 


Subjects who were told, and believed, that they 
had high ability at the task were strongly inclined 
to choose partners who also had high skill, even 
though such choices meant abandoning their 
original personal preferences and working with 
their originally less preferred classmates. Sur- 
prisingly, these subjects responded strongly to 
even a $1.00 payoff. Apparently the assurance of a 
good chance to win even a small amount of money 
made it worth the change. Among these subjects, 
the tendency to choose partners according to 
skill is a linear function of the log of the payoff 
in dollars. 

In general, the same observations apply to the 
subjects who believed they had medium ability. 
Although they responded less markedly, the 
tendency for this group to choose partners ac- 
cording to skill is also a linear function of the log 
of the payoff. 

Subjects who were told they themselves had 
low skill scores continued to choose partners 
according to their original liking preferences, even 
though the monetary payoff increased. This is 
surprising since such choices meant that they 
had no chance of sharing the reward. The study 
throws no light on the factors determining their 
failure to respond to the payoff changes, 

Payoff conditions must be considered if one is 
concerned with forming cohesive partnerships, 
that is, partnerships where each member wishes 
to work with the other. Also of importance is the 
level of ability that the subjects concerned have, 
or think they have, at the task which is to be 
jointly performed. When payoff is small or of no 
consequence, cohesive partnerships may be formed 
in large measure on the basis of liking. When a 
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large payoff is involved, however, assigning part- 
ners on a "liking" basis might well result in non- 
cohesive partnerships. This could happen if one 
of the partners is of high ability or perceives 
himself to be so, while he believes the person 
whom he has indicated he “likes” to work with to 
be of low ability. Conversely, the potential partner 
who is “disliked” when payoff is negligible may 
well be the one preferred when payoff is large, if 
that person is seen as having high ability. 

On the other hand, if subjects perceive them- 
selves to be of low ability, it would seem safe to 
form cohesive groups on the basis of liking no 
matter what the payoff. Such subjects do not 
change their partner preferences in response to 
increased payoff. 
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or many years clinicians and researchers alike 

have been searching for some way aside from 
content to conceptualize and measure the emo- 
tionality that is expressed in interviews. There is 
general agreement that while content, or what is 
talked about, is very important, and gives many 
clues to the inner life of patients, the manner of 
speech, or how things are talked about, enriches 
the communication immeasurably. Recently the 
field of linguistics has been seen by many workers 
to offer the hope of capturing the vocal aspects 
of communication. Many of the references to 
linguistics have been impressionistic and program- 
matic: investigators have been enjoined to study 
systematically the effects of “tone of voice” in 
communications. To date only one published 
research (McQuown, 1957)! has used linguistic 
techniques in analyzing interview material. The 
study is a pioneering one, but does not attempt 
to evaluate the techniques in a systematic way. 

In the light of the present status of the field, 
we should like to report our experiences in using 
microlinguistic techniques in studying interviews. 
Our results are limited to only one aspect of 
linguistics, and a very microscopic one at that; 
so what we have to say must be regarded as a 
progress report rather than a final evaluation of the 
entire field. 

We wanted several kinds of help from linguistic 
techniques, First, at the lowest level of abstraction, 
we wanted some measure of the degree of disturb- 
ance shown in the speech. Disturbance in this 
sense may mean any deviation from the speaker's 
normal, calm, controlled state, be the deviation 
the result of anxiety, fear, anger, elation, or 
whatever. Speech disturbances as a measure of 
anxiety have been used by Mahl (1956) and Dibner 
(1958), but their methods were more closely based 
upon content and grammatical form than we 
wished them to be for our purposes. One sign used 
by both Mahl and Dibner, for example, is Sentence 
Change (Mahl) or Breaking in with New Thought 
(Dibner). The judgment of this sign depends on 
the meaning of the words and their syntactical 
Sequence, not on how the words are spoken, and 
Dibner could thus use typescripts to make judg- 
ments of speech disturbances. Our wish was for a 
technique that would depend less on content, 


, ! Since this study was completed, a report of another 
investigation has appeared using linguistic analysis 
(Pittenger, Hockett, & Danehy, 1960). 


through which we could delineate very slight 
disturbances, and thus catch changes more im- 
mediately as they occur in the interview. 

The second kind of judgment we hoped lin- 
guistic techniques could give was that of type of 
expressed affect. Is the patient asking this question 
challengingly or from puzzlement? Is the therapist 
offering this interpretation as an hypothesis or as 
a foregone conclusion? Answers to questions like 
these would call upon linguistic techniques not 
only to tell whether affect is present, but also to 
specify the meaning of the affect. 

The third kind of judgment we hoped for was 
the most general, that of specifying styles of 
communication which are important in the de- 
velopment and maintenance of psychopathology. 
Contradictions and confusions between messages 
simultaneously or sequentially communicated 
have recently been stressed as contributory to 
the etiology and perpetuation of schizophrenia 
(Bateson, Jackson, Haley, & Weekland, 1956; 
Ruesch, 1958; Wynne, Ryckoff, Day, & Hirsch, 
1958). Statements at this level of abstraction, if 
they are to be based upon linguistic coding, ob- 
viously demand a great deal of the coding system 
in all of its aspects. 

The system of linguistic analysis we have used 
was developed by Trager, Smith, and Hall. 
G. L. Trager served as a consultant to our program 
for 2.5 years, and the writers learned techniques 
of coding from him directly? Descriptions of the 
system may be found in Trager and Smith (1951), 
Pittenger and Smith (1957), and Trager (1958). 
Only certain aspects of the total system were 
considered for this study, those which seemed 
most pertinent to what is usually called “tone of 
voice.” These are parts of spoken language as 
much as other sounds, even though they are not 
generally included in the teaching of grammar 
and style. The elements we coded may be grouped 
under Linguistic and Paralinguistic phenomena. 
The Linguistic phenomena were coded in terms of 
phonemes, which are speech sounds grouped 
together so that despite local variation, each 
group has the same meaning to all native speakers 
of the language. Linguistic methodology makes 


2 The authors wish to extend their thanks to G. L. 
Trager for the painstaking work which he undertook 
both in teaching us the techniques and in submitting 
them to this analysis. The responsibility for this report, 
however, is entirely ours. 
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use of the psychological discontinuity between 
the phonemic groupings; a  same-or-different 
criterion enables linguists to analyze and code 
languages in terms of phonemes. The Paralin- 
guistic phenomena are superimposed upon the 
phonemes of speech. They include very diverse 
phenomena, only some of which lend themselves 
to discrete coding. Others are not discriminated 
into discrete groups, but have the character of 
continua. 

Those linguistic phenomena that were coded in 
this study are juncture, stress, and pitch: 


A, Junctures are dividing points in speech by which 
“clauses” are separated from one another. The word 
clauses is put in quotes, since these may be quite 
unrelated to clauses as defined by traditional grammar. 
There are three types of junctures, defined by the 
amount of lengthening or stretching of the material 
beginning with the primary stress, and by the pitch 
phenomena at the very end of the clause. 

B. Stress refers to the pattern of increase and 
decrease in loudness within clauses which is additional 
to the accents on different syllables of multisyllabic 
words. There are four degrees of stress in English. Each 
clause must have one and only one primary stress, and 
may have one or more secondary, tertiary, or weak 
stresses, depending on the length of the clause and the 
meaning to be conveyed, 

C. Pitch refers to the rise and fall of fundamental 
frequency of the speaker’s voice within a clause. There 
are four pitch levels in English. Most clauses have 
three pitches although some short ones have only two, 
and a small minority up to four. The three usual 
pitches occur at the beginning, at the primary stress, 
and at the end. Pitch changes in the juncture definitions 
mentioned above are fine pitch changes superimposed 
upon the end pitch for a clause. 


The Paralinguistic phenomena are grouped 
under vocalizations, voice quality, and voice set: 


A. cpa qn may be of three types: 

+ Voci aracterizers: laughii crying, voii 
taking ghing, crying, voice 

2. Vocal Segregates: sounds other than “words” 
which have specific communicative value, such 
as “um-hmm,” “huh?,” and the like. 

3. Vocal Qualifiers: extra increase or decrease in 
loudness, pitch, and duration beyond what are 
needed to convey juncture, pitch, and stress 
patterns. 

B. Voice Quality carries baseline information about 
an individual’s speech such as tempo, rhythm, 
precision (or sloppiness) of articulation, breathiness, 
register range, intensity range, rasp and openness, 
nasality, and resonance. Changes in these qualities 
can be indicators of the effects of current situational 
factors on the usual speech of an individual. 

C. Voice Sei refers to physiological characteristics 
current in the speaker—fatigue, immaturity, and 
the like. 
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METHOD 


Sample 


The speech samples used for the present study were | 
excerpts from six interviews with a single patient and 
excerpts from the recording of a radio program. The 
interviews were chosen for their variability in emo- 
tional expression, including direct anger at the psy- 
chotherapist, anger about an incident in the past, 
anxiety in relation to the psychotherapist, panic about 
an episode of “crazy” acting out, a calm discussion of 
vocational plans, and a discussion of a novel in an 
interview conducted by another person. In addition to 
the interviews with this one patient, we also used 
excerpts from one session of the NBC radio program 
Conversation, in which Clifton Fadiman and three 
guests talked in a completely unrehearsed situation 
about a topic of general interest. This conversation 
seemed to be an enjoyable “thinking out loud,” with | 
the normal tensions of discussion among well-informed 
people and of the knowledge of the broadcast situation. 


Procedure 


Excerpts of about 3-minutes’ duration from each 
of the interviews were typed and coded for the Lin- 
guistic and Paralinguistic phenomena listed above. 
Six excerpts from four of the interviews were coded 
again after a period of 6 months for reliability study. 
Coding was always done by a group of three coders. | 
While this procedure is subject to the influence of any 
dominant member of the coding group, it does serve 
to insure constant alertness to the task, which is at 
best a tedious one, approximately an hour being re- 
quired to code one minute of speech material in this 
detail. 

Reliability was checked by comparing the codings 
of the judges at the two different times. For juncture | 
patterns all possible places where a juncture could 
occur (usually between words) were tabulated as to 
presence or absence of junctures, and if one was present, 
which one was coded. For stress and pitch patterns 
and for the Paralinguistic phenomena. a different unit 
was necessary for determining reliability. Tallies were 
made of those places where the codings were made at 
each session, and agreement between codings was 
tabulated. Results are reported in terms of percentages 
of agreement, 

For those phenomena which turned out to have 
high reliability, we next pursued the question of | 
validity, comparing the distribution of the different 
phenomena among the interview samples of different 
emotional expression. We also compared the distribu- 
tions of codings of the patient’s speech with those of 
the speakers in the excerpts of the radio program. 


RESULTS 


The Linguistic phenomena held up well in the 
reliability studies. On juncture patterns the 
agreement on the two of the codings done 6 
months apart was very high. The decision as t0 


* Kind permission has been given by the program's 
producer for the use of this material. 
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whether to code a juncture or not produced 93% 
agreement (the x? was 304.7 for 1 df). Where a 
juncture was coded both times there was agree- 
ment as to which juncture 87% of the time (the 
xi was 89.5 for 4 df). Complete agreements for 
pitch and stress were 76% and 87%, respectively. 

While the Linguistic phenomena proved to be 
reliably codeable, the Paralinguistic did not. 
Vocal Qualifiers such as overhigh and overlow 
pitch, overloudness and oversoftness fared less 
well in the reliability check than did Linguistic 
Phenomena, and there was great variability in 
agreement between interviews: for example, 
type of vocal qualifier (not counting differences in 
onset and cessation) produced agreements of 31% 
and 77% for the two samples recorded after a 6- 
months’ interval. 

For Voice Quality agreement percentages varied 
even more widely, both from one code to another 
and from one interview to another for the same 
code. Often a voice quality was noted at one coding 
and nothing at all was recorded at the other time. 
In other instances the label for the voice quality 
differed; this was not surprising since such clinically 
impressionistic terms as "petulance" had been 
used to describe what seemed to be recurring 
combinations of voice qualities. In the unusual 
instances when the same label was used in both 
Codings, the points of onset and cessation often 
differed. Comparison of the elements of voice 
quality which made up such overall descriptions 
as “petulance” resulted in different components 
for the two occasions of coding. Vocal Charac- 
terizers and Vocal Segregates were too infrequently 
occurring in the sample to be subjected to re- 
liability study. 

Each time the coding was done there was 
group consensus as to the description finally made. 
McQuown (1957) has claimed that settling differ- 
ences in coding by consensus establishes “veri- 
fiability" of the techniques. “Without such a 
demonstration of methodological reliability, any 
Steps in the characterization [of persons on the 
basis of the coding]—will run the risk of being 
unconvincing” (p. 80). We agree that group 
Consensus is necessary for such complex coding, 
but group consensus is not a test of reliability, 
and the consensus itself must be subjected to 
reliability check. 

The Linguistic phenomena, juncture, pitch, 
and stress, which proved reliable enough for 
further investigation, were next subjected to 
Validity comparisons across the samples of the 
Interviews, and between the samples of the patient 
Speech and those of the conversationalists recorded 
from the radio program. Our most thorough efforts 
to find any relation between the variety of emo- 
tional expression and the kinds of profiles of 
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Linguistic phenomena coded met with failure. 
'There was no tendency, for example, for any 
pattern of intonations and junctures to appear 
more frequently with anger than with calmness. 
Similarly, judgments of “appropriateness” of 
these patterns were unrelated to the widely differ- 
ent affects expressed in the interviews, Finally, 
the participants of the radio program could 
neither be differentiated from each other nor from 
the patient by the Linguistic codings. 


Discussion 


Our studies indicate that the Linguistic pat- 
terns (juncture, stress, and pitch) can be described 
reliably with presently available coding techniques, 
but that these aspects of speech probably have 
little psychological relevance. By contrast, the 
Paralinguistic phenomena (vocalizations, voice 
quality, and voice set) presumably have higher 
psychological relevance, but cannot be coded 
reliably. Our explanation for these findings is 
that the methods developed in traditional lin- 
guistic analysis may not be applicable to the 
analysis of emotional expression, not because of 
deficiencies in the field of linguistics, but because 
of fundamental differences in the nature of lan- 
guage and emotional expression. 

The difference lies in the fact that language is 
made up of discrete elements, while emotional 
communication in speech consists of continuous 
phenomena that have more complex interrelations. 
The elements of any language are usually de- 
scribed as phonemes. Under the definition given 
previously, these elements are seen as either 
present or absent, and may be coded as such by 
the linguist. Phonemes are put together accord- 
ing to a long, but finite, series of rules into words, 
and words in turn into more complete structures 
to convey cognitive messages. While there are 
many words which have different “meanings” in 
different contexts, still all the elements of lan- 
guage, and to a large extent the meanings they 
convey, are discrete, and lend themselves to 
codings of Present or Absent. 

Emotional aspects of speech, on the other hand, 
are different from the cognitive aspects. They have 
not yet been so completely studied that we can 
tell if they are describable in Present-Absent 
terms: it may be that emotional communication 
is always present in speech. Our experience does 
show, however, that where these emotional 
factors occur, they are best described as continua. 
Seen in this way, coding techniques taken from 
linguistics are inappropriate to the analysis of emo- 
tional factors in speech. Similarly the search for 
elements of emotional expression, analogous to 
phonemes in classical linguistics, may be just as 
inappropriate, or so time consuming as to defeat its 
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own purpose. Global judgments of emotionality in 
speech where cognitive messages are filtered out 
electronically, as used by Starkweather (1956a, 
1956b), may prove useful in the analysis of inter- 
views long before the elements which form the 
basis for these judgments are understood. In de- 
veloping methods for such judgments, the tech- 
niques designed for scaling data which fall on 
continua seem more appropriate than does the 
search for discrete elements. Trager (1958) has 
recognized the need for scaling in his most recent 
formulation by assigning degrees to vocal qualifiers 
(p. 6). We may hope that linguists, who have ex- 
tensive experience in listening carefully to speech, 
will contribute to the application of scaling tech- 
niques to this purpose, and may, in the course of 
this work, find fruitful avenues for further lin- 
guistic analysis of emotionality in speech. 


SuMMARY 


To test the applicability of linguistic techniques 
to the analysis of emotional expression in speech, 
excerpts of six psychotherapeutic interviews of 
markedly different emotional tone and excerpts 
from an unrehearsed conversation recorded from a 
radio program were studied. These excerpts were 
coded on those aspects of the linguistic system 
developed by Trager and his associates which 
were presumed to have relevance to “tone of 
voice”: the Linguistic phenomena of juncture, 
pitch, and stress, and the Paralinguistic phenom- 
ena of vocalizations, voice quality, and voice set. 
Coding was done by three coders working as a 
group, and a sample of excerpts was selected for re- 
coding by the same group 6 months later for relia- 
bility check, The Linguistic phenomena proved to 
be highly reliable, while the Paralinguistic did not, 
but the reliable Linguistic codings were independ- 
ent of the emotional expression in the different 
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interviews and of the individual differences among 
the speakers represented by the interviews and 
the radio program. Explanation for these results 
was sought in the difference between language 
proper and emotional expression as communicated 
by vocal means other than language. 
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2 VALUES DESTROYED BY DEATH 


v JAMES C. DIGGORY! ann DOREEN Z. ROTHMAN? 
fy University of Pennsylvania. 


RE are two questions about fear of death. 


a First, does everyone always fear death more 


than anything else? Second, exactly what is it 


about death that is feared? 


in answer to the first question, expressions of 
attitude and overt acts often indicate that other 
‘things are more feared than death. College students 


"reported they feared snakes, cancer, and death of 


loved ones more than their own death (Means, 
1936); some children do not fear death very 
acutely, perhaps because they believe its probabil- 
ity is low (Schilder & Wechsler, 1934); men risk 
their lives or lose them to save others or to advance 
some cause; in the reality or prospect of grievous 
loss, some commit suicide; martyrs have expired 
to retain their principles or to gain some future 
reward; and some aged persons are said to long 
for death (Challaye, 1924). So it seems that death 
is not always the object of everyone’s greatest fear. 

The answer to the first question increases the 


; importance of the second: knowing what, specifi- 
cally, is feared about death. First, some ways of 


dying are more feared than others, according to 


- Means (1936) and some unpublished data of our 
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own. We do not know yet why this is so, but the 
fact that it is so suffices for the present discussion. 
Second, suicide often follows receipt of an insult 
or loss of status so extreme as to make it im- 
possible for the person to function comfortably or 
effectively in the future (Bunzel, 1937; Malinow- 
ski, 1926; Mead, 1935). Though his skills and 
abilities may be undamaged, he has lost the op- 
Portunity to use them. Suicide also occurs when 
the person’s ability or skill has deteriorated to the 
point where he cannot function even when he has 
the opportunity (Freuchen, 1931). Whether a 
Person chooses to die because death attracts him 
or because he fears it less than other alternatives 
is of no consequence for the present argument. 
What interests us is that reduction in fear of 
ee may be associated with loss of utility of the 

Following the value theories of Dewey (1939) 
and von Ehrenfels (cf. Eaton, 1930), we assume 


1 The senior author wishes to thank the University 
of Pennsylvania Committee on the Advancement of 
ch for a grant which provided opportunity for 
analysis of the data reported here. 
.. * Several people, in addition to the authors, assisted 
in the collection or analysis of these data: John Ballogh, 


- W. Douglas Bradley, Raya Eckron, Nancy Miller, 


and Ronald Ries. 


that utility of the self corresponds to the proba- 
bility that a person, by his own efforts, can achieve 
objectives that are important to him. The larger 
the number of important objectives for which 
one's probability of achievement is high, the 
greater his self-esteem. Loss of ability or skill 
reduces probability of achievement, and with it, 
self-esteem. Elimination of opportunity for the 
exercise of skills or abilities makes them, in effect, 
worthless, even though they are undamaged. We 
try to preserve or extend objects we value highly, 
but those of low value are treated with indifference 
or destroyed. Thus, a person who values himself 
highly should be more afraid of death than one 
whose self-esteem is low, because death is the 
limiting case of loss or destruction of the self. The 
specific goals important to one person may differ 
from those important to another. We may value 
ourselves because we have achieved objectives im- 
portant only to us or because we take pride in our 
ability to help others gain their ends. But regardless 
of the specific content of his goals, what one would 
lament most about his own death is loss of the 
specific activities in which he is most involved or 
feels to be most important. To the extent that the 
goals a person values highly depend on his social 
status, his fear of various consequences of his own 
death should vary with his status or role, whether 
defined by age, sex, social class, religion, or marital 
condition. 


PROCEDURE 


With these considerations in mind, we developed 

a list of “consequences of one’s own death”: 

. I could no longer have any experiences. 

Iam uncertain as to what might happen to me 
if there is a life after death. 

Iam afraid of what might happen to my body 
after death. 

. I could no longer care for my dependents. 

My death would cause grief to my relatives and 
friends. 

All my plans and projects would come to an end. 
. The process of dying might be painful. 

These were arranged for paired comparisons and 
included as part of a larger questionnaire on various 
aspects of attitudes toward death. Respondents in- 
dicated the member of each pair that they regarded 
as worse or more distasteful than the other. 

In selecting respondents we made no systematic 
attempt to draw a sample representing the general 
population, nor did we use a strictly random method. 
The questionnaires were handed to groups or individ- 
uals wherever we had opportunity: in college and 


ou HU Oo pe 


205 


206 


high school classrooms, offices, social clubs, fire houses, 
and union halls. Many respondents are the older or 
younger relatives of students whom we asked for help. 
About two-thirds of the questionnaires were returned, 
including 563 usable responses dealt with in this paper. 
Figure 2 gives the number of respondents in each 
category. 

Respondents are classed by: Age in years (15-19, 
20-24, 25-39, 40-54, and >55); Sex, Marital status 
(Single, Engaged, Married, Widowed, or Divorced); 
Religious affiliation (Protestant, Catholic, Jewish, 
or Other-None); and Social class (upper, middle, or 
lower). Social class was judged by combining reported 
amount and source of annual income and the general 
educational level of the family. 


RESULTS 


The curves in Figure 1 were arrived at by the 
following process. First, from the paired compari- 
sons data a ranking of the seven items was deter- 
mined for each respondent, with the highest rank 
(1) assigned to the most frequently chosen, and 
lowest rank (7) to the least frequently chosen item. 
Ties were resolved by assigning the average of 
the appropriate ranks to each of the tied items. 
Second, frequency distribution tables were con- 
structed showing the number of respondents in a 
given category who assigned a particular rank to 
any item. From these tables we constructed tables 
of the cumulative proportions of respondents 
assigning an item to a given rank or higher. The 
curves in Figure 1 are plotted directly from these 
cumulative proportion tables. The figure shows 
how we derived the median rank of each item for 
all respondents. The rank scale on the abscissa 
of Figure 1 is reproduced in Figure 2A. The other 
scales in Figure 2 were derived in the same way. 

The distribution of cumulative proportions 
permitted the use of the Kolmogorov-Smirnov 
test for significance of differences among the 
various respondent categories (Siegel, 1956). 
Thus, for any consequence of death, we can find 
whether the sexes differ significantly in the median 
rank they assign it, and similarly for the social 
class, age, marital status, or religious categories, 
Hereafter, a “significant difference” is understood 
to mean rejection of the null hypothesis of the 
Kolmogorov-Smirnov test at $ < 05. The meanings 
of such differences in terms of the median rank of 
the items can be determined from Figure 2. 

Dissolution of the body. Item C (“I am afraid 
of what might happen to my body after death.”), 
is a good “anchor point” since it is always at the 
lowest, least avoided rank. No other item holds 
its rank so consistently in all comparisons. As a 
matter of interest, note- that women fear this 
consequence of death significantly more than 
men do. This may be related to the fact that 
women value themselves more in terms of their 
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physical attractiveness than do men (Cohen, 
1955, pp. 137-147). 


longer care for my dependents.”), and F (“All 
my plans and projects would come to an end.”) 
are the most relevant to our main hypothesis, 
Note that A and F are egocentric as compared 
with D, and they are less clearly related to any 
special role. Thus, it is D, rather than A or F, 
to which we must look for support of our major 
hypothesis, and the hypothesis is supported. The 
importance assigned to being able to care for 
dependents obviously requires that one has de- 
pendents or is committed to acquire them. Thus, 
the progression in marital status: Single, Engaged, 
Married, should be accompanied by a marked 
increase in fear for this item. Since this progression 
is also a function of age, Item D should increase 
in importance with increasing age. Also, the sex 
of the respondents should determine the im- 
portance of ability to care for dependents because, 
in our culture, the modal adult male has dependents 
while the female is a dependent. Figure 2—B, C, 
and F—shows these expectations fully met, In the 
age category the mean rank of Item D is signifi- 
cantly higher at 40-54 years than at 15-19 years! 
it is significantly higher for men than for women, 
and significantly higher for the married group 
than for the single group. 

Items A and F are usually near the top of the 
scale in all categories of respondents. Moreover, 
they tend to vary together, only twice being more 
than one rank apart (cf. Catholic and Widowed 
groups). The least fear of either of them is among 
Roman Catholics. They are feared most in the 
Other-None religious group. Outside the religious 
categories the only significant difference is that 
Item F is more feared by single than by marri 
persons. This makes sense because married people’s 
plans are not completely egocentric, and are 
likely to be subsumed under caring for dependents. 
Clearly, differences in religious affiliation produce 
more variation in the fear of Items A and F than 
any other category. All differences among the 
religious groups are significant for Item A. The 
Other-None group regards F as significantly more 
fearful than do Catholics or Jews. 

That Items A and F are generally high in the 
tankings is consistent with our view that actively 
getting experiences and executing plans are values 
the loss of which by death would be keenly felt 
because of their importance to self-esteem. 

The fact that loss of ability to have experiences 


* The failure of the significance test between 15-19 
years and >55 years is doubtless due to the si 
number of cases in the latter category. 


The purposive activity items. A ("I could m| 
longer have any experiences.”), D (“I could m 
ied | 
mall 
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all 563 respondents assigning each item to a given, or higher, rank. (The 


item key is: A, I could no longer have any experiences; B, I am afraid of what might happen to me if there is a 


life after death; C, I am afraid of what might happen to my body after death; D, I would no longer be able to 
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My death would cause grief to my relatives and friends; F, All my plans and projects 


care f. d dents; E, issa i: in Fi, 
Te xor mydepene en dying might be painful. The scale on the abscissa is reproduced in Figure 
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A. ALL RESPONDENTS (N=563) 


B. AGE 
15-19  (N*197) 
20-24 (N=143) 
25-39 (N=120) 


40-54 (N= 85) 
l 2 3 4 5 6 EO eso 
C. SEX 
© OOO. © © 
= MEN (N=228) 
©OQDO © C 
l 2 3 4 5 $ + WOMEN (N -335) 
D. RELIGION 
© 


PROT. (N=254) 
CATH. (N=135) 
JEW. (N=133) 


UPPER (N=117) 
MIDDLE (N=364) 


LOWER (N= 82) 


SINGLE (N=355) 
ENGAG. (N= 29) 
MARR. (Ns170) 


WID. (N= 14) 


| 2 3 4 5 6 7 DIVOR. (N= 9) 
RANKS 


Fic. 2. Median scale positions assigned to seven consequences of one’s own death by respondents in various 


categories. (The item key is the same as in Figure 1.) 
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(Item A) is important for the religious groups in. 


the order Jewish > Protestant > Catholic prob- 
ably reflects differences in their beliefs that death 
is the end of experience. For Protestants and Jews 
there are relatively few specific prescribed actions 
to increase the probability of happiness in the 
hereafter, but there are many such actions pre- 
scribed for Catholics. This difference may mean 
that life after death is more real to Catholics than 
to Jews or Protestants. Realism in this sense 
corresponds to Mahler's (1933) operational def- 
initions of levels of reality in tasks: real tasks 
demand and permit concrete manipulations; but 
unreal ones allow only thinking about the problem 
solving operations. If this distinction is correctly 
applied, it is likely that Catholics are less fearful 
of the loss of experiences by death simply because 
they do not believe that such a loss is possible. 

Possible pain of dying. Item G (“The process of 
dying might be painful"), also varies in median 
rank as a function of religious affiliation, being 
least feared by Protestants, more so by Catholics, 
and most by Jews. The position of this item also 
changes significantly with social class, more 
feared by upper and middle than by lower class 
respondents. Also, there is a sex-difference: women 
fear the possible pain of dying more than men do. 

Fear of the future. Item B (“I am uncertain as 
to what might happen to me if there is life after 
death.”), progresses from greatest to least among 
the religious groups Catholic > Protestant > 
Jewish > Other-None. Like the differences in 
Item A discussed above, this doubtless reflects 
differences in the concreteness of beliefs about 
life after death. The importance of this item also 
varies with social class: lower > middle > upper. 

Affective consequences of one’s own death for 
others. Item E (“My death would cause grief to my 
relatives and friends.”), is less important for the 
Other-None than for any other religious group. 
The item is also considerably more fearful to 
upper and middle than to lower class respondents, 
a somewhat surprising finding in view of the 
reputed emphasis in lower class groups upon 
dependence on, and satisfaction in, primary group 
relations (Bellin & Riessman, 1949; Rosenberg, 
1949; Young & Rosenberg, 1949). However, it 
is entirely possible that the majority of our “ower 
class” respondents are actually marginal with 
respect to class membership and espousal of norms. 


CoNCLUSIONS 


The general picture provided by these data is 
that the concomitants or consequences of his own 
death which a person fears most depend on the 
role he has or expects to have, and therefore, on 


the goals to which he is committed. The items in 
this study most relevant to goal striving are D 
(caring for dependents), A (having experiences), 
and F (completing plans and projects), the latter 
two being relatively egocentric compared to the 
former. The other items are related to ways the 
person may be passively affected by his own death: 
B (his fate in a life after death), C (the fate of his 
body after death), G (he might experience pain 
while dying), or E (the emotional impact of his 
death on others). 

The important factors associated with differ- 
ences in rank positions of these various items are 
(a) one’s adherence to a set of goals defined by 
his role and (6) his devotion (or lack of it) to a 
religious or social class ideology. 

Our hypothesis, that a person fears death 
because it eliminates his opportunity to pursue 
goals important to his self-esteem, is supported 
by the following: fear that one can no longer care 
for dependents varies systematically with roles 
defined by marital status, sex, and age; the pur- 
posive items of having experiences and completing 
one’s own projects are consistently near the high 
end of the fear scale, except for people who may 
be assumed to believe that death is not the end of 
experience. 
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THE RELATIONSHIP OF EXTRAVERSION-INTROVERSION TO VISUAL 
AND KINESTHETIC AFTEREFFECTS 


KATHRYN J. NORCROSS, RONALD S. LIPMAN, ANp HERMAN H. SPITZ 
Edward R. Johnstone Training and Research Center, Bordentown, New Jersey 


YSENCK (1955, 1956) has hypothesized a 
E relationship between the personality dimen- 
sion of extraversion-introversion and the constructs 
of neural satiation and reactive inhibition (Ip). 
In testing his hypotheses with respect to satiation, 
Eysenck (1955) compared the kinesthetic after- 
effect (KAE) scores of hysterics with dysthymics, 
populations presumably differing in extraversion. 
He concluded that the results supported his 
prediction that aftereffects would develop quicker, 
appear more strongly, and persist longer in 
hysterics than in dysthymics. Rechtschaffen 
(1958), on the other hand, found no significant 
correlation between scores on Guilford’s Rha- 
thymia (extraversion) scale and visual figural 
aftereffect (VAE) scores of college students. Since 
Spitz and Lipman (1960) failed to find a significant 
intercorrelation of individual differences in KAE 
and VAE scores, the use of satiation measures 
obtained from different sense modalities might 
lead to different results. However, Rechtschaffen 
(Rechtschaffen & Bookbinder, 1960) has since 
replicated his experiment, using a KAE measure, 
and again reports negative results. This note is a 
further examination of the relationship between 
extraversion-introversion scores and both the 
VAE and KAE scores obtained by Spitz and 
Lipman (1960). 

A complete description of the VAE and KAE 
apparatus and procedure is given in the original 
Spitz and Lipman article and will not be repeated. 
Their experimental design included two groups 
whose treatments differed only in respect to the 
sequence of tasks. (Group I was administered the 
VAE first, followed by the KAE, while Group IT 
was administered the KAE first, followed by the 
VAE.) 


1 Now at Psychology Department, Western Reserve 
University, Cleveland, Ohio. 


The following measures were available from 
both groups: the rate of development of visual 
satiation effects, total amount of visual satiation, 
total amount of kinesthetic satiation, rate of dis- 
sipation (Spitz, 1959) of kinesthetic satiation, and 
rate of dissipation of visual satiation. 

Extraversion (E) scores obtained from the 
Maudsley Personality Inventory were available 
for 127 of the subjects? According to Eysenck’s 
hypotheses, E scores should be positively correl- 
ated with rate of development and total amount 
of satiation but negatively correlated with rate of 
dissipation. These hypotheses were tested by 
calculation of Pearson r's. Since the E scores for 
females were significantly higher than for males, 
r’s were figured separately for males and females. 
The possible combination of sex and treatment 
sequences (Group I or Group II) thus yielded 
four correlations between E scores and the five 
aftereffect measures listed above. Of these 20 
correlations, one was significant and in the direc- 
tion predicted by Eysenck. This was a correlation 
of .29 (p < .05) between E scores of females in 
Group I and their total amount of visual satiation. 
This is not a large correlation, and 1 significant 


? Appreciation is expressed to the Psychology De- 
partment faculty at Trenton State College, Trenton, 
New Jersey, for their assistance in the collection of 
Maudsley Personality Inventory scores. 

3 Although the prediction would seem to follow 
that females should show greater figural aftereffects 
than males, just the opposite finding has been re- 
ported (Rechtschaffen & Bookbinder, 1960; Spitz 
& Lipman, 1960). However, since Eysenck’s hypothesis 
concerns an inherited causal factor for extraverted 
behavior and since it seems likely that social learning 
could bring about sex differences in extraversion, it 
was decided to determine r keeping the distributions 
separate for males and females. 
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correlation out of 20 could readily be due to chance 
(Sakoda, Cohen, & Beall, 1954). 

In summary, these results confirm those pre- 
viously reported by Rechtschaffen and fail to 
confirm Eysenck's hypothesis that satiation effects 
in extraverts develop more quickly, reach higher 
levels, and dissipate more slowly than in intro- 
verts. 
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DOGMATISM AND ANXIETY IN RELATION TO PROBLEM SOLVING: 
AN EXTENSION OF ROKEACH’S RESULTS 


SAMUEL FILLENBAUM 
University of North Carolina 


F a recent book (Rokeach, 1960) considerable 
evidence has been presented with regard to in- 
dividual differences in the openness-closedness of 
belief systems. In particular, the behavior of open 
(undogmatic) and closed (dogmatic) persons was 
observed in laboratory situations that required the 
subject to cope with new conceptual systems con- 
tradictory to those of everyday life. Generally, con- 
sistent differences were found between subjects 
scoring high and low on a dogmatism scale. The 
latter subjects showed considerably less difficulty 
in integrating and synthesizing beliefs at variance 
with the assumptions of everyday life into new 
systems, and were able to cope with the problem 
more adequately, and solve it more rapidly. 

This study consists of a replication and exten- 
sion of part of Rokeach’s work, in two respects: 

1. The relation between dogmatism and problem 
solving behavior is examined over the whole range 
of scores on dogmatism rather than with extreme 
groups only as was Rokeach’s procedure. 

2. The relationship between dogmatism and 
anxiety is examined explicitly and directly. 
Rokeach has presented data indicating a consistent 
relationship (product-moment r's ranging from 
-36 to .64) between openness-closedness in belief 
systems, as measured by his Dogmatism scale, and 
anxiety, as measured by the use of an MMPI based 
scale (Welsh, 1952). One possible interpretation of 
the differences in problem solving behavior of sub- 
jects scoring high and low on dogmatism might be 
in terms of difference in anxiety, since there is evi- 
dence for some relation between anxiety and be- 
havior in complex tasks (e.g., Romanow, 1958). 
Rokeach presents no direct evidence on the rela- 
tion between anxiety and behavior in the problem 
solving task at interest. 


METHOD 


The Rokeach Dogmatism (D) scale (Form E com- 
prising 40 items) was administered to all 73 students 
in the summer introductory psychology sections at the 
University of North Carolina. Subjects were selected 
in such fashion as to get D scores covering the whole 
range. The 10 subjects with highest D Scores and the 
10 subjects with lowest D scores were selected as well 
as 29 subjects with intermediate Scores, these subjects 
being chosen at random from each of the intermediate 
score intervals.! All 49 subjects were run in the Denny 
Doodlebug (DD) problem (Rokeach, 1960), and for 42 


1 Actually 50 subjects were chosen, but the data 
for one of these had to be discarded. 
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ARNOLD JACKMAN 
University of Kentucky 


of these subjects scores were also obtained on a gen- 
eralized anxiety (A) scale derived from a factor analysis 
of MMPI responses (Welsh, 1956). The subjects had 
No reason to suspect any relation or connection between 
the various procedures, since the D and A scales were 
administered by their instructor at different times 
during the semester among a variety of other tasks and 
procedures, and subjects were solicited for the DD 
problem being told only that they had been chosen at 
random to serve in a problem solving experiment. 

The DD problem is one where the subject must be 
willing to adopt temporarily and cope with a set of 
rules organized into a miniature belief system at odds 
with the standard operating procedures of everyday 
life. The subject must discover how Joe Doodlebug, 
who can move only according to a set of restrictive 
conditions, can get to some food in exactly four jumps. 
Essentially the same procedure was followed as that 
used by Rokeach.? 

The total time allowed was 40 minutes. The follow- 
ing constituted some amplification of the procedure used 
by Rokeach: if the subject did not work out loud he 
was urged to do so; if he offered a false solution he was 
told that this was not satisfactory and referred back 
to the conditions of the problem; if he asked whether 
or not a specific procedure or kind of move was legiti- 
mate, he was told. From time to time the subject was 
urged to look again at the conditions of the problem, 
and it was emphasized that four jumps were necessary, 
no more and no less. 

For the first 10 minutes the subject was allowed to 
work continuously and if he overcame any of the three 
inappropriate beliefs, the time was recorded; record 
was kept of the time at which the various beliefs were 
overcome or hints provided. Hints were given, if 
needed, at the end of 10, 15, and 20 minutes, respec- 
tively, the hint given depending upon which belief, if 
any, the subject had previously overcome by himself; 
generally the facing belicf was given first (“Joe does not 
have to face the food in order to eat it.”), the direction 
belief second (‘Joe can jump sideways and backwards 
as well as forward.”), and the movement belief third 
(“Joe was moving east when the food was presented."). 
The general procedure was such that if a subject 
overcame one belief on his own in the first 10 minutes, 
he was given the second hint at 10 minutes, and the 
third hint at 15 minutes; if he overcame two beliefs 
within the first 15 minutes he was given the third hint 
at 15 minutes, and so on. The hints were presented 
typed on cards, the subject being allowed to read the 
hint and look at the card for 10 seconds or so before 
it was turned over. 

In the analysis of data from the DD problem two 
measures were used: total time to solution of problem 
(T) and time from breaking of last inappropriate belief, 


* See Rokeach (1960, pp. 186-187) for a description 
of the problem and the instructions given the subject. 
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either through the subject’s own action or as a con- 
sequence of being given hints, to solution of problem 
(Ts). Earlier work has shown that there are no differ- 
ences as a function of D scores in the analytic phase of 
the search for solution (requiring breakdown of the in- 
appropriate beliefs) so that the first measure not only 
provides an over-all index to performance in this task, 
but differences in this measure may also be regarded 
as indicative of differences in time needed to synthesize 
new beliefs into a working system. The second measure 
provides a more direct index of the time needed, after 
the old conventional approaches have been discarded, 
to integrate and synthesize the new beliefs into a 
working system that permits solution of this novel 
problem. 

"The A scale is composed of 39 MMPI items and 
was scored according to instructions (Welsh, 1956). 
Fach of the 40 D-scale items could be marked on a 
six-point scale going from +3 (I agree very much) to 
—3 (I disagree very much). For scoring purposes this 
scale was converted to a 1-7 scale by adding a constant 
of four to each item. A subject’s score was the sum of 
his scores over all items. 


RESULTS AND DISCUSSION 


Consider first some ways in which the results 
replicate earlier findings. 

For the total pool of subjects the mean score on 
Dogmatism was 142.1 (SD = 25.3); for the 49 
subjects run in the experiment the mean score on 
Dogmatism was 140.8 (SD = 28.8). Thus the 
group used was somewhat more variable than the 
total group since all subjects at the extremes but 
not all with intermediate scores were included. 
These figures are quite comparable with some re- 
ported by Rokeach for a number of different groups 
of American college students, the reported means 
tange from 141.3 to 143.8 and the standard devia- 
tions range from 22.1 to 28.2. 

Contrast the performance on the DD problem 
of the 10 subjects scoring highest (X = 179.1) and 
the 10 subjects scoring lowest (X = 99,5) on the 
Dogmatism scale. Consider first the total time 
measure (T). Mean total time to solution for the 
10 highest scorers was 27.25 minutes, for the 10 
lowest scorers this was 18.25 minutes, a mean 
difference of 9 minutes. Upon test this yields a ¢ 
of 2.33 (p < .01, one-tailed); since the variances of 
these groups are rather different a nonparametric 
analysis was also performed; using the Mann- 
Whitney U test this yielded a z of 1.81 (p < .05, 
one-tailed). With regard to the second measure, 
time from breaking of last inappropriate belief to 
solution (Ts), the 10 subjects highest on Dogma- 
tism took an average of 15.0 minutes while the 
corresponding figure for the 10 subjects lowest on 
Dogmatism was 4.4 minutes. Test of the difference 
yields a ż of 2.88 (p < -01, one-tailed) and, us- 
ing the Mann-Whitney U test, a z of 1.89 ($ < 
05, one-tailed). Clearly the result is much the same 
whichever time measure is used and is consistent 
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with previous results. In this task subjects low on 
Dogmatism perform more efficiently than those high 
on Dogmatism. 

In the present study a product-moment corre- 
lation of .49 (p < .01) was found between D scores 
and A scores, consistent with Rokeach’s earlier 
results. 

As for new findings of the present study, the 
correlation between D scores and performance in 
the DD problem was .37 (p « .01) if total time 
(T) scores are considered, and .41 (p < .01) if 
synthesis time (Ts), from breaking of last inap- 
propriate belief to solution, is used as the time 
measure. While these findings indicate a significant 
correlation between D scores and performance in 
the DD problem it may still be largely accounted 
for| by the subjects with extreme scores on D. 
To test this possibility the following analysis was 
performed: the 10 subjects scoring highest and the 
10 subjects scoring lowest on D were removed from 
the sample and the D X T and D X Ts correla- 
tions were recomputed on the remaining middle 29 
cases, the values of these coefficients then being ad- 
justed for constriction in variability on D. If in- 
deed there is no relation over the middle range of 
D values between D and performance on the DD 
problem then these values should be considerably 
lower than the ones given above. If the values are 
close to those obtained when correlations were 
computed for all 49 subjects then there is some rea- 
son to think that the association between D and 
performance on the DD problem holds over the 
range of D values. For the middle 29 cases on D 
the correlation between D and T was found to be 
25 and the correlation between D and Ts was 
found to be .20. When these coefficients were cor- 
rected for the curtailment in variability on D they 
took values of .44 and .37, respectively, very close 
to those obtained when the full sample of 49 sub- 
jects was used. Consequently there is some evi- 
dence that the correlation between D and per- 
formance on the DD problem, while by no means 
very high, does hold over the range of D values. 

The correlation between generalized anxiety (A) 
scores and T scores was found to be .02; that be- 
tween A scores and Ts scores was .12.4 On this 
basis it is clear that there is no relation between 
generalized anxiety and performance on the DD 
problem. 


3]t should be remembered that in some respects 
the anxiety scale used by Rokeach differs from that 
used in this study. His was comprised of 30 MMPI- 
derived items each answered on a seven-point agree- 
disagree scale while our A measure is based on 39 
MMPI items each answered true or false. 

4Scores on a repression dimension (based on a set 
of MMPI items, see Welsh, 1956) had been obtained 
at the same time as the A scores. ‘These also turned 
out to be unrelated to performance on the DD task. 
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Some evidence has thus been presented that the 

relation between dogmatism and performance in a 
task requiring the subject to synthesize and or- 
ganize a new set of beliefs that differ from the 
standard operating procedures of every day life 
holds not only for extreme scores on dogmatism, 
but also over the range of scores; the more closed- 
minded the person the more trouble he has on such 
a task. This result has interest in that it extends 
earlier work that provided evidence only with re- 
gard to the test performance of very open- and 
closed-minded people. Second, it seems clear that 
even though there is a definite relation (r = .49) 
between D scores and A scores, the relation be- 
tween dogmatism and performance in the DD 
problem cannot be interpreted in terms of differ- 
ences with regard to a generalized anxiety variable 
(for partialing out A hardly affects at all the D X 
T, or D X Ts correlations). On the one hand this 
result may be regarded as partly consistent with 
Rokeach’s (1960) position since he supposes that 
the degree to which a person’s belief system is 
open or closed (as indexed by score on the Dog- 
matism scale) depends upon the extent to which 
that “person can receive, evaluate and act on 
relevant information viewed from the outside on 
its own intrinsic merits...” (p. 57), and there is 
no reason to think that such ability to act on infor- 
mation on its own merits is only or largely condi- 
tioned by a person’s position on a generalized anx- 
iety dimension. On the other hand if it is the case 
that “to the extent that a belief-disbelief system is 
closed it represents a cognitive network of defenses 
against anxiety" (p. 347), and if the significant 
relation between D and A scores is taken as evi- 
dence in support of this Proposition, then one 
should perhaps expect some relation between gen- 
eralized anxiety and performance in a task requir- 
ing the subject to synthesize a new set of beliefs, 
at odds with those of everyday life, into a working 
system. Our negative results on this Point do not, 
of course, agree with the results of some studies 
that indicate a negative relation between anxiety 
and complex task performance, (though the litera- 
ture on this issue is by no means very consistent or 
clear cut). 

On the basis both of our results and of Rokeach’s, 
one point concerning performance on the DD 
problem should be stressed. Most of the differences 
between subjects in total time to solution are 
clearly attributable to differences in time to solu- 
tion after the last of the inappropriate beliefs has 
been broken, either through the subject’s own 
action, or, much more commonly, as a result of 
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being given the third of the hints. This result is 
open to two somewhat different interpretations: 


on this task may be attributed to individual differ- 
ences in the time taken to synthesize a set of new 


beliefs into a working system after discarding old | 


inappropriate beliefs, (the interpretation favored 
by Rokeach), or that the last belief (the movement 
belief) is the most difficult to overcome (in this 
study only two or three subjects overcame this 
belief by themselves) and that differences in solu- 
tion time mainly reflect differences in the ability to 
use the third hint. This last suggestion could 
easily be tested by reversing the order in which the 
hints are presented, giving the last one first. 


SUMMARY 


This study involved a replication and extension 
of some of Rokeach’s work on the relation of Dog- 
matism (D) and Anxiety (4) to performance in a 
problem solving task requiring the subject to re- 
ject a number of standard operating procedures of 
everyday life and integrate and synthesize a new 
set of beliefs into a working system, the Denny 
Doodlebug (DD) problem. Consistent with previ- 
ous results, subjects with very high D scores 
performed more poorly on the DD problem than 
subjects with very low D scores, and a significant 
positive relation (r = .49) was found between D 
scores and A scores. The relation between D scores 
and performance on the DD task held not only for 
extreme scores on D, but over the range of scores 
on D (r = .37 orr = 41 depending upon the par- 
ticular time measure used). A scores were unrelated 
to performance on the DD task. 

Some implications of these results were dis- 
cussed; and, in particular, question was raised as to 
the interpretation of performance scores on the DD 
task. 
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IDENTIFICATION PROCESSES IN HOSPITALIZED NARCOTIC DRUG 
ADDICTS! 


JACK J. MONROE ann ALEXANDER W. ASTIN? 
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HE reluctance on the part of many psycho- 

therapists to treat criminals, alcoholics, or 
other social deviants is well-known. Even those 
clinicians who do accept such individuals for treat- 
ment often report that therapeutic progress is hin- 
dered by what appear to be deeply ingrained 
identifications with some asocial or antisocial 
group. These same identifications are also cited 
frequently as accounting for the general ineffective- 
ness of other therapeutic or corrective measures in 
modifying the deviant behavior. 

The present investigation was initiated in order 
to study these identification processes in one so- 
cially deviant group, narcotic drug addicts. To be- 
gin with, we assumed that there existed among 
drug addicts a generalized tendency to identify 
with the addict- subculture. We labeled the ten- 
dency “Addict Identification" (ID). We next as- 
sumed that addicts differ with respect to the 
strength of ID, and that such differences, in turn, 
predict a wide range of addiction-related behavior. 
That is, we expected to account for many of the 
diagnostic and prognostic differences among ad- 
dicts in terms of how strongly each is identified 
with the addict subculture. 

_ Specifically, we postulated that high identifiers 
(ie., addicts with relatively strong ID), as com- 
pared with low identifiers, (a) are more accepting, 
of the way of life of the drug addict, including 
acceptance of the condition of addiction; (b) 
possess stronger asocial or socially rebellious tend- 
encies; (c) have encountered more past experiences 
which are conducive to learning a and b; and (d) 
are less able to develop satisfactory interpersonal 
relationships with members of middle class (non- 
addict) society. 

For the purposes of this study, we chose to define 
ID in terms of a discrepancy between the addict’s 
self-concept and his own stereotype of the addict 
group, i.e., “addicts-in-general.” In the terminology 
of Gage (1952), we would define as a high identifier 
any addict who has relatively high "assumed self- 
typicality.” 

Being able to estimate the degree of congruence 
between a self-concept and a group stereotype, 


1 Portions of this paper were presented in a sym- 
posium, Psychological Aspects of Drug Addiction, 
at the meeting of the American Psychological Asso- 
ciation, Cincinnati, 1959. 

2 Now with the National Merit Scholarship Cor- 
poration, Evanston, Illinois. 


however, tells us nothing about the nalure of the 
stereotype. Since, as with any discrepancy, similar 
scores may be obtained from quite different sets of 
measures, the meaning of ID may vary with the 
particular stereotype upon which it is based. Ac- 
cordingly, it was also decided to make available 
for each addict an estimate of how closely his 
stereotype of addicts-in-general agreed with a 
stereotype of addicts-in-general held by the psy- 
chiatric staff? The extent of this congruence, then, 
defined the “Valid Information” (VI) possessed by 
the addict regarding addicts-in-general.* 


METHOD 


The initial task was to develop measures of ID and 
VI, after which the four postulates regarding ID were 
to be tested by means of specific hypotheses. A pre- 
liminary 350-item true-false inventory was constructed 
so that the first 175 items were worded in the first 
person (e.g., “I like parties”), and the remaining 175 
(parallel) items were worded in the third person (e.g., 
“Most addicts like parties”). The criteria for selecting 
the basic set of 175 first-person items were similar to 
those employed in an earlier study (Monroe. & Hill, 
1958). In general, items contained content with known 
or suspected emotional significance for addicts, such 
as drug effects, interpersonal relationships, social 
conformity, frustration tolerance, etc. The 50 items 
from the MMPI Psychopathic Deviate (Pd) scale were 
also included in view of the marked tendency for addicts 
to obtain high scores on this scale (Astin, 1959). The 
parallel third-person items for each of these 175 first- 
person items were written by substituting “most ad- 
dicts” for the initial ^I" in the item. Any subsequent 
first-person references were replaced by appropriate 
third-person pronouns, e.g., the parallel sentence for 
“J Jove my father” would be “Most addicts love their 
fathers." 

The entire 350-item inventory was administered to 
100 hospitalized male addicts. The percentages of 
True responses for each of the 175 pairs of items were 
compared by means of ¢ ratios for correlated 
proportions. The 30 pairs of items which differentiated 


3]t was assumed that psychiatrically oriented 
nnel, by virtue of their training, nonmembership 
in the addict subculture, and especially their daily 
professional contact with drug addict patients, would 
be in a position to make objective and "valid" ob- 
servations about addicts-in-general: hence, the term, 
“valid information.” 
4Such a measure is analogous to Cronbach’s (1955) 
“stereotype accuracy," except that the staff's stereo- 
type, rather than the pooled self-description of the 
addicts, is used here as the norm or standard. 
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best between the first- and third-person versions were 
selected for the initial ID Scale. Any subject’s ID 
score (i.e., tendency to identify with his own stereotype 
of addicts), therefore, was defined as the number of 
these 30 pairs which he answered in the same direction 
in both the first and third persons. 

To obtain the staffs stereotype of addicts, the 
second half (third person) portion of the 350-item 
inventory was administered to 60 hospital treatment 
personnel, including psychiatric nurses and aides, 
psychiatrists, social workers, and psychologists. 
Third-person responses of the 100 addicts were then 
compared with those of the staff, and the 30 most 
differentiating items retained as the Valid Information 
(VI) Scale. (It should be noted that those were inter- 
group comparisons using the same items, as contrasted 
with the intragroup comparisons of parallel items 
used for the ID Scale.) Any addict’s VI score was the 
number of these 30 third-person items which he 
answered in the direction more characteristic of the 
staff. 

Of immediate interest in the VI Scale was the fact 
that eight of the items overlapped with ID items. For 
all eight of these, the aggregate self-ratings of the 
addicts occupied one response extreme; the staff's 
stereotype of addicts-in-general occupied the other, 
with the addicts’ stereotype of addicts-in-general in 
between. To examine this apparent relationship 
further, the eight overlapping items were divided 
equally between ID and VI, resulting in two 26-item 
scales. When scored configurally (as with the original 
ID Scale), these two scales correlated .68 in the sample 
of 100 addicts. Since even this value had probably 
been attenuated by eliminating the item overlap, it 
was decided to combine both scales into a longer and 
presumably more reliable ID Scale. The split-half 
reliability of this 52-item scale for the sample of 100 
are estimated by the Spearman-Brown formula, 
was .89. 
5 A revised true-false inventory, including the 52- 
item ID Scale, was administered to a new sample of 
250 male drug addicts as part of a routine group testing 
battery. Each subject was also interviewed by a staff 
clinical psychologist, who made certain ratings on the 
patient's interview behavior. Data pertaining to the 
subject’s past history were also collected during the 
interview. The 92 highest and 81 lowest-scoring sub- 
Jects on the 52-item ID Scale were designated as high 
and low identifiers, respectively. Data collected in the 
interview and with the group testing battery were 
used to test a number of specific hypotheses arising 
from the four postulates. 


ResuLTS AND DISCUSSION 
The results obtained may be conveniently con- 


5 Since the largest first-third person response differ- 
ences ranged between about 40 and 60%, it was felt 
that using these items would tend to maximize indi- 
vidual differences on the configurally scored ID scale, 
In addition, it was assumed that such items would also 
tap areas where the lowest identifiers tend systemati- 
on. to dissociate themselves from the addict subcul- 

e; 
* "ID" will henceforth refer to this combined scale. 
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sidered, together with the specific hypotheses, 
grouped in terms of the general postulates. 


Acceptance of the Condition of Addiction 


Hypothesis I: High identifiers become addicted to 
drugs more rapidly than low identifiers. Rapidity of 
addiction was defined inversely as the interval of 
time between first use of drugs, as reported by the 
subjects in the interview, and the establishment of 
the initial physical dependence upon drugs. The 
means of 1.55 and 1.50 years for high and low 
identifiers, respectively, were not significantly 
different (5 > .05), nor did they differ in the pre- 
dicted direction. The hypothesis was not sup- 
ported. 

Hypothesis II: High identifiers relapse to drugs 
more readily after drugs are withdrawn. Data on re- 
lapse to drugs were collected by asking each subject 
to state the longest period of time, since first being 
addicted, that he had remained voluntarily ab- 
stinent from drugs. Fifty-two percent of the high 
identifiers reported a period of abstinence lasting 
more than one month, while 69% of the low identi- 
fiers reported such periods. The chi square of 
4.971 was significant (p < .05).” The hypothesis 
was supported. 


Asocial or Rebellious Tendencies 


Hypothesis III: High identifiers are more psycho- 

pathic. Psychopathy was measured by the MMPI 
Psychopathic Deviate (Pd) scale. The mean raw 
score of 30.9 for the high identifiers was signifi- 
cantly higher than the low identifiers’ mean of 
28.6 ( < .01). The hypothesis was supported. 
' Hypothesis IV: High identifiers are more socially 
maladapted. High identifiers scored significantly 
higher than low identifiers (p < .05) on a six-item 
scale of Social Maladaptation (Astin, 1959). The 
hypothesis was supported. 

Hypothesis V: High identifiers have less education. 
The mean years of schooling for the high and low 
identifiers were 10.5 and 11.3, respectively. These 
were not significantly different (p > .05). The 
hypothesis was not supported. 


Conditions Conducive to Learning Addict Role 


Hypothesis VI: High identifiers are older. The 
mean ages of high and low identifiers of 31.2 and 
29.9 years, respectively, were not significantly 
different (p > .05). The hypothesis was not sup- 
Ported. 

Hypothesis VII: High identifiers began using 
drugs at an earlier age. The average age of first use 
for the high identifiers was 23.5 and 24.1 for the 


7 The marked skewness of these distributions re- 
quired the use of the nonparametric statistic. All other 
hypotheses were tested by means of f ratios. 
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low identifiers. These means were not significantly 
different (p > .05). The hypothesis was not sup- 
ported. 

Hypothesis VIII: High identifiers have been using 
drugs for longer periods of time. Length of drug use 
(chronicity) was defined as the discrepancy be- 
tween age when physical dependence upon drugs 
was first formed and the age at the time of the 
present admission to the hospital. The mean inter- 
vals for the high and low identifiers of 7.7 and 5.8, 
respectively, were significantly different (6 < 01). 
The hypothesis was supported. 


Ability to Develop Relationships with Middle Class 
Society 


Hypothesis IX: Establishment of rapport be- 
tween staff psychologists and high identifiers is more 
difficult than between staff and low identifiers. Rap- 
port between the subject and the interviewing psy- 
chologist was estimated in terms of ratings made 
by the psychologist on the patient. Ratings of 
“talked freely,” “relaxed,” “alert,” “calm,” 
“warmed quickly in the interview,” “self-assured,” 
and “plans are realistic” were considered as indica- 
tive of good rapport. Ratings of “apathetic,” *sub- 
tly hostile,” “belligerent,” “aloof,” “suspicious,” 
“shy,” “passive,” “reticent to talk,” and “did not 
get involved in the interview” were considered to 
be indicative of poor rapport. Six “good rapport” 
items were applied significantly more often to low 
identifiers: “relaxed” (p < .01), “warmed quickly 
in the interview" (p < .01), “alert” (5 < .05), 
“self-assured” (p < .05), “plans are realistic” 
(p < .05), and “calm” (p < .05). Differences on 
the remaining items, while not significant, were 
in the predicted direction. In general, the hy- 
pothesis was regarded as supported. 

Hypothesis X: High identifiers are judged as 
poorer candidates for psychotherapy than low identi- 
fiers. High identifiers scored significantly lower on 
a measure of “acceptability for psychotherapy" 
(Monroe & Hill, 1958) than did low identifiers 
(p « .01). The hypothesis was supported. 

Hypothesis XI: High identifiers’ stereotype of 
addicts-in-general agrees less with that of the staff 
than does that of low identifiers. High identifiers’ 
stereotype accuracy, as measured by the Valid In- 
formation (VI) score, was significantly lower than 
that of low identifiers (5 < .001). The hypothesis 
was supported. 

Tt thus appears that the theoretical formulations 
regarding ID gain some degree of support from the 
data. Seven of the 11 hypotheses were supported, 
and in only 1 of the 11 was the difference not in the 
predicted direction. 

The results leave no doubt that Addict Identi- 
fication (ID), defined as the extent to which an 
addict identifies with his own stereotype of addicts- 
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in-general, provides a meaningful way to differen- 
tiate among hospitalized drug addicts, Following 
Cronbach’s (1955) reasoning, ID, as with other 
assumed similarity scores, is best understood as a 
generalized mental set, and may be relatively inde- 
pendent of the psychological content of the items 
used in obtaining it. The behavioral correlates 
found for ID in this study extend its meaning far 
beyond its operational definition as a discrepancy 
between concepts of self and others. ID appears to 
be a function of several other variables, including 
both past and present aspects of the addict’s be- 
havior. One problem which merits further study 
has to do with the likelihood that ID is a learned 
phenomenon, and can therefore be unlearned. If, 
as suggested by the present study, high ID is re- 
lated to poor prognosis for psychotherapy and 
tendency to continue drug use (chronicity), the 
possibility of unlearning ID should be thoroughly 
investigated. An initial step in this direction might 
be to determine which historical variables are ante- 
cedent to and which are a consequence of the de- 
velopment of high Addict Identification. 

The method outlined here may prove to be 
valuable also as a means for studying other types 
of identifications or social class orientations, It 
may well be, for example, that group identifications 
of this kind account for some of the recent evi- 
dence relating social class membership to the type 
of psychiatric treatment prescribed (Hollingshead 
& Redlich, 1958; Kahn, Pollack, & Fink, 1957). 
The tendency for psychotherapists not to accept 
as patients those who come from the lower social 
classes may stem from something akin to the 
“poor rapport” which we observed in our high 
identifiers. 


SUMMARY 


The purpose of the present study was to examine 
the tendency on the part of hospitalized narcotic 
drug addicts to identify with “addicts-in-general.” 
Identification was defined for each addict in terms 
of a discrepancy between his self-evaluation on a 
true-false inventory of habits, feelings, and atti- 
tudes, and his evaluation of “most addicts” on a 
parallel inventory of similar content. Those sub- 
jects who tended more to attribute to themselves 
the same evaluations which they assigned to ad- 
dicts-in-general were designated as high identifiers. 
Four general postulates regarding high identifiers 
were formulated. As compared with low identifiers, 
they were expected to (a) be more accepting of the 
way of life of the addict, including acceptance of 
the condition of addiction; (b) possess stronger 
asocial or socially rebellious tendencies; (c) have 
encountered more past experiences conducive to 
learning a and b; and (d) be less able to develop 
satisfactory interpersonal relationships with mem- 
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bers of middle class society. Eleven specific hy- 
potheses, derived from these postulates, were 
tested with data collected in clinical interviews and 
with a routine group testing battery. These theo- 
retical formulations regarding addict identification 
gained some degree of support from the data. 
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TL a recent issue of this journal Lindzey, Lykken, 
and Winston (1960) undertook to investigate 
one of the principal questions concerning the deter- 
- minants of adult behavior, namely, the interaction 
between ontogenetic and genetic factors as related 
to later behavior. There is little question that the 
— problem is important and intriguing; however, as 
we hope to point out, the study of Lindzey et al. 
added little but confusion to an area that has its 
full share of difficulties. 

During the past several years there have ap- 
peared in the journals a number of papers (Denen- 
berg & Bell, 1960; Levine, 1958, 1959b; Levine, 
Chevalier, & Korchin, 1956) related to the effects 
of various infantile experiences with noxious 
stimuli on later behavior. Several of these papers 
present data on emotionality using measures 
similar to those used by Lindzey et al. (1960), 
namely, defecation, urination, and timidity. The 
conclusion reached by all of these studies was that 
infantile experience with noxious stimuli reduces 
emotionality as defined by a significant reduction 
in autonomic reactivity and increased ambulation. 
These conclusions are in direct contradiction to 
that reached by the Lindzey et al. study. 

Such conclusions were based on comparisons 
which unfortunately were irrevocably unavailable 
to Lindzey et al. (1960). In all of these studies three 
basic groups were employed, namely, a group sub- 
jected to noxious stimulation during infancy, 
another group which were treated similarly to those 
receiving noxious stimulation (these two groups of 
course resemble the Lindzey et al. groups), and a 
third and critical control group, a group which re- 
ceived no treatment during infancy. Both logically 
and methodologically if one wishes to compare the 
effects of any treatment such comparisons must be 
made with a condition of no treatment. This error 
is more flagrant in view of the numerous reports 
that have appeared using this specific design 
when investigating the problem of infantile 
“trauma” on later emotionality. Thus although in 
some instances the infant animals subjected to 
noxious stimulation appear to differ somewhat 
from their treated controls, when they are com- 
pared with totally nontreated animals the effects 
of noxious stimulation have been to reduce emo- 
tionality dramatically. In a recent paper (Levine, 
1959a) the following statement appears: 


In this study, as in several previous studies, no 
differences in behavior were observed between the 
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different modes of stimulation. Although some differ- 
ences have been noted in adulthood between animals 
that were shocked in infancy and those that were 
handled, these differences have been transitory, and 
the magnitude of differences has been much smaller 
than the differences between stimulated and non- 
stimulated animals. The evidence obtained thus far 
has raised the question of what in this area constitutes 
the major experimental treatment. It appears that the 
condition of no treatment seems to have the most pro- 
found effect upon development and the subsequent 
emotionality and performance of the animal, Thus far 
in all of the experiments coming out of our laboratory 
it has been the non-stimulated infant that has exhibited 
relatively slower development, greater emotionality, 
and poorer performance in adulthood (p. 245). 


Yet Lindzey et al. (1960) interpret their data 
as confirming their hypothesis that infantile 
trauma would produce an excessive emotionality. 
We contend that such an hypothesis is testable only 
within the strictest limitations if at all, using a 
design that does not include a nontreated control 
group. It is indeed difficult to understand how the 
relevant literature related to this point could 
have been overlooked since in three of the papers 
quoted in the references (Ader, 1959; King, 1958; 
King & Eleftheriou, 1959) reference is made to re- 
search in which a nontreated control group design 
was used. Still in the introduction Lindzey et al. 
seem to exhibit surprise that “One investigator 
(Ader, 1957) actually has reported results indicat- 
ing that treated rats were less emotional than 
control animals,” and again state “It is worth 
note, however, that there is actually no evidence 
in any study for infantile trauma decreasing’ 
emotionality.” The evidence speaks for itself 
when a complete and adequate design is used. — 

Unfortunately this same major flaw in experi- 
mental design has been perpetrated on numerous 
occasions by several investigators including Ader 
(1959), Griffiths and Stringer (1952), Scott (1955), 
and Stanley and Monkman (1956). Perhaps the 
problem is that the hypotheses that generate these 
studies start off with assumptions and biases 
which often result in the omission of “nonessential” 
control groups. 

One cannot preclude the possibility that given 
certain genetic conditions noxious stimulation can 
result in increased emotionality. The study by 
King and Eleftheriou (1959) and more recent 


1 The italics in these instances are those of Lindzey, 
Lykken, and Winston (1960). 
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investigations by Ginsburg (1960) have indicated 
that this does occur. Our comments are directed 
to the point that the Lindsey et al. (1960) study 
by the very nature of their experimental design 
could not have demonstrated these relationships. 
Close inspection of their data casts further doubt 
upon their conclusions and reveals only marginal 
levels of significance for the effect they claim to 
have firmly demonstrated. 

It might have been possible to argue that 
merely placing an infant animal in a washtub for 
6 minutes from the fourth to the eighth day could 
not be sufficient to cause any effect. However, it 
has been shown on numerous occasions that the 
simple process of picking up (Levine, 1956, 
1959b; Schaefer, 1957) an animal daily during 
infancy can exert a profound effect on its later 
behavior and emotionality. In addition the days 
between 4-8 compare almost directly with the 
critical periods during which the effects of infantile 
stimulation appear to have their most profound 
effect in rats and mice (Denenberg & Bell, 1960; 
Levine & Lewis, 1959; Schaefer, 1957). 

We should like to reaffirm our faith that the 
problem is an extremely important one and because 
of its importance demands the utmost of the in- 
vestigator, and if we may quote again (Lindzey 
et al, 1960) "this is a finding that makes life 
more complex for both the investigator and 
theorist, and in an area where there has never 
been any shortage of complexity," and albeit 
this statement is taken out of context it more 
than amply states the position of the present 
writer. 

Since this critique was written a similar note 
has appeared by Denenberg (1961). 
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evıne (1961) is convinced that our study 
cannot provide a satisfactory answer to 
any question because it was poorly designed. He 
asserts that as a result of omitting an essential 
control group we have observed findings that led 
to an erroneous conclusion. On the first issue he 
is flatly wrong, while on the second there is, as 
yet, insufficient evidence to permit a final answer. 
In brief, we assert that Levine is confused in regard 
to the logic of control groups and, further, that 
his convictions have led him to extrapolate an 
empirical generalization more broadly than is 
warranted by existing data. 

If we accept Levine’s (1961) reasoning that his 
“untreated” group belongs on the same continuum 
as our traumatized and nontraumatized groups 
(an assumption we would question if space per- 
mitted), it follows that we have examined the 
effects upon emotionality of two values of trauma. 
But, asserts Levine, without the zero point on the 
trauma continuum it is impossible for us to say 
anything about the relation between trauma and 
emotionality. This is simply incorrect! It is per- 
fectly meaningful and legitimate to compare the 
effect of two nonzero intensities of a particular 
variable on a dependent measure and the resultant 
finding stands as an empirical contribution in its 
own right. Indeed there are few zero points to be 
found on the psychological scene (cf. S. S. Stevens) 
and certainly none in Levine’s own research. It is 
evident that Levine’s “untreated group” is not 
untreated in any absolute sense. Treatment is 
nothing more than stimulation. All subjects are 
continuously undergoing treatment from the 
moment that sensory capacity begins. Our control 
group had one source of stimulation removed 
(loud, noxious bell), while Levine’s untreated 
group had another source of stimulation removed 
(handling prior to weaning). Obviously there are 
many additional kinds of stimuli (even noxious 
stimuli) that are a standard part of the laboratory 
‘and remain a part of the experience of his un- 
treated control as well as our untreated control. 
Thus at best his group represents only another 
point on the trauma continuum, not a zero point, 
and eventually his findings will have to be fitted 
into a broader empirical net, just as will ours. 

The second important issue has to do with the 


1 Preparation of this paper was supported by a 
grant from the Ford Foundation and Research Grant 
M-3536 from the National Institute of Mental Health, 
United States Public Health Service. 


general relation between infantile trauma and 
emotionality. Perhaps the major difficulty here is 
semantic. It is clear to us that Levine uses the 
terms "infantile trauma" and “emotionality” 
differently than we do. Consequently, where he 
sees the existence of direct conflict we see the need 
for bridging studies to bring two domains of data 
into some orderly relation. Specifically, we define 
infantile trauma as an intense noxious stimulus 
administered during infancy while Levine is willing 
to say that any noxious stimulus, no matter how 
intense, administered prior to weaning, consti- 
tutes an infantile trauma. Thus for him a brief 
period of daily handling that extends over 20 days 
of the life of the animal is an infantile trauma, 
while for us it is neither exclusively infantile nor 
is it necessarily traumatic. In regard to emotionality 
there is the same difference. We anchor the term 
specifically in urination/defecation in an open 
field situation while Levine is willing to use 
motility, avoidance learning, and consummatory 
behavior, as well as other operations, to define 
emotionality. We feel that our use of the terms 
“emotionality” and “infantile trauma” is better 
rooted in past convention than is Levine’s usage. 

This problem is most clearly revealed in the 
second paragraph of Levine’s (1961) note where 
he reports that four studies, several using meas- 
ures similar to ours, have led to results that directly 
contradict our findings. Let us remember that 
our dependent measures were timidity (stove-pipe 
test) and emotionality (urination /defecation) 
and examine these four studies. Two of them em- 
ploy as their dependent measures indices of avoid- 
ance learning. One employs a measure of water 
consumption and an index of emotionality con- 
sisting of the number of trials on which defecation 
and freezing occur. The fourth study used a meas- 
ure of defecation on an elevated runway, latency 
to move on this runway, and a measure of water 
consumption. To suramarize, there is no timidity 
measure here at all, one study uses an index com- 
bining freezing and defecation, which is only a very 
distant cousin of our urination/defecation meas- 
ure, and one study uses à somewhat similar 
defecation measure and actually finds with this 
measure no treatment effecis—indeed the trend of 
the data is consistent with our prediction. It is 
difficult to see how observation of no treatment 
effects can be considered to support Levine's 
contention that infantile trauma reduces emo- 
tionality. It is even harder to see how one can talk 
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about “direct contradiction” when discussing re- 
sults based upon measures so different from ours. 

In the process of exploring the semantic differ- 
ences between Levine and ourselves, it has become 
clear that a critical remark in our paper (Lindzey, 
Lykken, Winston, 1960, p. 11) concerning Ader's 
(1959) research was partially a product of language 
differences and consequently mot warranted. 
Apology is hereby tendered. 

There seems no need to comment at length 
upon the matter of "marginal levels of signifi- 
cance.” Such a remark is either too little or too 
much. Our data are presented explicitly and were 
subjected to the kinds of analyses that seemed to 
us most appropriate. We have no doubt that an 
objective observer given these data would conclude 
that there was a treatment effect. Moreover, these 
findings have been repeatedly obseryed in our 
laboratory and, as indicated in our paper, they are 
a replication of findings reported independently 
by others using the same procedures and com- 
parable subjects. If these findings are indeed 

` replicable, it would seem clear that Levine has 
Some responsibility for modifying his principal 
generalization if he wishes to extrapolate it so 
broadly as to include our data. 

What of our convictions in regard to the relation 
between emotionality and infantile trauma? Given 
our definitions of these terms we feel confident 
that. trauma increases adult emotionality. If we 
alter the definition of trauma to include the full 
Tange of stimulation, we have little doubt that 
research will eventually reveal a U-shaped function 
linking these two variables. Thus it is quite possible 
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that Levine is right in regard to one portion of | 
_this function and that we are right in regard tc 


another segment. It should be added that there 
are many parameters that can be expected to 
influence this general relationship and the study 
of these has only begun. 4 

What can we conclude? First, there is far too 
little connected evidence to permit an empirical 
resolution of the general issue. At least as im- 
portant as additional data is increased semantic 
clarity and greater information concerning the 
relation between the various dependent measures 
employed. Second, in the interests of generalizing 
his findings broadly, Levine has shown itle 
concern over many important differences between 
his research and our own and he has demonstrated 
no interest in the possibility that stimulation and 
adult emotionality (temperament) may. be linked 
by some reasonably complex function. Third, 
Levine was incorrect in asserting that our research 
was incapable of answering any question because 
we chose to omit his favorite control group. 
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í Hr most characteristic feature of 
contemporary treatments of moral 


development is undoubtedly the em- 
phasis on the role of moral judgment. Piaget's 
(1948) original observations on the child's 
growing awareness of morality have been 
followed by numerous assessments (Boehm, 
1957; Durkin, 1959; Kohlberg, 1958; MacRae, 
1954; Peel, 1959) of children's perceptions of 
social norms, of their conditions of application, 
and of their sources of origin. Of course, the 
knowledge of standards of conduct, as it is 
conveyed through verbal report, is a step 
removed from actual social behavior. But 
psychological conceptions of morality that 
have been more directly concerned with its 
behavioral manifestations have also commonly 
reserved the term moral for behavior that 
appears to rest upon standards with respect to 
which actions, thoughts, or feelings may be 
evaluated as “good” and “bad” or “right” and 
"wrong." The concept of guilt, so uniformly 
accorded a central position in theoretical 
accounts of internalized moral responses to 
transgression, reflects the assumption that 
judgment is the base of morality. The clinical 
impetus to psychologists’ concern with guilt 
has sometimes led to a focus on its affective or 
motivational properties. This focus should not, 
however; be permitted to obscure the fact that 
guilt ordinarily designates a psychological 
phenomenon having a cognitive aspect of 
self-evaluation. 

One finds in the work of many psychologists 
the view that guilt rests upon self-criticism 
and is a prerequisite of moral reactions to 
transgression which are internalized in the 
sense that they occur even in the absence of 
any explicit or threatened external punish- 
ment, Freud (1936), for example, clearly 
distinguished the self-mediated consequences 


1 The author is indebted to the Eastern Pennsylvania 
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of transgression from those dependent upon 
external sanctions. He portrayed the in- 
dividuals conscience. as an. autonomous 
internalized representation of prohibitions and 
punishments formerly present in his parents’ 
behavior. The same kind of distinction is 
made by Sears, Maccoby, and Levin (1957), 
who used children's expressions of guilt as 
evidence that they were ‘following thè self- 
instructio of their own standards and not 
merely anticipating the external consequences 
of their misdeeds. In a cross-cultural study ofa 
number of different societies, Whiting and 
Child (1953) similarly described two distinct 
mechanisms of social control, one of “moral 
anxiety” or guilt and the second of “objective 
anxiety” or fear of external punishment. 
Other investigators have; concurred in 
equating internalized moral reactions with the 
presence of self-criticism. Henry and Short 
(1954), in their study of societal influences 
upon the direction of aggression, speculated 
that self-punitive tendencies were propor- 
tionate to the extent of internalization of 
parental sanctions. Heinecke (1953) took the 
intensity of expression of guilt, in a group of 
4- and 5-year-olds, as a measure of “strength 
of superego formation.” And Allinsmith 
(1960) has attempted to relate a variety of the 
moral reactions of older children to an exten- 
sive network of self-evaluative standards. All 
of these approaches share the view that a 
response to transgression is moral when it 
follows from the child’s judgment of his own 
behavior and no longer requires the support 
of external norms and sanctions. 
Internalized responses to transgression 
comprise, of course, only one set of phenomena 
to be understood in any general account of 
moral behavior. But these responses provide a 


_ fundamental insight into the nature of morality 


in that they assume a variety of forms many 
of which reveal an absence of self-evaluation | 
and an external orientation inconsistent with 
what one might expect if they were entirely 
internally mediated. Experiments by Levin and 
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Turgeon (1957), and by Siegel and Kohn 
(1959), have demonstrated, in the area of 
aggressive behavior, some of the conditions 
under which children may relinquish internal 
controls and rely on an external definition of 
transgressions and their consequences. 
Common observation of children indicates 
quite clearly that even when their actions are 
not directly known to others, they often 
perceive or anticipate punishment or criticism 
in the responses of other people or in im- 
personal fortuitous events. Children often 
appear uncomfortable until they have con- 
fessed their transgressions or have done some- 
thing that brings about discovery or punish- 
ment. Nor is it unusual to see young children 
reproduce their parents’ disciplinary reactions 
in response to their own actions. Even their 
self-critical or self-corrective behavior may 
depend on external cues and resources for its 
initiation and performance. Another more 
indirect suggestion of children’s dependence 
on the behavioral norms of their social environ- 
ment is apparent in their tendency to localize 
the responsibility for their own misbehavior in 
external sources and to blame others for the 
same offenses. 
Externally oriented moral responses cannot 
be regarded as merely transitory reactions 
characteristic of childhood. Various studies 
(Adorno, Frenkel-Brunswik, Levinson & 
Sanford, 1950; Allinsmith, 1960; Aronfreed, 
1960) have shown that, even beyond childhood, 
people frequently perceive the source of 
criticism or punishment for their actions in 
other people’s responses, interpret impersonal 
events as punishment, or blame others rather 
than themselves for wrongdoing. In a recent 
statement based on cross-cultural comparisons, 
Whiting (1959) points out that while no 
society can rely completely on direct external 
supervision of social behavior, not all societies 
inculcate guilt as the effective mechanism of 
moral functioning, He suggests that externally 
defined modes of social control, such as the 
fear of ghosts or sorcerers, are more typical of 
some cultural groups. In our own western 
society, while it may be thought of as one 
that values the more highly internalized kinds 
of controls, one also sees that moral responses 
are often initiated and carried out in a way 
that testifies to the relevance of their external 
social context. 


JUSTIN ARONFREED 


It is difficult to conceive that self-evaluation 
is an essential ingredient of those responses to 
transgression which show an external orienta- 
tion. Indeed, even the verbalization of self- 
critical statements can sometimes be shown, 
upon questioning an individual, to follow not 
from an articulate standard of judgment, but 
rather from the tendency to make the critical 
response automatically as though it were a 
necessary consequence of having committed a 
transgression. Such a tendency can be 
especially transparent in the behavior of young 
children. We know, too, that a person may 
feel shame, disgust, or generalized anxiety, 
without a clear awareness of precisely what is 
wrong about his behavior. And it frequently 
seems that people do not need to engage in à 
great deal of judgment or conscious reflection 
in order to experience unpleasant feelings, and 
even to carry out self-corrective behavior, in 
response to some of their actions. The minimal 
Tole that explicit standards appear to play in 
some moral reactions may very possibly reflect 
the limited cognitive auspices under which 
much of socialization takes place. 

For various reasons, naturalistic observation 
seems more appropriate than experimental 
methods as an initial approach to exploring 
and circumscribing a wide range of mor 
responses. However, in order to examine the 
role of self-evaluation in such responses and, 
more generally, the extent of their internal or 
external orientation, a systematic assessment 
is required that will distinguish specific forms 
and patterns of response. The empirical survey 
of children's responses to transgression Te 
ported here is intended to provide a description 
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of the variety and distribution of these te | 


sponses in a sample of children chosen tO 
represent both sexes and the two major 
Socioeconomic classes. The survey will then 
be used to evaluate a. theoretical framework 
specifying the relationships between various 
kinds of moral response and the child's pos 
tions in society in terms of differential patterns 
of social reinforcement. 


Social Position and Moral Orientation 


One of the defining criteria of status within 
a socioeconomic hierarchy is that the people 
holding higher positions have greater power 
and responsibility to evaluate and determine 
their own behavior as well as to act upon thelt 
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external environment. The requirements of 
their work tend to encourage independence 
from immediate external supervision. Their 
opportunities for realizing their aspirations 
through modifying their circumstances would 
make their use of internal resources meaningful 
and rewarding.-In contrast, the occupational 
roles held by individuals of lower status tend 
not to encourage initiative or self-reliance. 
Their opportunities for affecting their environ- 
mentare morerestricted. Their work is bounded 
by its concrete material nature, and its per- 
formance is more highly specified, within fairly 
narrow limits, by sources outside of their own 
control. They should, therefore, be more 
prone to perceive external dangers, gratifica- 
tions, and restraints as determinants of their 
behavior. There is considerable evidence that 
differences of social orientation, in the direction 
described, do distinguish between the middle 
and the working classes in our society and 
characterize their values and practices in 
socializing their children (Davis & Havighurst, 
1946; Henry & Short, 1954; Kohn, 1959; 
McClelland, Rindlisbacher, & de Charms, 
1955; Miller & Swanson, 1958). These differ- 
ences would certainly be expected to have 
implications for the child’s internal or external 
orientation in moral behavior. 

If status has the kind of consequences for 
social behavior ascribed to it here, then the 
significance of the masculine and feminine sex 
roles for the child's moral orientation should 
be, in some respects, parallel to the significance 
of social class membership. Despite the in- 
creasing equality afforded the two sexes in 
western society, it still remains true that 
greater status and self-direction of action 
attach to the masculine role and that children 
are sensitive to the differential activities and 
privileges of the two sexes (Brown, 1958; 
Lynn, 1959; Parsons, 1953; Rabban, 1950). 
Men are expected to have more control over 
their own actions as well as over their external 
environment, Women are expected to be 
responsive to direction from without rather 
than to show self-reliance. They must be more 
sensitive, and ready to accommodate them- 
selves, to external influences (Barry, Bacon, 
& Child, 1957; Parsons, 1942). A woman’s 
daily activities in her primary roles of wife and 
mother are very much conditioned by her 
husband’s social status, and they usually do 
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not permit her as much freedom of movement 
and decision making power as accrue to his 
more crucial occupational role. 

Within the various means by which parents 
may transmit their social values and expecta- 
tions to their children, disciplinary practices 
would appear to be among the most specifically 
relevant to the child’s subsequent moral 
behavior. The prevailing view of social class 
differences in child rearing, that was held for 
some years after the well-known study by 
Davis and Havighurst (1946), was that the 
working class was more permissive and less 
frustrating than the middle class. Recent 
surveys (Bronfenbrenner, 1958; Maccoby, 
Gibbs, et al., 1954) have tended to challenge 
that view and to suggest that inconsistencies 
between different studies of the socialization 
practices of the two classes may be attributable 
to uncontrolled sampling factors or to processes 
of social change over time. It is therefore 
rather striking that in all of the surveys cited, 
as well as in others (Littman, Moore, & 
Pierce-Jones, 1957; Miller, Swanson, et al., 
1960), one persistent difference between the 
two classes continues to be found in their 
disciplinary techniques. Lower-class parents 
are more likely to use direct aggression in. 
their discipline, particularly physical punish- 
ment. Middle-class parents are more prone to 
use techniques variously described as “love. 
oriented" or “psychological” and consisting 
primarily of withdrawal of love, isolation, 
ignoring the child, reasoning, and explanation. 
On the basis of the little evidence available, it 
does not appear, in contrast, that there are 
consistent differences between boys and girls 
in the methods of punishment which they 
experience, although the survey by Sears et 
al. (1957) suggested that the specific technique 
of withdrawal of love might be used somewhat 
more frequently with girls. 

Many studies (Allinsmith, 1960; Heinecke, 
1953; Sears et al., 1957; Whiting & Child, 
1953) have indicated a limited degree of 
association between parental discipline and 
the child’s moral reactions to transgression. 
The implications of these studies for the 
distinctions made here between specific forms 
of moral response are somewhat difficult to 
assess, since typically a general index of 
conscience or internalization was used. In each 
case, however, the use of love oriented or 
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psychological methods was positively related 
to reactions to transgression which the in- 
vestigators regarded as evidence of “high 
internalization.” The aim of distinguishing 
moral responses with respect to their internal 
or external orientation suggested that dis- 
ciplinary techniques might be divided, for the 
purposes of the present study, into the two 
broad categories of induction and sensitization. 

The induction category subsumes methods 
that should tend to induce in the child reactions 
to his own transgressions which could readily 
become independent of their original external 
stimulus sources. A mother who indicates her 
disapproval by rejecting or ignoring her child, 
or by showing that she is hurt or disappointed, 
is stimulating unpleasant feelings that are not 
closely tied to her physical proximity or its 
imminence. The arousal and termination of 
these feelings are less well defined by externally 
explicit punitive events conveyed through her 
immediate presence than would be the case if 
she were to punish the child with a direct 

attack. Likewise, if she reasons with the child 
or explains why his behavior is unacceptable, 
she is using a verbal and cognitive medium of 
exchange that can provide the child with his 
own resources for evaluating and modifying 
his behavior. Finally, if she asks the child why 

. he behaved as he did, insists that he correct 
the consequences of his acts, or refrains from 
punishment when he takes the initiative in 
correcting himself, she is encouraging him to 
examine his actions and accept responsibility 
for them. 

The term sensitization is applied to the more 
direct techniques of physical punishment and 
verbal assaults such as yelling, screaming, or 
“bawling out.” Such techniques should give 
greater weight to the mother’s punitive pre- 
sence and be unlikely to induce reactions to 
transgression that could easily become sepa- 
rated from their original external definition. 
They should instead merely sensitize the child 
to the anticipation of painful external events 
and to the importance of external demands 
and expectations in defining appropriate 
responses. 

In view of the significance attributed to 
socioeconomic status and the emphasis on 
self-evaluative resources as one aspect of 
internal orientation, the role that intelligence 

might play in the anticipated relationships 


was explored by examining the relationship 
between intelligence and each of the specific 
forms of moral response. 


METHOD 


There are certain obvious difficulties in trying to 
reproduce the kinds of situations’ around which a 
variety of reactions to transgression may be elicited 
and systematically observed. It is unlikely, for example, 
that children would commit many of the necessary 
transgressions in the presence of observers. Moreover, 
some forms of moral reaction would not be easily 
discernible in the child’s behavior without resorting to 
techniques of verbal report and accepting the inherent 
limitations of these techniques. To meet these problems, 
a projective story completion device was constructed, 
consisting of five incomplete stories. The advantages 
of such a technique have been demonstrated in previous 
studies (Allinsmith, 1960; Aronfreed, 1960). The story 
completions written by the children can be analyzed 
for various responses to transgression occurring in the 
perception and behavior of the central figures, and these 
can be taken to represent the subject’s own strongest 
response tendencies. This projective method tends to 
minimize the child’s awareness that his own reactions 
are the center of interest. It also permits a fairly pre- 
cise structuring of a variety of the stimulus conditions 
under which children usually show moral responses to 
their own actions. Some recent investigations (Hokan- 
son & Gordon, 1958; Kagan, 1956; Purcell, 1958) have 
suggested that the assumption of isomorphism between 
overt behavior and its projective analogue gains 
credibility when the projective technique is not left 
unstructured, but rather approximates as closely as 
possible the conditions under which the overt behavior 
ordinarily takes place. 

"The transgressions that occurred in the stories were 
acts of aggression. Because such a considerable part 
of socialization in our society is devoted to the control 
of aggression, the range of moral responses elicited in 
this area of behavior was expected to be fairly broad 
and representative of moral responses in general. Eac 
story beginning was roughly 100 words in length and 
related, in simple language, an incident in which @ 
child (the central figure) had become very angry with 
little or no justification by ordinary social standards. 
The child then committed an act of aggression of 9 
kind generally socially prohibited but not uncommon 
among children. The aggression was directed against 
parents, a friendly neighbor, or a close companion, an 
the stories varied in the overtness and directness of the 
aggressive response. In one case, for example, à chil 
is about to join friends at a playground and is angered 
at the slight delay occasioned by the mother’s request 
to examine a rosebush and to pick any blooms whic 
may have appeared. Rather than take the time to 
return with the roses, the child takes them and throws 
them into a trash can at the playground. In another 
story, a child chalks a “nasty” word on a neighbor $ 
step because the neighbor's absence prevents the child 
from having a piece of play equipment that is usually 
generously provided. 

All of the stories attempted to convey that the 
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child’s action was not directly observed by others, so 
that externally oriented moral responses occurring in 
the story completions would reflect the child’s own 
perspective and not be invited by the context of the 
story. Likewise, provocation of the aggression in each 
of the stories was absent or negligible, so that there 
would be no realistic support for holding others re- 
sponsible for the actions of the central figure. 

Two sets of stories were used, so that the sex of the 
central figures could correspond to the sex of the child 
completing the stories. The two sets were otherwise 
almost identical, with the exception of minor varia- 
tions introduced to insure that the stories would be 
sex appropriate. For example, in two stories where the 
transgression was provoked by the child’s being reason- 
ably refused something, or inadvertently deprived of it, 
the specific nature of the object was made to accord 
with the interests of the two sexes. 


Subjects 


The subjects of the study were 122 white children 
drawn from the sixth grade classes of two public 
schools in a large urban school system.? The sample 
was composed of comparable numbers of children from 
the middle and working classes, and was equally 
divided between the two sexes. There were 34 middle- 
class and 27 working-class boys. Among the girls, 28 
were middle class and 33 were working class. Socio- 
economic status was determined on the basis of the 
fathers’ occupations as reported in questionnaire in- 
terviews with the children’s mothers. The working 
class group consisted of those children whose fathers 
earned their living through manual labor, factory work, 
or service and maintenance occupations. The fathers 
of the middle-class children were salesmen, office 
workers, owners and managers, and a few men engaged 
in professions. 


Procedure 


. Story completion technique. The stories were given 
in written form to small groups of five or six children, 
sitting widely separated from one another in a large 
room. Each child individually read each of the five 
story beginnings and wrote a completion to it, finishing 
a given story before going on to the next. The task was 
put into a context of creativity by asking the children 
to write “the most interesting kinds of stories for sixth 
graders,” and anonymity was preserved by having 
each child place the completed stories, without any 
name attached, in a large pile of already completed 
sets. Since each child completed five stories, 610 stories 
in all were obtained from the children. 

Questionnaire interview. The mothers of the 122 
children in the sample represented those who, among 
124 originally solicited, were willing to participate in a 
questionnaire interview, the purpose of which was 
described to them as “finding out about the different 
things which mothers do to help their children grow 
up." Sampling biases attributable to the criterion of 


? The author is grateful to a number of adminis- 
trators and teachers in the Philadelphia public school 
System, whose cooperation made this study possible. 


voluntary participation were, thus, virtually eliminated. 
When the mother's agreement to participate was 
received, she was visited at her home by one of four 
interviewers, two of whom were male and two female. 
"Though the majority of the interviews were conducted 
by the male interviewers, there was no difference be- 
tween the subsamples of mothers seen by the two sets 
of interviewers in their social class or sex of child charac- 
teristics. 

After about 5 minutes of relatively informal ques- 
tions aimed at attaining rapport with the mother and 
establishing the nature of her husband's. occupation, 
the interview turned to a series of 12 distinct questions 
designed to elicit the mother's report of specific dis- 
ciplinary techniques used with the child who was the 
subject of the study. Each question described a par- 
ticular form of aggressive behavior and the circum- 
stances under which it occurred, and then asked the 
mother how she usually responded when her child 
behaved in that way. The interviewers were provided 
with standardized probes for additional techniques or 
more detailed information. The situations described 
to the mother included not only obvious misbehavior 
such as throwing food or being verbally offensive 
toward her, but also more subtle incidents involving 
friends or property, where the child’s responsibility 
was somewhat ambiguous. Two of the questions were 
phrased to explore conditions under which the mother 
might withhold punishment or make it contingent 
upon the child’s subsequent actions, but even these 
commonly brought forth further disciplinary methods. 
Some of the questions dealt with the mother’s current 
methods, while others asked about her methods in 
response to the kind of behavior more characteristic 
of the earlier years of childhood. 

Intelligence measure. Scores on the sixth grade form of 
a test of verbal intelligence were available for almost 
all of the children in the sample. This test is a stand- ` 
ardized instrument used throughout the entire school 
system from which the children were drawn, It has a 
test-retest reliability of .91, a correlation of .78 with 
the Otis Quick-Scoring Inventory, and a correlation of 
.72 with the child’s average standing across all sixth 
grade achievement tests. 


Classification of Moral Responses 


The broad outlines of the classification to be de- 
scribed were determined by the theoretical orientation 
of the survey, although some of the details of cate- 
gorization were empirically adapted to accommodate 
the distinct form of typical responses in the story 
completions. Each story was individually scored for 
the presence of a number of independently defined 
forms of moral response. For some types of response, 
further distinctions were made among the story condi- 
tions defining their pattern of occurrence, in order to 
examine more closely the extent of their internal or 
external orientation. A single story almost invariably 
showed more than one form of moral response, and was 
therefore entered in more than one subcategory within 
the system of classification. A given subcategory was 
used only once, however, in the scoring of a particular 
story. The primary subcategories, and the major 
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categories of classification under which they were 
organized, were as follows: 

Self-criticism. This category was used whenever 
the moral reactions of the central figures showed 
evidence of self-criticism, self-blame, the explicit 
recognition of wrongdoing, or any form of self- 
appraisal of the kind associated with the experience 
of guilt. A slightly more liberal interpretation was 
made of this category than of others, so as to be 
certain that there would be no underestimation of 
the presence of self-evaluation. Disturbances of 
thought or action, for example, were included when 
they appeared to be associated with reflection upon 
the transgression. Being “sorry” was not in itself 
taken as an indication of self-criticism, since it 
might actually be a response to external pressures 
or anticipated unpleasant external consequences. 
But remorse was included when it was felt specifi- 
cally for the act of transgression and seemed to imply 
self-evaluation. 

Correction of deviance. A wide range of responses 
was classified here, all of them indicating an attempt 
to return to the appropriate social boundaries of 
behavior. These took the form of admission to 
others of responsibility for the transgression (con- 
Session), expression of remorse for actions or inten- 
tions (apology), removal or amelioration of injurious 
consequences (reparation), either concretely or 
through affection or helpfulness toward those per- 
ceived as harmed by the transgression, and resolu- 
tions or commitments to act more acceptably in the 
future (modification of future behavior). 

Degree of activity in self-correction. This index was 
a crude continuum constructed to use the total 
pattern of responses in each story completion. Its 
purpose was to evaluate the extent to which moral 
responses were self-defined and initiated in the 
child’s own behavior in a way that indicated an 
active acceptance of responsibility, or, conversely, 
the extent to which they revealed a passive reliance 
on external events. A story was classified as high 
on the index if the sequence of moral responses in- 
cluded one of the more active forms of self-correc- 
tion, such as reparation or modification of future 
behavior, without any dependence on external 
events. Stories were scored as intermediate if the 
Sequence of moral responses was self-initiated but 
did not include an active form of self-correction 

(for example, if only confession or apology occurred), 
or if the sequence was externally initiated but sub- 
sequently included an active self-correction not 
imposed or required by external conditions. Stories 
were classified as low if they contained only ex- 
ternally defined consequences of transgression, or if 
such consequences were followed only by corrective 
actions that were relatively passive and essentially 
required by the external situation (for example, if 
confession or apology occurred only after the trans- 
gression had been discovered and punishment 
threatened). 

For all of the other categories in the classification 
of moral responses, social class and sex compari- 
sons could be made on the basis of the subjects’ total 
scores, which were obtained simply by counting the 
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number of instances of a category occurring over the 
five stories. This index, however, was an attempt to 
express the degree to which a certain property of 
moral responses was present rather than its mere 
presence or absence. It was therefore necessary to 
devise a means of combining the evaluations of 
individual stories so as to arrive at a composite 
evaluation of a subject’s entire set of stories, Ac- 
cordingly, values of zero, one, and two were arbi- 
trarily assigned to stories scored, respectively, in the 
low, intermediate, and high categories. These values 
were then summated across the five stories to repre- 
sent each subject’s total degree of activity in self- 
correction. 

External resolution. This category was used to 
describe responses characterized by the central 
figure’s perception of the consequences of trans- 
gression as being primarily defined by external 
events. Three basic forms were distinguished. In the 
first type, discovery and punishment, other people 
discovered the transgression and responded with 
punishment or criticism. In some cases, discovery 
was invited or punishment provoked by the central 
figure’s actions subsequent to the transgression. The 
second type consisted of experiences of unpleasant 
fortuitous consequences following the transgression 
through indirect effects or impersonal circum- 
stances. The central figure might, for example, suffer 
a painful accident, or be left with an unpleasant 
task, or have something of value lost or destroyed. 
The third type of external resolution was a focus 
on external responsibility, in which the actions of 
others were held responsible for the transgression or 
were used to justify it. 

Externally oriented initiation and performance. A 
set of somewhat more refined distinctions was 
developed to describe various ways in which even 
the central figure’s own moral actions, such as self- 
criticism or corrections of deviance, might never- 
theless reveal some degree of reliance on the ex- 
ternal social environment. Thus, certain moral 
reactions received an external initiation in that they 
occurred only upon the influence or demands of 
other people. Likewise, public expressions of remorse 
and promises of conformity were considered & 
display of moral responses (often with the result 
of being forgiven). And acts of reparation were 
characterized as dependent on a use of external 
resources when the assistance of other people (and 
occasionally even of God) was sought in carrying 
them out. 


Classification of Maternal Discipline 


The disciplinary practices reported by the mothers 
were divided into the two broad categories of induction 
and sensitization techniques on the basis of the con- 
ceptions already described. Three types of techniques 
were classified as induction: withdrawal of love in the 
form of rejecting or ignoring the child, indicating dis- 
appointment, refusing to speak to the child, or telling 
the child that he ought to feel bad; techniques 1n- 
fluencing the child to assume or examine his own Tê- 
sponsibility, such as asking him to report or account 
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for his behavior, insisting upon his making reparation, 
or encouraging him in various ways to define trans- 
gressions for himself or to initiate his own moral 
responses; explanation of relevant standards (including 
“reasoning” and “talking things over”), describing the 
consequences of his actions to the child, suggesting 
appropriate alternative actions, or simply telling the 
child what aspects of his behavior were unacceptable. 
All forms of physical punishment and uncontrolled 
verbal assaults such as yelling, shouting, screaming, or 
“bawling out” were classified as sensitization tech- 
niques. 

Other techniques used by the mothers which could 
not be clearly coordinated to the behavioral impact 
ascribed to induction and sensitization methods were 
excluded from the classification, Among the most 
frequent of these were various forms of restriction and 
deprivation, the effects of which were considered to be 
highly dependent on their behavioral context and style 
of administration. Discipline described with such terms 
as “scolding,” “lecturing,” or “criticism,” without 
further elaboration, was also viewed as ambiguous 
within the induction-sensitization framework, since it 
was not obvious that it provided self-evaluative re- 
sources for the child nor could it, on the other hand, be 
assumed to represent a direct attack with the function 
of merely eliminating the offending behavior. 

For each mother, two separate counts were made 
across the 12 disciplinary situations used in the ques- 
tionnaire—one for the number of situations in which 
she reported induction techniques and the second for 
the number in which she reported sensitization tech- 
niques. In some instances, of course, the mother’s 
response toa single situation contributed to both counts. 
Each situation was permitted to enter only once into 
each count, however, even though mothers often men- 
tioned more than one specific kind of induction or 
sensitization technique in a given situation. The entire 
sample of mothers was then divided into two groups on 
the basis of which of the two counts was higher. Three 
mothers whose induction and sensitization counts were 
identical were placed in the group showing a predomi- 
nance of sensitization techniques in order to have the 
two groups as nearly equal in size as possible. 


Reliability of Classification 


. The entire classification of moral responses was 
independently applied by two judges? one of whom 
was the writer, to the 100 stories written by a sub- 
sample of 20 children. The children were selected so as 
to equally represent both sexes and the two social 
classes, but were randomly chosen within this restric- 
tion. Each specific subcategory of moral responses 
scored by either of the judges was treated as an instance 
of classification, and was entered as a disagreement if 
not scored by both. In the case of the index of degree of 
activity in self-correction, failure of the two judges to 
Score the same degree of activity was counted as a dis- 
agreement. Percentages of agreement were computed 
over the total number of instances of classification in 


3 The author is indebted to Stephanie Manning for 
her work in establishing the reliability of classification. 
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TABLE 1 
PERCENTAGE OF AGREEMENT BETWEEN Two INDE- 
PENDENT JUDGES FoR EacH Major CATEGORY 
OF CLASSIFICATION OF MORAL RESPONSES 


Category of moral responses Quee 
ment^ 
Self-criticism 94 
Correction of deviance 96 
Degree of activity in self-correction 90 
External resolution 97 
Externally oriented initiation and per- 
formance 98 


? Based on a sample of 100 stories completed by 20 subjects. 


each of the five major categories of the system. These 
percentages of agreement, summarized in Table 1, 
indicate that the classification system can be objectively 
and consistently used. The rather high reliability of 
classification probably reflects the fact that the system 
was specified in great detail and left little room for. 
inference or interpretation. Comparable reliability 
has been obtained with a similar system in a previous 
study (Aronfreed, 1960). 

The same two judges independently classified the 
disciplinary techniques reported by a randomly se- 
lected subsample of 20 mothers. Although the tech- 
niques were eventually to be grouped into the two 
broad types of induction and sensitization, reliability 
was based on the identification of specific forms of 
discipline, and a disagreement was entered whenever a 
particular form was scored by one judge but not by 
the other. The judges agreed on 97% of the total 
number of instances of classification of induction 
techniques and on 98% of the instances of sensitization 
techniques. 


RESULTS AND DISCUSSION 


Table 2 shows, for each of the subcategories 
in the classification of moral responses and 
for the major types into which they were 
grouped, the frequencies and percentages of 
occurrence among both subjects and stories. 
In order to assess the role of self-evaluation 
in correction of deviance and in external 
resolution, instances of each specific form of 
these two types of response were separately 
tabulated with respect to the presence or 
absence of self-criticism in the stories in which 
they occurred. In computing percentages, a 
common basis for comparing different forms 
and patterns of moral response was provided 
by taking all frequencies as proportions either 
of the total number of subjects or the total 
number of stories. 

The entries given for the broad response 
types in Table 2 represent frequencies and 
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percentages of occurrence of these types in 
any specific form. A given subject, and even a 
single story, frequently showed multiple 
instances of moral responses classified within 
the same major type. Consequently, the 
entries presented for any major type are 
always smaller, and often considerably so, 
than the sum that would be obtained by 
adding the totals recorded for each of its 
specific forms. The single exception to this 
pattern occurs, in the case of the stories only, 
for the index of degree of activity in self- 
correction, since there a rating of degree is 
given to every story and is applied to the whole 
story as a unit. Similarly, for those forms of 
moral response which are separately tabulated 
according to whether or not they are accom- 
panied by self-criticism, the entries for subjects 
showing any instance of a response are smaller 
than the combined entries for the presence and 
absence of self-criticism, since a particular 
subject may show some instances of the 
response that are accompanied by self-criticism 
and others that are not. The entries for stories 
showing any instance of a response do, how- 
ever, represent the combined entries for the 
presence and absence of self-criticism, since in 
a single story self-criticism can only be either 
present or absent. 

The most immediately obvious feature of 
the distribution of moral responses is their 
great number and variety. Even restricting 
our attention to self-criticism, correction of 
deviance, and external resolution, and exclud- 
ing for the moment the more refined patterning 
of moral responses, we see that each trans- 
gression must be followed by multiple, 
Sequential reactions. It is apparent that, 
among the perceptions and behaviors that 
constituted moral responses, a single form of 
response was generally not sufficient to define 
the consequences of transgression. 


Self-Criticism 


Over one-fourth of the children show no 
evidence at all of self-criticism. The 90 sub- 
jects who do show evidence of self-criticism, 
while they are clearly in the majority, do not 
use it as a recurrently characteristic response, 
as can be seen from the fact it occurs in only 
28% of the stories. The restricted role of self- 
criticism is also reflected in the limited extent 
to which it accompanies corrections of deviance 
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TABLE 2 
FREQUENCY AND PERCENTAGE OF SUBJECTS AND 
Stores SmowiNG  EacH CATEGORY or 
CLASSIFICATION OF MORAL RESPONSES 


Subjects Stories 
Category of moral responses 


N | % | N|% 


Self-criticism 90 | 73.8 | 171 | 28.0 
Correction of deviance (any form) 
Selí-criticism present 84 | 68.8 | 160 | 26.2 
Self-criticism absent 113 |92.6 | 325 | 53.3 
Any instance 117 | 95.9 | 485 | 79.5 
Confession 
Self-criticism present 58 | 47.5] 91 | 14.9 
Self-criticism absent 94 | 77.1 | 182 | 29.8 
Any instance 108 | 88.5 | 273 | 447 
Apology 
Self-criticism present 18 | 14.7 | 18 | 29 
Self-criticism absent 46 | 37.7] 55| 9.0 
Any instance 54 |44.3| 73 |119 
Reparation 
Self-criticism present 56 |46.2| 79 | 12.9 
Self-criticism absent 99 | 81.1 | 165 | 27-1 
Any instance 110 | 90.2 | 244 | 40.0 
Modification of future behavior 
Self-criticism present 38 | 31.1] 53 | 87 
Self-criticism absent 56 | 46.2| 89 | 14.6 
Any instance 71 | 58.2} 142 | 23.3 
Degree of activity in self-correction 
Low 105 | 86.1 | 212 | 348 
Intermediate 116 | 95.1 | 248 | 40.7 
High 89 | 72.9 | 150 | 24.5 
External resolution (any form) 
Self-criticism present 79 | 64.8 | 125 | 20.5 
Self-criticism absent 96 | 78.7 | 168 | 27.5 
Any instance 118 | 96.7 | 293 | 48.0 
Discovery and punishment 
Self-criticism present $8 | 47.5| 70 | 11.5 
Self-criticism absent 84 | 68.8 | 122 | 20.0 
Any instance 107 | 87.7 | 192 | 31.5 
Unpleasant fortuitous consequences 
Self-criticism present 34 | 27.9] 36 | 59 
Self-criticism absent 33 | 27.1 | 38 | 16.2 
Any instance 60 |49.2| 74 | 121 
Focus on external responsibility 
Self-criticism present 43 | 35.2| 52] 85 
Self-criticism absent 47 | 38.5 | 52| 85 
Any instance 75 | 61.5 | 104 | 17.0 
Externally oriented initiation and 
Performance (any form) 93 | 76.2 | 150 | 24.6 
External initiation 55 | 45.1] 75 | 123 
Display si | 41.8] 75 | 123 
‘Use of external resources 43 | 35.2) 46 | 75 


^ Percentages based on Ns of 122 subjects and 610 stories. 


and external resolution, For each of the 
specific forms of correction of deviance, and in 
the entries for instances of any form, the 
percentage of subjects for whom there is an 
accompanying self-criticism is uniformly con- 
siderably smaller than the percentage of 
subjects for whom there is not. Among the 
stories, also, self-criticism is typically absent 
roughly twice as frequently as it is present 
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when corrections of deviance occur, and the 
disproportion is even greater in the case of 
apology. It would certainly not seem that any 
of the different kinds of correction of deviance 
presuppose the presence of self-evaluation. 
Corrections of deviance should not, therefore, 
be necessarily taken as indicating the operation 
of cognitive resources of moral judgment. 
They should instead by viewed as ambiguous 
in their implications for moral orientation. 
Apparently, they can be used rather mechani- 
cally, either because they have become instru- 
mental in reducing what are simply unpleasant 
feelings or perhaps because they have been 
successful in avoiding anticipated external 
punishment. 

The tendency for self-criticism to be absent 
more often than present is less consistent as a 
context for the various forms of external 
resolution. In the case of each form, however, 
it is absent at least as often as, if not more 
often than, it is present. The substantial 
extent to which each of the different forms of 
external resolution is accompanied by self- 
criticism is a finding of special interest. There 
have been some attempts (Allinsmith, 1960; 
Aronfreed, 1960; Flugel, 1947) to conceptualize 
moral responses that are highly externally 
defined in such a way that self-evaluation is 
still a condition of their occurrence. The 
concept of “defense against guilt,” for example, 
specifies that “externalized” moral responses 
defend the person against the painful experi- 
ence of self-criticism which is presumed to be 
always present following a transgression, but 
may somehow not be given conscious expres- 
Sion. The patterning of moral responses does 
not, however, support the view that the 
different forms of external resolution are 
merely secondary elaborations whose function 
is to avoid self-criticism. It would appear, 
rather, that self-criticism and external resolu- 
tion are parallel and complementary conse- 
quences of transgression, and that neither one 
precludes or rests upon the presence of the 
other. There is no reason to assume that all 
moral responses are to be interpreted as trans- 
formations of the experience of guilt. 


Corrections of Deviance 


Corrections of deviance are clearly indis- 
pensable to almost all of the children and are 
more characteristic of their behavior than any 
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other type of moral response. They occur in 
almost 80% of the stories. Confession and 
reparation are decidedly the most common 
forms. Either of these kinds of moral reaction 
would, in itself, come close to accounting for 
the number of subjects who show correction 
of deviance in any form. It is also interesting 
to note that over 40% of the children give no 
indication at all of the modification of future 
behavior following a transgression. 

The various levels of degree of activity in 
self-correction are each manifested by very 
large proportions of the total group of children 
and are fairly well distributed throughout the 
stories. There is discernible, nevertheless, 
some tendency for moral responses showing a 
self-defined and active acceptance of responsi- 
bility to be less typical of the sample as a 
whole. Such responses occur in only about 
one-fourth of the stories and do not appear at 
all for over 25% of the subjects. The entire 
pattern of findings on this index points to the 
heavy situational determination of corrections 
of deviance and to their very uncertain value 
as indicators of moral autonomy. 


Externally Defined Responses 


External resolution, like correction of 
deviance, is a component of moral responses 
for virtually every child. Its various forms, 
while not as pervasive as corrections of 
deviance, are present in almost half of the 
stories. Discovery of the transgression and 
punishment by others is more common than 
unpleasant fortuitous consequences or à focus 
on external responsibility, but all of the forms 
are substantially present. What cannot be 
conveyed in the formal presentation of re- 
sponses is the frequency with which external 
resolution is brought about through the most 
devious and artificial means, particularly in 
view of the fact that the story beginnings make 
it explicit that only the central figure knows of 
the transgression. Just as in actual life situa- 
tions, however, there generally remains in the 
stories some minimal possibility of external 
consequences. This possibility is commonly 
seized upon and exploited in those instances 
where external resolution occurs. Apparently, 
this kind of reaction is sometimes an indis- 
pensable part of the child’s equipment for 
responding to a transgression. 
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The importance of external events in 
defining the consequences of transgression is 
further attested by the extent to which even 
moral actions carried out by the central figure, 
such as corrections of deviance, show an 
external orientation. The initiation of moral 
responses through the influence or demands of 
other people, the public display of some of 
these responses, and the use of external 
resources in carrying out reparation each 
occur in the stories of a sizable percentage of 
subjects. Over three-fourths of the subjects 
show evidence of some form of reliance on an 
external social context in the initiation and 
performance of moral actions. 


Nature of Reactions to Transgression 


The findings of the survey indicate that 
self-evaluation of the kind implied in the 
concept of guilt is not a prerequisite of interna- 
lized responses to transgression and that such 
responses frequently take a variety of forms in 
which the consequences of transgression are 
perceived as externally defined. Such a con- 
clusion does not deny that cognitive resources 
of moral judgment are an important aspect of 
morality and have special implications for 
social behavior. But the use of such resources, 
however salient it may seem in the behavior of 
some individuals, would appear to constitute 
only a special case of the generalized dimen- 
sions needed to describe moral responses. The 
intimate dependence of so many moral re- 
actions on external events is, in some sense, 
just what one might expect. It would be 
strange, indeed, if moral responses assumed 
Properties entirely divorced from the external 
socializing situations in which they were 
originally learned. 

If self-criticism is not to be given a funda- 
mental status in moral behavior, but is instead 
to be viewed as only one of a set of learned 
moral reactions, we are left with the problem 
of how to conceptualize the basic nature of 
internalized responses to transgression. To 
resolve the problem in a way that takes account 
of the variety and patterning of these re- 
sponses, we might specify that a behavior be 
defined as a transgression for an individual to 
the extent that it has been followed, in his 
experience, by negative (punitive) external 
sanctions. Accordingly, the negative affect 
that attaches to the behavior through social 
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punishment would be considered, when it 
became independent of such punishment, the 
primary and invariant component of internali- 
zation.* This state of negative affect may be 
regarded as endowed with motivating proper- 
ties and as being reducible through a number 
of different kinds of responses. The responses 
acquire their instrumental value because they 
reproduce significant cues in the original 
socializing situations. These cues are, in fact, 
aspects of the behavior of the child himself or 
of the socializing agents which were frequently 
associated with the termination, avoidance, or 
minimizing of punishment. The child’s criti- 
cism of his own actions or feelings is one 
example of such an instrumental response. Its 
use is probably determined by the extent to 
which the patterns of social reinforcement that 
he has experienced around transgressions have 
been characterized by the explicit verbalization 
of standards and their application to his 
behavior. When a child’s punishment incor- 
porates verbal criticism, the criticism itself 
will frequently come to signify the termination 
of the painful feelings that accompany its 
anticipation. The child can then later himself 
terminate such feelings by reproducing the 
criticism in his own behavior.’ The same kind 
of instrumental function can be ascribed to 
corrections of deviance and to responses in 
which punishment is perceived in the actions of 


* It has already been asserted that this affect should 
not in itself be referred to as guilt, since it is not neces- 
sarily accompanied by the cognitive phenomena 
usually associated with that term. To refer to the 
affect as anxiety seems more warranted, though such 
a general characterization possibly obscures a variety 
of qualities of experience dependent on the specific 
nature of the punishment from which they derive. 

The term punishment is not used here in any Te- 
Stricted sense, but rather to convey the entire range of 
reinforcements that may induce painful feelings in 4 
person. In this broad sense, the application of negative 
sanctions may take place through the most subtle and 
indirect kinds of interaction between people (as when 
for example, a mother unintentionally withdraws from 
a close physical contact with her child because of some 
thing that he is doing). E. 

*Sears, Maccoby, and Levin (1957), and Whiting 
and Child (1953), provide an interesting contrast tO 
this interpretation of the role of self-criticism. They 
suggest that the child reproduces the critical applica" 
tion of its parents’ standards to its own behavior, not 
because the criticism is associated with the termination 
of negative affect, but rather because there is à gen- 
eralized positive affect attached to parental presence 
and actions. 


* 


X 
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TABLE 3 
FREQUENCY BY SOCIAL Crass AND SEX OF CHILDREN 
Scorrnc Hicu IN EAcH CATEGORY OF 
CLASSIFICATION OF MORAL RESPONSES 


Middle class} Working 
Category of moral responses. |——|— 
Boys | Girls | Boys | Girls 
N= 34 28 27 33 
Self-criticism 16 | 18 | 6 | 13 
Confession 15 | 19 | 10 | it 
Apology 12 | 16 | 9 | 18 
Reparation 18 | 20 | 14 | 16 
Modification of future behavior 19 | 18 | 15 | 22 
Degree of activity in self-correction | 19 | 16 | 13 | 17 
Discovery and punishment 13 9 9 20 
Unpleasant fortuitous consequences | 13 | 7 | 18 | 24 
Focus on external responsibility 12 | 17 | 18 | 29 
External initiation 9|125| 9 | 23 
Display 13 | 18 | 7 | 15 
Use of external resources 10 14 10 11 


Note.—A subject’s score on any category is the number of in- 
stances of that category occurring across the five story comple- 
tions, where each category may be counted only once in a given 
story. The single exception is in the case of degree of activity in 
self-correction, where a subject's score is the summation of values 
assigned to the total pattern of moral responses in each of the 
five stories (see Method section). 


other people or in impersonal fortuitous events. 
All of these functionally equivalent responses 
might be referred to as moral consequences in 
Order to emphasize that externally oriented 
responses, and those in which moral judgment 
is not apparent, have important attributes in 
common with responses which clearly exhibit 
self-evaluation. 


Social Patterning of Moral Consequences 


Table 3 shows the frequencies with which 
children in each of the four subgroups defined 
by social class and sex received high scores on 
the various subcategories in the classification 
of moral responses.* High and low scores were 
determined, independently for each sub- 
category, by splitting the range of scores for 
the subcategory into the two segments that 
would most nearly equally divide the entire 
sample of 122 subjects. Chi square values for 


* Social class and sex comparisons are not shown for 
the cumulative frequencies in the broader response 
types (major category headings). Significant variations 
in these frequencies could easily be generated by the 
response distributions in certain specific subcategories. 
Thus, they could not be interpreted as providing any 
information beyond that already conveyed in the sub- 
categories, and would actually tend to inflate the 
apparent number of significant differences. 


each of the possible social class and sex com- 
parisons are presented in Table 4. . 

Inspection of Tables 3 and 4 reveals a series 
of significant differences which are, almost 
without exception, those anticipated. The 
most consistent differences are found among 
the aspects of moral consequences that are the 
most unequivocal indicators of moral orienta- 
tion. Thus, middle-class children, regardless of 
their sex, show more evidence of self-criticism 
than do working-class children and are con- 
siderably less likely to resolve transgressions 
through the perception of unpleasant fortuitous 
consequences or a focus on external responsi- 
bility. Similarly, the boys, irrespective of their 
socioeconomic status, do not emphasize 
external responsibility as frequently as do the 
girls and appear much less dependent on an 
external initiation of their own moral actions. 
The tendency of girls to publicly display their 
moral reactions more often than boys is also 
fairly uniform, though it does not attain 
statistical significance in the working class 
group. 

The appearance of consistent social class 
differences’ across both sexes, and of sex 
differences not confined to a particular class 
level, suggests that each of these two dimen- 
sions of social position has a general and 
independent significance for the child’s moral 
behavior. At the same time, the marked 
parallelism of the moral orientations associated 
with the two kinds of status distinction 
supports the view that their similar effects are 
mediated through common elements of social 
reinforcement defining the extent of oppor- 
tunity to evaluate and determine one’s own 
actions or, conversely, the extent of one’s 
dependence on the external environment. The 
common effects are especially striking when 
both are exhibited in the same form of moral 
response, as happens in the tendency to focus 
on external responsibility. 

It can be seen that the parallel variations in 
moral orientation associated with social class 
and sex role are not identical in the specific 
forms that they assume. Social class differences 
appear to center on the distinction between 
moral consequences defined primarily in terms 
of the child’s own actions and those defined 
primarily in terms of external events. Sex 
differences, in contrast, seem to reflect more 
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p TABLE 4 
Cur SQUARE VALUES FOR SOCIAL CLASS AND SEX COMPARISONS OF CHILDREN SCORING HIGH IN 
Each CATEGORY OF CLASSIFICATION OF MORAL RESPONSES 
Middle class vs. Working class Boys vs. Girls 
Category of moral responses a " 

Boys Girls | Total | Middle | Working | Tota 
Self-criticis: 3.02* 2.83* 5:757% 1:21 1.31 2.14 
pu 0.09 5.91** | 4.08* 2.60 0.00 0.53 
Apology 0.01 0.00 0.03 2.14 1.91 4.77* 
Reparation 0.03 2.42 1.15 0.14 0.00 0.33 
Modification of future behavior 0.06 0.01 0.00 0.17 0.38 0.86 
Degree of activity in self-correction 0.12 0.03 0.28 0.02 0.00 0.00 
Discovery and punishment 0.02 3.85* 1.57 0.05 3.40* 1:21 
Unpleasant fortuitous consequences 3.80* | 11.96** | 15.90** | 0.70 0.05 0.03 
Focus on external responsibility 4,74* 4.65* | 12.93** | 3.03* 2.79* 7.85 
External initiation 0.09 1.06 2.07 3.68* 6.50** | 11.92** 
Display 0.55 0.14 1.70 3.19* 1.67 4.80* 
Use of external resources 0.13 1.12 0.06 1.94 0.00 0.56 


Note.—Chi square values for 2 X 2 contingency tables (employing correction for continuity) based on frequencies in "Table 3. 


* p < .05, one-tailed test. 
** p < 01, one-tailed test. 


of the variability in the extent to which moral 
consequences occurring in the child's own 
actions are nevertheless dependent on the 
support of the external environment. It seems 
hardly surprising that the particular forms of 
moral response that vary with the two indices 
of status are not the same. Social class and 
sex role carry great differences in social 
definition as well as the relevant similarities 
pointed out here. These differences would 
undoubtedly influence the specific way in 
which a child's moral orientation was ex- 
pressed. It should also be noted that many of 
the social class and sex differences are not 
extreme even when they are statistically 
significant. The moderate degrees of difference 
may simply reflect the fact that the social 
experiences which affect the child's moral 
orientation are not, in our society, completely 
distinct with respect to variations in either 
socioeconomic status or sex role. 

The least evidence of significant effects of 
social position occurs among the various forms 
of correction of deviance. The absence of the 
marked associations with status found among 
other aspects of moral consequences accords 
with the view that corrections of deviance are, 
in themselves, ambiguous with respect to 
moral orientation. Although they represent 
the individual's own actions in defining the 
consequences of transgression, they are fre- 
quently highly influenced by external events, 


In fact, a central distinction between the 
moral actions of boys and girls, as has already 
been pointed out, lies in the extent to which 
they are externally initiated and displayed to 
others. 

The social class differences include, in addi- 
tion to those consistent across both sexes, 
some that are restricted only to girls. Middle- 
class girls show more evidence of confession 
and working-class girls more frequently 
perceive discovery and punishment by others 
as defining the consequences of their actions. 
The entire pattern of differences between the 
two classes provides an instructive contrast 
with the findings of two other surveys of 
children’s reactions to transgressions that 
failed to reveal significant social class varia- 
tions. One of these (Allinsmith, 1960) used a 
story completion device, similar to the one 
employed in the present study, with junior 
high school children. Many specific reactions, 
including self-criticism, corrections of deviance, 
and those characterized here as external 
resolution, were combined to derive a single 
broad evaluation of “intensity of guilt.” In the 
other survey (Sears et al., 1957), a general 
rating of the extent of development of con- 
science in kindergarten children was based on 
mothers’ reports of a wide variety of overt 
behavioral manifestations of guilt. It would 
seem that the use of a concept like internal vs. 
external orientation to treat different kinds of 
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moral response as distinct forms of behavior 
introduces a meaningful social patterning into 


their variety. Such a patterning is not apparent . 


when all internalized responses to transgression 
are presumed to be equivalent reflections of 
some underlying unitary phenomenon such as 
guilt or conscience. 

That girls rely more than boys on an ex- 
ternal definition of moral consequences is a 
finding of considerable interest since a seem- 
ingly contradictory assertion is commonly 
made that girls, in our society, show greater 
conscientiousness and moral sensitivity. Per- 
haps. it should be emphasized that the sex 
differences in moral reactions reported here 
represent variations, not in the quality or 
breadth of morality, but rather in the pattern- 
ing of specific forms of response all of which 
are defined as internalized in that they occur 
in the absence of any explicit external sanc- 
tions. The fact that the external orientation of 
girls is localized to a considerable extent in 
their use of external resources to support their 
own moral actions may be germane to resolv- 
ing an apparent discrepancy with the findings 
of one other study reporting sex differences in 
responses to transgression. 

Sears et al. (1957) reported that, among 
the roughly 20% of their 5-6 year old children 
showing evidence of “highly developed con- 
science,” there was a slightly but significantly 
higher percentage of girls. It seems quite 
possible, however, that their use of the 
mother’s observations to assess the child’s 
moral reactions tended to put a premium on 
the kinds of response that would be easily 
discernible to the mother and perhaps even 
oriented toward her approval. Expressions of 
guilt were, in fact, defined in their study so 
as to include not only overt self-criticism and 
attempts at restitution, but also other be- 
havioral manifestations that seem more 
externally oriented (for example, asking for 
forgiveness, looking “sheepish,” seeking 
punishment, or acting in such a way as to 
invite inquiry or discovery). It is also note- 
worthy that one of their most heavily used 
indices of conscience was confession. Responses 
Such as confession and apology may be re- 
garded as somewhat externally oriented 
corrections of deviance since, even though they 
represent the individual's own moral actions, 
they do require the presence of an external 


figure. The response distributions found in the 
present study indicate, for example, that the 
more frequent use of confession among middle- 
class children than among working-class 
children is restricted to girls. Indeed, the 
tendency of the girls in the middle class to 
employ confession more often than the boys 
also comes close to attaining statistical 
significance. In the combined responses for 
both social classes, girls likewise show signifi- 
cantly more evidence of apology than do boys. 
If the moral responses of boys are somewhat 
less manifest to mothers than those of girls, 
the apparent difference might conceivably 
indicate the boys’ greater independence of the 
mother as an external reinforcer rather than 
an absence of internalized reactions to trans- 
gression. 

From the point of view of parents and other 
socializing agents, the visibility of moral re- 
sponses may be considerably enhanced by the 
child’s use of overt self-criticism and appropri- 
ate verbalization of standards. These particu- 
lar forms of response might in turn be con- 
ditioned by differences in the quality and rate 
of development of verbal facility. The fact 
that girls may show in their moral behavior 
an observable conscientiousness that easily 
flows into verbal expression is certainly one 
kind of evidence of internalization. The find- 
ings of the present study suggest, however, 
that morality is not monolithic. The con- 
scientiousness of girls need not obviate the 
possibility that their moral responses are also 
dependent on their external situation in a 
way that conforms to the more general charac- 
teristics of their feminine role. Terman and 
his co-workers (1925) found, for example, in 
their studies of gifted and control children, 
that boys did not score as well as girls on five 
of seven character tests based essentially on 
verbalized knowledge of moral standards. 
But the girls were somewhat poorer on actual 
performance tests of honesty (when it appeared 
that their actions would not be known to 
others). Similarly, Hartshorne and May (1928) 
observed that boys were more resistant to 
temptation than girls on performance tests of 
“good” and “bad” behavior and were more 
consistent in their actions from one test to 
another. They concluded that differences in 
moral reputation favorable to girls were 
discrepant with differences in performance, 
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Perhaps Terman and Tyler (1954) are cor- 
rect in asserting, in their summary of sex 
differences, that the moral opinions expressed 
by girls result in their being inaccurately 
credited with superior moral qualities in their 
behavior. 

Finally, it should be pointed out that the 
nature of the associations between social 
positions and moral responses does not easily 
lend itself to the view, suggested by Piaget’s 
(1948) interpretation of moral development, 
that different types of moral orientation se- 
quentially emerge with advancing age or 
experience. It would seem more appropriate 
that internal and external orientation in moral 
behavior be understood as relatively stable 
end-results of different patterns of social 
reinforcement. Such an understanding would 
not be inconsistent with other recent studies 
(Boehm, 1957; Durkin, 1959; Kohlberg, 1958; 
MacRae, 1954; Peel, 1959) in which, as in 
Piaget’s investigations, attention was de- 
voted primarily to moral judgment. These 
studies, when taken together as a group, sug- 
gest that the child’s moral orientation, even 
as reflected only in cognitive resources, is 
more closely associated with social status and 
other indicators of cultural expectation than 
it is with age. 


Intelligence 


Tn order to answer the question of whether 
intelligence could be a factor in accounting 
for either the social class or sex differences, 
the total sample was divided into two equal 
groups on the basis of the verbal intelligence 
test on which IQ scores for all but three of the 
children were available. One group consisted 
of children whose scores were less than 110. 
In the other group were children whose scores 
were 110 or greater, These two groups were 
compared with respect to their frequencies of 
high and low scores on each category in the 
classification of moral responses. The compari- 
sons indicated no relationship whatsoever be- 
tween verbal IQ and any of the indices of moral 
consequences. 


Maternal Discipline 
Table 5 shows, for children in each of the 


four groups defined by social class and sex, 
the frequencies with which the disciplinary 
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techniques predominantly used by their 
mothers fall into the induction or sensitiza- 
tion categories. The comparisons in Table 6 
indicate that, with children of both sexes, 
middle-class mothers use more induction 
techniques and lower-class mothers use more 
sensitization techniques. There are no signifi- 
cant differences in the types of discipline used 
with the two sexes. The social class differences 
are in agreement with the findings of other 
studies in which techniques respectively 
subsumed here under induction and sensitiza- 
tion were related in the same direction to the 
mother's socioeconomic status. 

The frequencies of high and low scores on 
three indices of moral consequences, among 
children whose mothers use predominantly 
induction or sensitization techniques, are 
presented in Table 7. Only those categories 
of moral consequences are shown for which the 
associations with maternal discipline are statis- 
tically significant when taken over the entire 
sample. There is only one instance in which a 


TABLE 5 
FREQUENCY OF PREDOMINANT USE OF INDUCTION OR 
SENSITIZATION TECHNIQUES BY Mormaers oF CHIL- 
DREN IN EacH SociaL Crass AND SEx GROUP 


Sa Ne Middle class | Working class 

iominant 

discipline —— — 
Boys | Girls | Boys | Girls 

Induction 21 20 9 | 4 

Sensitization 13 7 | 18 | 18 


Note.—Total N = 120, Adequate interview data on discipline 
not available for two girls. 


TABLE 6 
Cut Square VALUES FOR SOCIAL Crass AND SEX OF 
CHILD COMPARISONS OF PREDOMINANT 
MATERNAL DISCIPLINE 
(Induction vs. Sensitization) 


Comparison Chi square value 

Middle class vs. Working class E 

Boys 3.80 

Girls 4.34* 

Both sexes 8.50** 
Boys vs. Girls 

Middle class 0.55 

Working class 0.30 

Both classes 0.55 


Note.— Chi square values for 2 X 2 contingency tables (employ- 
ing correction for continuity) based on frequencies in Table 5. 
* p < .05, one-tailed test. 
** p < 01, one-tailed test. 
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TABLE 7 
Comparison OF SCORES ON CATEGORIES OF Morat RESPONSES OF CHILDREN WHOSE Moruers UsE 
PREDOMINANTLY EACH OF THE Two Tyres or MATERNAL DISCIPLINE 


(Induction vs. Sensitization) 
(I = Induction; S = Sensitization) 


Middle class 


Category of moral responses 
T S|[»?|I 


* 


Reparation 
High 30| 8 14 
Low 11| 12 9 
Degree of activity in self-correction 3 
High 27| 7 17 
Low 14 | 13 6 
Unpleasant fortuitous consequences " 
High 10 | 10 12 
Low 31 | 10 11 


Working class Boys Girls Total sample 

S pog | i Sans rem sie). S | 
ns ns * Rp poe + 

16 19 | 13 25 | 11 44 | 24 

20 11 | 18 9) 14 20 | 32 
** ** * ** 

13 21| 11 23| 9 44 | 20 

23 9 | 20 11 | 16 20 | 36 
* * ** ** 

29 11 | 20 11 | 19 22 | 39 

7 19 | 11 23| 6 42|17 | 


Note.—Only those categories of moral responses are shown for which the association with maternal discipline was statistically sig- 
nificant when taken over the total sample (see Results and Discussion). 
a p values for chi square tests (employing correction for continuity). 


* p < .05, one-tailed test. 
** p < 01, one-tailed test. 


significant variation limited to a single sub- 
group is eliminated from presentation by this 
criterion—girls whose mothers use predomi- 
nantly sensitization techniques are more 
likely to focus on external responsibility than 
girls whose mothers use induction techniques. 
The association of the children's moral re- 
sponses with maternal discipline, while less 
striking and extensive than their association 
with social position, is generally consistent 
and in the expected direction. The response of 
reparation and a high degree of self-initiated 
acceptance of responsibility are more charac- 
teristic of children whose mothers use induc- 
tion techniques. Children whose mothers use 
sensitization techniques are more likely to 
perceive external consequences of transgres- 
sion in the form of unpleasant fortuitous 
events. 

The patterning of moral orientation with 
respect to maternal discipline tends to be 
consistent across the various social class and 
sex subgroups, thus indicating that it cannot 
be regarded as merely coincidental to the 
relationships between social position and moral 
response. Actually, neither the use of repara- 
tion nor the degree of activity in correction of 
deviance is directly related to the social class 
or sex of the child, despite the significant 
association between social class and discipline. 
These findings, when considered together with 
the fact that many of the aspects of moral 


response related to social position do not vary 
with maternal discipline, may indicate that 
the cataloguing of specific methods of punish- 
ment captures only a very partial segment of 
the total pattern of social reinforcement bear- 
ing upon the child’s moral orientation. The 
crucial elements of the pattern may conceiv- 
ably lie, for example, in the nature of certain 
response cues in the behavior of either the 
child or the socializing agent, or in the place 
of these cues in the sequence of onset and 
termination of punishment. An understanding 
of how these elements of social reinforcement 
affect the child's moral responses may require 
discriminations that are beyond the parent's 
capacity as a retrospective observer and 
demand the more exact control that is possible 
with experimental methods. 


SuMMARY AND CONCLUSIONS 


Contemporary treatments of moral develop- 
ment have characteristically given a central 
role to moral judgment and have taken the 
view that a moral response is one that is 
based on the individual's evaluation of be- 
havior in terms of his own explicit standards 
and that no longer requires the support of 
external norms and sanctions. A variety of 
research and observation, however, suggests 
that responses to transgression, even when 
internalized, often reveal an absence of self- 
evaluation and an external orientation incon- 


238 


sistent with what one might expect if they 
were entirely internally mediated. The purpose 
of the investigation reported in this paper is 
to survey the nature and variety of children’s 
moral responses to transgression, particularly 
with respect to their internal or external orien- 
tation, and to examine their relationship to the 
child’s positions in society. 

A theoretical framework emphasizing differ- 
ential patterns of social reinforcement was 
used to develop certain parallel implications of 
socioeconomic status and sex role for the 
child’s moral orientation. Higher status posi- 
tions, as located in the middle class and the 
masculine sex role, were viewed as providing 
greater reinforcement of responses charac- 
terized by control over one’s own actions as 
well as over the external environment. Lower 
status positions, as located in the working 
class and the feminine sex role, were considered 
more likely to foster responses characterized 
by the perception of one’s actions, and their 
reinforcements, as externally determined. 
These variations in response tendencies were 
expected to have significance for the child’s 
reactions to transgression. Parental discipline 
was singled out as one of the means by which 
various moral responses might be inculcated 
in the child. A distinction was made between 
two major types of discipline. Induction tech- 
niques subsumed methods that should tend 
to induce reactions to transgression that could 
readily become independent of their original 
external stimulus sources. Sensitization tech- 
niques were those that should merely sensitize 
the child to the painful external consequences 
of transgression. 

In order to circumscribe a wide variety of 
moral responses, a highly structured projective 
story completion device was constructed and 
administered to 122 sixth grade children 
equally representing the two major socio- 
economic classes and both sexes. In each 
story, the central figure committed, with 
minimal or no justification, a socially pro- 
hibited act of aggression. The responses to 
transgression occurring in the story comple- 
tions, which could be classified with high 


reliability, were assumed to reflect the sub- 


jects’ own strongest response tendencies, The 
methods of discipline applied to the children, 
as reported in questionnaire interviews with 
their mothers, were also reliably classified, 
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and the mothers divided into those whose 
disciplinary techniques were predominantly 
of the induction or of the sensitization type. 

Self-evaluation was found to play only a 
restricted role in internalized responses to 
transgression; many forms of response do not 
seem to require the use of cognitive resources 
of moral judgment. The most common kind 
of moral response, correction of deviance, 
occurred very often without any evidence of 
self-criticism. 

Various forms of external resolution, in 
which the consequences of transgression were 
perceived in discovery and punishment by 
other people or in unpleasant fortuitous events, 
or in which others were held responsible for 
the transgression or for similar actions, were 
also a pervasive aspect of moral responses. 
The frequent coincidence of such externally 
oriented responses with self-criticism did not 
support the view, expressed in some conceptu- 
alizations, that they are merely “defenses 
against guilt” whose function is to avoid self- 
evaluation. The importance of external cues 
and resources in determining responses to 
transgression was again confirmed by the 
many instances in which moral actions were 
initiated upon the influence or demands of 
others, publicly displayed, or carried out with 
the assistance of external agents. 

The limited usefulness of self-evaluation in 
accounting for internalized moral responses, 
and the extensive evidence of their frequent 
external orientation, suggested a theoretical 
perspective in which self-criticism was not 
viewed as a prerequisite of their occurrence, 
but was regarded instead as only one of a 
variety of responses that might be collectively 
referred to as moral consequences in order to 
emphasize their functional equivalence 1m 
reducing the negative affect invariably asso- 
ciated with transgression. 

The two dimensions of status represented 
in social class and sex role distinctions had the 
expected parallel significance for the child’s 
moral orientation. The middle-class children 
and the boys showed more evidence of in- 
dependence of external events in responding to 
transgression, while the working class children 
and the girls revealed a greater degree of 
external orientation. Social class differences 
were generally consistent across both sexes 
and sex differences were consistent at both 
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social class levels. None of the differences was 

attributable to the variable of intelligence. 

Obvious differences in the behavioral im- 
plications of social class and sex role, despite 
similarities along the status dimension, were 
emphasized by the fact that the parallel varia- 
tions in moral orientation were not identical 
in the specific forms which they assumed. 
The marked social patterning of the moral 
responses was interpreted as supporting the 
view that their different forms should be 
treated as distinct moral phenomena and not 
as equivalent reflections of an underlying 
unitary phenomenon such as “conscience.” 
The relationships between moral responses 
and social positions also suggested that dif- 
ferent moral orientations do not emerge se- 
quentially with advancing age or experience, 
as has been argued in some interpretations 
of moral development, but are rather the 
stable end-results of different patterns of 
social reinforcement. 

The associations of moral responses with 
maternal discipline, while in the expected 
direction, were less extensive than the associa- 
tions with social position, and to the extent 
that retrospective parental report can be 
relied on, it appeared that disciplinary tech- 
niques, in themselves, provided only a partial 
view of the mechanisms through which differ- 
ences in moral orientation were generated. 
Specific methods of punishment may provide 
only a gross picture of the total pattern of 
social reinforcement affecting the child’s moral 
orientation, and experimental methods may 
be necessary in order to make more refined 
discriminations. 
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FIELD DEPENDENCE AND INTELLECTUAL FUNCTIONING! 
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in perception, Witkin, Lewis, Hertzman, 

Machover, Meissner, and Wapner (1954) 
found a number of personality correlates of 
field dependence. Most of their results have 
since been confirmed (e.g., Gruen, 1955; 
Gardner, Holzman, Klein, Linton, & Spence, 
1959). In addition, studies have since been re- 
ported exploring relationships between field 
dependence and many other aspects of indi- 
vidual functioning. Of particular concern here 
are those studies bearing on intellectual func- 
tioning. 

The evidence now available suggests that 
field dependent subjects tend to perform less 
effectively on standard tests of intelligence 
than field independent subjects, Woerner and 
Levine (1950 unpublished) found significant 
relationships between field dependence meas- 
ures and IQ scores on the Wechsler Intelligence 
Scale for Children (WISC) using 12-year-old 
subjects. Similar relationships have since been 
reported by several other investigators using a 
variety of intelligence tests and subject popula- 
tions. 

The present study represents an attempt to 
examine the nature of this relationship through 
factor analyses of intercorrelations among tests 
of intelligence and field dependence. 

The dimension of field dependence has been 
defined by Witkin and his colleagues (1954) in 
terms of the capacity to overcome embedding 
contexts in perception. Subjects who easily 
“break up” an organized perceptual field— 
who can readily separate an item from its con- 
text—are called field independent; subjects 
who readily accept the prevailing field or con- 
text—who have difficulty in separating an item 
from its context—are called field dependent. 


Į: their early work on individual differences 


1 This study was supported in part by a research 
grant (M-628) from the National Institutes of Health, 
United States Public Health Service. H. A. Witkin 
is the principal investigator for this grant. Equipment 
used in the studies to be reported was made available 
by the Office of Naval Research, equipment-loan con- 
tract Nonr-1175 (00). 

The authors are deeply indebted to B. Goodman, 
C. Johnson, S. Nathan, F. Plumeau, and P. Plumeau 
for their assistance in carrying out these studies. 
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Commenting on the findings of Woerner and 
Levine (1950 unpublished), Witkin et al. sug- 
gest that some intellectual tasks may also re- 
quire the capacity for overcoming embedding 
contexts. They suggest that the relationship 
between IQ scores and field dependence meas- 
ures may be a function of this common re- 
quirement. 

On the basis of this interpretation, we would 
expect that some WISC subtests reflect the 
capacity for overcoming embeddedness. Such 
subtests should be highly related and fac- 
torialy similar to measures of field depend- 
ence. à 

It may be noted that the various Wechsler 
scales (Bellevue, WAIS, and WISC) have 
been studied extensively by factor analysis 
(see Cohen, 1957, 1959, for example). Most of 
these analyses are in agreement in identifying 
three major factors. 

The first of these factors, most often defined 
by the Information, Comprehension, Similari- 
ties, and Vocabulary subtests, has usually 
been labeled “Verbal Comprehension.” 

A second factor, frequently defined by the 
Arithmetic and Digit Span subtests, has been 
called “Memory,” “Freedom from Distracti- 
bility,” “Attention-concentration,” or “Con- 
centration-speed.” 

A third factor, variously labeled “Closure,” 
“Performance,” “Spatial-perceptual,” “Non- 
verbal Organization,” “Visualization,” and 
“Perceptual Speed” has usually been defined 
by the Block Design and Object Assembly 
subtests, with Picture Completion also in- 
cluded in some studies. This third factor is 
particularly relevant to the Witkin et al. 
(1954) hypothesis. 

The Block Design, Object Assembly, and 
Picture Completion subtests, which define the 
Closure factor, appear to involve the capacity 
to overcome embeddedness, In the Block 
Design subtest the subject is shown a reference 
design that he must copy by the appropriate 
arrangement of blocks. The reference design 
forms an organized whole that must be 
“broken up” for effective performance. Object 
Assembly performance involves putting to- 
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gether a jigsaw-like series of pieces to form a 
meaningful whole. For some items the subject 
is told what the whole object is and for others, 
identification of the object is fairly obvious. 
Too rigid adherence to a conceptualized image 
of the solution may impair performance on this 
task, particularly where the parts given the 
subject do not readily correspond to units mak- 
ing up his image of the whole, In the Picture 
Completion subtest the subject is shown a pic- 
ture that has some part missing. The whole 
must be analyzed into its components in order 
to locate the missing part. 

If this analysis of the subtests defining the 
Closure factor is correct, and if the relation- 
ship between field dependence and IQ is a 
function of a common cognitive style, i.e., the 
capacity to overcome embeddedness, it should 
be possible to demonstrate that tests of field 
dependence and Wechsler Closure subtests 
define a single, common factor. 


METHOD 


The study reported here is part of a larger investi- 
gation (Witkin, Dyk, Faterson, Goodenough, & Karp, 
in press) in which individual differences in psycho- 
logical differentiation are being explored. As part of 
that investigation, a number of cognitive tests, in- 
cluding the WISC and standard tests of field depend- 
ence, were administered to two groups of children. 
Group A consisted of 25 boys and 25 girls between 11.5 
and 12.5 years of age. Group B included 30 boys be- 
tween 9.5 and 10.5 years. Subjects of both groups were 
volunteers, with parental consent, drawn from a public 
school in Brooklyn, New York. 

Measures of field dependence were obtained from 
three perceptual situations, the Embedded Figures 
Test, the Rod-and-Frame Test, and the Tilting-Room- 
Tilting-Chair Test (Witkin et al., 1954). 

In the Embedded Figures Test (EFT) the subject 
is asked to locate a simple figure embedded in a complex 
geometric form. His score is the mean time taken to 
locate the simple figures for 24 test items. 

In the Rod-and-Frame Test (RFT) the subject is 
seated in a lightproof room, facing a luminous rod 
surrounded by a luminous frame. With frame and rod 
tilted, he is asked to adjust the rod to the objective 
upright. The RFT is divided into three series: where 
the subject's chair is tilted opposite to the tilt of the 
frame; where the subject's chair is tilted to the same 
side as the frame; where the subject is seated erect. 
"The score for each series is the mean number of degrees 
of deviation of the rod from the upright when the 
subject sees it is straight. 

In the tilting-room-tilting-chair situation, the 
subject is seated in a chair within a specially con- 
structed room. The room and chair can be tilted in- 
dependently to the right or left. At the beginning of 
each of the 14 trials, the room and chair are tilted. For 


the first 8 trials, called the Room Adjustment Test 
(RAT), the subject is asked to adjust the tilted room 
to the upright, while the chair remains tilted. The 
remaining trials, called the Body Adjustment Test 
(BAT), involve adjustment of the chair (and thus the 
subject's body) to the upright while the room remains 
tilted. Both RAT and BAT are divided into two series, 
Series 1 of each test including trials where room and 
chair are initially tilted to opposite sides and Series 2 
where they are tilted to the same side. For each series 
of each test the subject's score is the mean deviation, 
in degrees, of his estimates from the true upright. 

The EFT, RFT, and BAT each require the subject 
to separate an item (whether a simple figure, a rod, or 
the body) from an embedding field and have been used 
to define perceptual field dependence. Witkin et al. 
(1954) call these “part-of-a-field” tests. In contrast, 
the RAT has been called a “field-as-a-whole” test and 
apparently does not directly involve the capacity to 
overcome an embedding context. 

In all, the correlation matrix for Group A involved 
21 variables, 12 WISC subtests, 3 series of the RFT, 2 
series each for the BAT and RAT, EFT, and Sex. 

Each of the 10-year-old Group B boys received the 
WISC and field dependence tests just described. 
However, Series 1 and 2 of the RFT were combined 
with equal weight and treated as a single variable. BAT 
and RAT series were similarly combined, yielding one 
score for each test. These changes were made in the 
interest of economy, after inspection of the Group A 
results suggested similar patterns of correlation for 
the several series of each test. 

In addition to these variables, data from a number 
of other cognitive tests were available for almost all of 
the Group B subjects. These tests had been adminis- 
tered as part of the larger study from which all of the 
data reported here were taken. The additional cognitive 
variables seemed sufficiently interesting and relevant 
to warrant their inclusion in the factor analysis despite 
the problem created by the loss of one or two subjects 
on each test. 

1. The Children's Embedded Figures Test, devel- 
oped by Goodenough and Eagle (in press), is a modi- 
fied version of the EFT designed for use with chil- 
dren. As in the EFT, the subject is required to locate 
a simple figure embedded in a complex one; however, 
the complex figures are meaningful objects from which 
the simple figures may be removed. The score is the 
number of simple figures correctly identified. 

2. Recognition-efficiency test: nonserial version 
involves exposure of a series of pictures at an out-of- 
focus lens setting, subjects being asked to identify each 
picture. The test is scored for the number of pictures 
correctly identified after a single presentation. 

3. Recognition-efficiency test: serial version, devel- 
oped by Gump (1955), is similar to the nonserial 
version; however, each picture is presented in in- 
creasingly sharp focus until identified correctly. High 
scores reflect early (relatively out-of-focus) recogni- 
tion: 

4. In the Hidden Pictures Test (Thurstone, 1944) 


, * We are indebted to P. V. Gump for the loan of 
his recognition-efficiency test material. 
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the subject is asked to discover familiar objects and 
faces hidden in a complex scene. 

5. The Incidental Learning Test involves a modifi- 
cation of a technique used by Gardner et al. (1959). The 
subject is briefly shown a series of eight common three- 
letter words, each of which is printed in one of four 
colors. During the presentation, he is asked to name 
the color of each word. Following a single exposure of 
the series, the subject is tested for recall of the words 
his score being the number of words recalled. 

6. The Intentional Learning Test uses the same type 
of materials and follows the same procedure as inci- 
dental learning, except that, prior to exposure of the 
list of words, the subject is instructed to name and try 
to remember the words (ignoring the colors). Again, 
the subject is tested for recall of words after one pres- 
entation of the stimulus series, his score being the 
number of words recalled. 

7. Reconciliation of Opposites was patterned after 
the Stanford-Binet subtest of that name with several 
easy items added to make it appropriate for 10-year- 
olds. The subject is read pairs of opposites (e.g., black- 
white) and asked to tell how they are the same or 
similar. Scoring is consistént with criteria used by 
"Terman and Merrill (1937). 

8. The Cancellation Test, patterned after a satiation 
situation, uses a series of four pages filled with lines 
of random letters. The subject is instructed to cross 
out all the *t's" on the page as quickly as he can. 
Reductions in time and errors from the first to fourth 
pages are scored and combined with equal weight to 
provide an improvement score. 

The correlation matrix for the Group B subjects was 
based on 25 variables. 

Two centroid factor analyses (one for each group) 
were carried out. “Blind” rotation to oblique simple 
structure followed the graphic method outlined by 
Cattell (1952). The decisions to terminate factor 
extraction were based on Saunders’ test. 


RESULTS AND DISCUSSION 


Eight factors were extracted from the Group 
A correlation matrix and nine factors from the 
Group B matrix. Despite the small W, the 
hyperplanes seem reasonably well-defined, over 
60% of the factor loadings in each study falling 
between +.10. Rotated loadings for Groups 
A and B are presented in Tables 1 and 2, 
respectively.? 

The first three factors in each study are 
closely matched and apparently correspond to 


*Six tables, including zero-order intercorrelation 
matrices, unrotated factor loadings, and intercorrela- 
tions among factors for Groups A and B, have been 
deposited with the American Documentation Insti- 
tute. Order Document No. 6864 from ADI Auxiliary 
Publications Project, Photoduplication Service, Library 
of Congress; Washington 25, D. C., remitting in ad- 
vance $1.25 for microfilm or $1.25 for photocopies. 
Make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


the three major factors repeatedly isolated in 
previous studies of the Wechsler scales. 


Faclor 1 » 


The first factor, on which the Vocabulary, 
Information, Similarities, Arithmetic, and 
Comprehension subtests of the WISCare loaded 
in both studies, is apparently the verbal com- 
prehension factor isolated in previous studies 
of Wechsler scales. Although the loading of 
Arithmetic on this factor is not frequently re- 
ported, Cohen (1959) obtained this result with 
10.5-year-olds. 

Among the variables included only in the 
Group B study, Reconciliation of Opposites 
and Intentional Learning were loaded on Fac- 
tor 1, a result consistent with the verbal na- 
ture of these tests. Sex is loaded on this factor 
for Group A, with boys displaying greater 
verbal comprehension than girls. This result is 
not consistent with the literature on sex 
differences in verbal comprehension. 


Factor 2 


This factor, loaded in both studies by Digit 
Span, Arithmetic, and Coding subtests of the 
WISC, is apparently the attention-concentra- 
tion or memory factor previously isolated in a 
number of studies, The loading of Coding on 
this factor, though infrequent in earlier studies, 
appears reasonable. Similarly, the loading of 
the Children's EFT in the Group B study 
seems reasonable in view of the apparent atten- 
tion requirement of this test. It is surprising, 
however, that the EFT is not loaded in either 
study. 

In addition to the Children's EFT, four 
other variables included only in the Group B 
study appear to be loaded on Factor 2. Inten- 
tional Learning is loaded positively and non- 
serial Recognition-efficiency, Cancellation, and 
Sex negatively (girls attending more success- 
fully than boys). The latter result appears con- 
sistent with the literature on sex differences in 
attention. It also seems reasonable to expect 
that ability to concentrate would aid perform- 
ance in the Intentional Learning situation. 


Factor 3 

The three tests of perceptual field depend- 
ence, RFT, EFT, and BAT, have their highest 
loadings on this factor as do two other per- 
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TABLE 1 
Group A FACTOR MATRIX AFTER FINAL ROTATION 
Factors 
Test 
1 2 3 4 5 6 7 8 
—06 —08 06 58 —04 34 11 09 
—05 10 —03 67 11 41 19 —10 
—41 —04 39 02 —10 60 09 28 
09 —06 44 08 08 52 —07 05 
00 —05 68 02 09 —03 —11 —10 
—06 09 50 06 34 —26 09 —04 
RFT: Series 3 11 —30 58 —94 02 00 06 26 
EFT —07 10 50 -17 10 —03 —06 49 
Information 65 04 —08 05 —03 —05 06 21 
Comprehension 38 29 —24 —16 21 —03 —03 —10 
Arithmetic 39 4 03 09 —10 —10 —09 14 
Similarities 44 08 03 —44 09 08 08 09 
Vocabulary 63 —03 —06 —09 —07 10 10 —23 
Digit Span —06 62 01 —02 01 04 09 —18 
Picture Completion 35 —39 38 —10 31 05 65 —05 
Picture Arrangement 35 —17 09 —07 58 10 00 -02 
Block Design 07 00 42 01 —09 08 04 54 
Object Assembly -07 —03 67 00 61 02 —05 05 
Coding 04 89 —03 05 —06 07 Kés 09 
Mazes 07 24 57 03 —03 09 05 —06 
Sex 42 —144 05 09 —04 04 —08 04 


Note—In Tables 1 and 2 decimals are omitted; loadings over +.20 are in boldface type; and tests are reflected where neces- 
sary n that positive scores represent greater field independence, greater intelligence, better memory, more efficient recognition, or 
masculinity. 


Y 
TABLE 2 
Group B Facror MATRIX AFTER FINAL ROTATION 
Factors 
Test 

1 2 3 4 5 6 1 8 9 

RAT —59| 09| 37|—o8| 10| 20] -33| 10] 09 
BAT o| 09} 43| 10|— 08 | —02 | —08 
RFT; Subject tilted (Series 1 and 2) -08|-10| a| o| sa| orl of | 10|-0 
RET: Subject erect (Series 3) —05! 03| 69| 34| 10| 23| 03 | —09| -25 
EFT —03| 09| 69] 03|—03| —24| 03] —07| 09 
Information. 6| 10| 0o7| —1:0| 01|—19| 10| 04| 0 
Comprehension 46| 01|—32| 0| o| 02|—01| o1| 6 
irithmetic 53| 46| 06| 00|-30| —03| 08 | —03 | -30 
Similarities 57| 05} 06] 39| o1| 10| —41 | —06 | —05 
Vocabulary 73 | —08 | —08) 06} 06|—10| —10| 18| -0% 
igit Span 25| 53|—06| —02| o1| 06] —07| 00| -34 
Picture Completion -07 | -07 | 52| —09| -46| 08| 10| 08 | -04 
Picture Arrangement —01| —10! 11] 60] —03| —10 | —05 | —o1| 10 
xk D o| 42| 50|—10| —01| —05| —09 | os | —08 

Object Assembly o| 09| 33| 52|-—38| os] o7| O8. 07 
Coding -06| 48|—10| 04|—25| 00|—24| —02 | 29 
ams 00 ) —06| 10|—10| —09| —o8| 32] —02| 45|-05 

Recognition-efficiency: serial 24| 03 | —40 | —03 | —30 | 20] —08| 03] 06 
Recognition-efficiency: nonserial 06 | —52 07 07 | —08 02 00 23| 05 
Cancellation — —07 | —38 | —04| —08 | —09 | —38 | —10 | —09 | 07 
Incidental Learning 10 —08 | 29|—25| o| 0o2|—o3| 22| 19 
Intentional Learning _ 37| 31| 10| —:8| 10] —26 | —23 | —07 | 2 
Reconciliation of Opposites 64 | —02 04 29 08 03 14 | —08 | 0 
Children’s EFT -04| 29| e| or|-o| oo| 33| -17| 00 
Hidden Pictures 10 | —01 | 27| 02|—10| —10 | —37 | —21 | —09 


FIELD DEPENDENCE AND INTELLECTUAL FUNCTIONING 


ceptual tests, the Children’s EFT and Hidden 
Pictures, which are very similar to the EFT. 
Three WISC subtests, Block Design, Picture 
Completion, and Object Assembly are also 
highly loaded on Factor 3. The latter tests 
have often defined a closure factor in earlier 
studies. As suggested above, all of these tests 
may have, as a common requirement, the over- 
coming of an embedding context. 

The absence of RAT from Factor 3 in one 
analysis and its low loading in the other tends 
to confirm Witkin’s distinction between “field- 
as-a-whole" and “part-of-a-field” tests. Ap- 
parently the RAT is different from the other 
perceptual measures, although relationships 
between this test and field dependence meas- 
ures have been obtained in many studies. 

Several other tests appear loaded on Factor 
3 in one or both analyses. These include Com- 
prehension (loaded negatively in both analy- 
Ses), serial Recognition-efficiency and Inci- 
dental Learning (given only to Group B sub- 
jects), and Mazes (loaded for Group A but 
not B). The bases for these loadings are not 
clear at present. 

"The remaining factors are not easily matched 
for the two groups. Nor can they be readily 
identified with factors obtained in earlier 
studies. Some of these factors may be spurious, 
particularly in the Group B study where data 
were missing for some of the tests. The possi- 
bility that failure to match may be a function 
of differences in the variables included in the 
two studies or a function of developmental 
differences should also be considered. In any 
event, after the first three factors, the results 
are not clearly interpretable and do not seem 
immediately relevant to the purposes of the 
present study. 

The hypothesis that there is a factor, com- 
mon to intellectual and perceptual tests and 
involving the capacity to overcome an embed- 
ding context, receives some support from the 
results of these studies. Each of the standard 
tests of perceptual field dependence is highly 
loaded on Factor 3. The three WISC subtests 
Which have defined a closure factor in previous 
Work also appear on Factor 3. Furthermore, 
they are the only WISC subtests substan- 
tially loaded on Factor 3 in both studies. Nor 
are the tests of field dependence consistently 
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or substantially loaded on the verbal compre- 
hension or attention-concentration factors. 
"These results tend to support the Witkin hy- 
pothesis that relationships obtained in many 
studies between tests of field dependence and 
standard tests of intelligence stem, at least in 
part, from common requirements shared by 
measures of field dependence and of certain 
kinds of intellectual abilities. 

These results do not exclude the possibility 
that some intellectual tasks may be related to 
measures of field dependence for other reasons. 
For example, some of the personality corre- 
lates of field dependence may influence per- 
formance on particular types of intellectual 
problems. In fact, the high relationships be- 
tween the Incidental Learning Test and meas- 
ures of field dependence might be interpreted 
on this basis. A detailed discussion of such pos- 
sibilities is presented elsewhere (Witkin et al., 
in press). 


SUMMARY 


The present study was designed to test the 
hypotheses that some intellectual and per- 
ceptual tests have a common requirement for 
overcoming embedding contexts, and that re- 
lationships obtained between measures of field 
dependence and standard tests of intelligence 
are based on this common factor. Two factor 
analyses were conducted on matrices of correla- 
tions between cognitive tests, including tests of 
field dependence and the subtests of the WISC. 
The results tend to support both hypotheses. 
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SOME EFFECTS OF PERCEIVED STATUS ON RESPONSES TO 
INNOVATIVE BEHAVIOR! 


E. P. HOLLANDER? 
School of International Service, American University 


HAT one member of a group may do 
with impunity another may not do. 
This observation fits many circum- 
stances of social interaction and serves as a 
central element in the "idiosyncrasy credit” 
model of perceived status and conformity 
(Hollander, 1958). In terms of the model, evi- 
dences that a person is competent in some focal 
group activity, and that he has conformed to 
applicable group expectancies, result in his 
accumulating credits that he may draw on later 
for innovative behaviors directed at exercising 
influence. Moreover, such accumulations lead 
in time to alterations in expectancies that 
make certain deviations acceptable as a fea- 
ture of higher status. Recently, these relation- 
ships have been studied in an experiment with 
task oriented groups (Hollander, 1960). 
Another illustration of the operational as- 
pects of perceived status is provided by an 
experiment reported in Pepitone (1958, p. 266) 
where subjects were presented with a script of 
a technical discussion and were given varying 
prior sets as to the level of expert qualification 
of one of the two participants; the greater the 
alleged expertness of that person, the more 
favorable were the subjects’ interpretations of 
his negative acts as well as of his positive ones. 
The present experiment follows a simple, 
two-step approach in further studying differ- 
ences in response to behavior as a function of 
the perceived status of the behaving person. 
First, it may be predicted that attributing 
higher degrees of competence to a person 
should result in correspondingly greater will- 
ingness to accept that person’s authority. 
Further, this effect should be enhanced for 
each degree of ascribed competence the longer 


1The research upon which this paper is based was 
completed by the author as principal investigator of 
ONR Contract 816(12) with Washington University. 
The opinions expressed are not to be construed as 
necessarily reflecting those of the Department of the 
Navy. 

2Tt is a pleasure to acknowledge the valuable aid 
of Robert M. Taylor in accomplishing the work asso- 
ciated with this study. 
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the person is known to have belonged to the 
group; all things constant, the equally com- 
petent newcomer to the group should have 
less authority than his counterpart who has 
been there longer. Second, the status thus 
accorded the person described should be in- 
versely related to disapproval of his deviancy, 
i.e., the higher the status, the less the disap- 
proval. This relation should hold especially in 
the case of innovative deviancy. In short, the 
intent is to induce levels of perceived status 
so as to discern their effect upon responses to 
given behaviors. To produce status experi- 
mentally, the first step of this approach makes 
use of the direct “trait description" technique 
employed by Asch (1946), in his study of im- 
pressions of personality, and more recently by 
Bruner, Shapiro, and Tagiuri (1958). 


METHOD 


Subjects, Setting, and Set 


One hundred and fifty-one undergraduate students 
enrolled in lower-level psychology courses at Washing- 
ton University were utilized as subjects. Sixty-four 
were males, and 87 females. All fell within the age 
range 18-23, with a mean about 20. The entire proce- 
dure was conducted in class sections, after an intro- 
ductory statement by the investigator that this was a 
study of attitudes in groups and that responses would 
be anonymous. Forms with precise instructions were 
then distributed. 


Procedure 


The first form instructed subjects to think of a 
group to which they belonged, at the time or before, 
and to imagine in it a person of their own sex who 
was described by a succinct set of terms which fol- 
lowed. Then the form asked: “Knowing this informa- 
tion, how willing would you be to have this person in a 
position of authority in the group?” Responses were 
made on a seven-point scale labeled “definitely not” 
at one end, “very willing” at the other, and “neutral” 
in the middle. 

Each subject received a single description made up 
of four terms, of which two were varied, and two held 
constant, throughout. Competence in the group’s 
activity, arrayed at four degrees, was paired with one 
of two levels of time in group to provide eight descrip- 
tions of the stimulus person within a 2 X 4 design: 
thus either “been in group for some while" or “new to 
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group” appeared in combination with “extremely 
capable performer in group’s activity,” or “capable 
performer in group’s activity,” or “average performer 
in group’s activity,” or “poor performer in group’s 
activity.” The constant terms, “interested” and 
“generally liked,” were used to control for two other 
determinants of status postulated in the model, 
"motivation to belong” and “beta value." Examples 
of the resulting descriptions are given by the two ex- 
tremes below: 


Extremely capable performer in group's activity. 
Been in group for some while. Interested. 
Generally liked. 


Poor performer in group's activity. 
New to group. Interested. 
Generally liked. 


Having made their ratings, subjects were then given 
a second form instructing them to think again of the 
same group with the person of their own sex once again 
described exactly as before. Eight behaviors were pre- 
sented to afford a range of stimuli for responses linked 
to status. The first of these— "suggests changes from 
group plans"—was of special significance in terms of 
the influence aspect of status. The others, in order, were: 
“speaks freely about other group members," “questions 
the views of other group members,” “performs tasks 
independently,” “interrupts to express comments at 
group meeting,” “discusses group concerns with out- 
siders,” “makes own decisions on attending group 
functions,” and "casual in relations with other group 
members.” All were selected as illustrative of intra- 
group behaviors that might be variously interpreted 
and were based on responses to an open-ended question 
from a similar but independent group of subjects, In 
this second step, subjects indicated for each of these 
eight behaviors whether their evaluation of the stim- 
ulus person would go up or go down, and by what per- 
centage weight, if he displayed it. 

Scores for accorded status were readily derived 
from the rating responses made to the question con- 
cerning willingness to have the stimulus person in a 
Position of authority. These were scaled from 7 for 
“very willing” to 1 for “definitely not” with 4 assigned 
to “neutral.” The distribution of these scores was in- 
clined toward the upper values with a mean of 45, a 
median and mode of 5, and a standard deviation of 1,6. 


RESULTS AND DISCUSSION 


Table 1 presents the means for the accorded 
Status score by experimental treatments, the 
Ns for which varied from 17 to 22. In both 
columns the values proceed Steadily upward in 
line with the rising degrees of competence 
ascribed to the stimulus person. At each de- 
gree, those said to bein the group longer receive 
a higher mean score on this status measure, An 
analysis of variance completed for these data 

*See Hollander (1958) for a further explication of 


these variables, and Hollander (1961) for a detailing 
of their background in previous research. 
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TABLE 1 
MEANS FOR ACCORDED STATUS By EXPERIMENTAL 
‘TREATMENTS 
Treatment In grou fer New to group 
Extremely capable per- 6.25 5.84 
foie res 6.11 5.50 
Capable former : $ 
Average performer 5.06 4.50 
Poor performer 2.95 2.53 


yielded significant F values for both major 
variables, but not for sex difference or any of 
the interaction terms. 

Because of marked intersubject variability 
in the “percentage weight” response, the meas- 
ure of disapproval employed for the second 
part of the experiment was the percentage of 
subjects responding down" for each behavior, 
by the level of status previously accorded the 
stimulus person. These data were viewed in 
two ways. The difference in disapproval for 
each behavior, across status, was tested for sig- 
nificance by chi square, revealing only “sug- 
gests changes” to have a value significant be- 
yond the .05 level. Second, levels of accorded 
Status were correlated (rho) with ranks for 
magnitude of disapproval. A high negative 
coefficient signifies a decrease in disapproval as 
status increases, in keeping with prediction. 
This was in fact the case for two behaviors 
with significant coefficients: the key innovative 
behavior “suggests changes from group plans” 
(—.96), and “discusses group concerns with 
outsiders” (—.74). The pattern of rho values 
for the other behaviors was not uniformly 
negative. Of the three with positive signs, only 
one approximated significance, that being 
“interrupts to express comments at group 
meeting" (+.68). 

In Figure 1 curves are plotted for the two 
behaviors with significant values for rho and 
for the one with a value nearly so. Most marked 
is the curve for "suggests changes," the be- 
havior most closely tied to the assertion of 
influence. Disapproval of this behavior drops 
off quite systematically with rising accorded 
Status. A similar though less sharp effect is 
seen for “discusses with outsiders,” a behavior 
connected with the spokesman or advocate 
function; while not approved at any level, at 
the two lowest reaches of accorded status this 
is disapproved by all respondents. 
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Interrupts to 
express comments 
(rho = 4,68) 


fare 


Discusses group concerns 
with outsiders 
(rho = 7,74) 


Suggests changes 
from group plans 
(rho = 7.96) 


5 6 7 
Highest 


ACCORDED STATUS 
Fic. 1. Percentage of respondents giving downward evaluation of stimulus person for displaying indicated 


behaviors, by status accorded the stimulus person. 


A contrasting effect is seen in the positive 
curve for “interrupts,” suggesting that this 
behavior is less likely to alter the evaluation of 
à person of lower status than one of higher 
Status. This interpretation fits the proposition 
that shifts in expectancies are associated with 
increased status. In this instance, and as part 
of à probable democratic image of leadership, 
it becomes more crucial in conserving credits 
for the high status member than for the low 
Status member not to interrupt others in the 
group. 

The findings of this experiment are in line 
with the elements of the model considered. Of 
particular note is the evidence of directly dis- 
cernible inputs to status, from competence and 
time in group, that result in consistent re- 
Sponse tendencies. These effects are the more 
noteworthy for having been produced irre- 
Spective of the nature of the group—a choice 
left to each subject individually—and with 
only minimal stimulus material.‘ 


SuMMARY 


An experiment was conducted to test pos- 
tulates from the *idiosyncrasy credit” model 


‘This consideration encourages the prospect of 


appropriating this technique as a classroom demon- 
stration. 


of status. A brief description of a person they 
were to imagine in any group to which they be- 
longed was presented to 151 subjects of both 
sexes. Competence and length of time in the 
group were the major attributes manipulated. 
These were paired in eight descriptions, only 
one of which was given to each subject as a 
treatment. Responses were then made on a 
seven-point scale to signify willingness to have 
that person in a position of authority in the 
group, and this served as an index of accorded 
status. A rising mean score for accorded status 
was found for the increasing degrees of com- 
petence, and the mean for “new to group" 
was uniformly lower than that for “in group 
for some while” at each degree, The effects of 
these two major variables was significant, while 
sex difference was not. Subjects also provided 
an evaluation of the same person in terms of 
eight possible ways he might behave in the 
group. According to prediction, two behaviors 
reflecting innovative action were found to be 
disapproved significantly less the higher the 
status attributed to the innovator. 
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HYPOTHESES AND HABITS IN VERBAL “OPERANT CONDITIONING"! 


DON E. DULANY, Jr. 
University of Illinois 


(Adams, 1957; Krasner, 1958) raise two 
fundamental questions: Is this learning 
without awareness? And how well do empirical 
operant principles describe the data obtained 
in a verbal conditioning experiment? Possibly 
the effects were under the subject’s own verbal 
control, $ 
In the studies most faithful to the operant 
conditioning paradigm, subjects said words ad 
libitum and rate of plural nouns increased when 
followed by a buzzer (Greenspoon, 1951), a 
| light (Greenspoon, 1951; Sidowski, 1954), or a 
casually murmured *Umhmm" (Greenspoon, 
1955; Mandler & Kaplan, 1956; Wilson & Ver- 
planck, 1956). Greenspoon (1955) also showed 
that Umhmm was an effective reinforcer for 
other words. It is unsurprising to learn that 
one can influence the actions of another co- 
operative human being by informing him 
when his behavior is judged correct or accept- 
able. These studies attract attention because 
they apparently demonstrate learning without 
“awareness in some sense. Some see in this a 
program for identifying response classes and 
empirical reinforcers at the human level 
through the use of operant conditioning pro- 
cedures, As Verplanck (1955) puts it, “This is 
the identification of responses and reinforcing 
stimuli, and the verification and education of 
Jaws relating them to one another in human 
behavior under conditions where the subject is 
acting as naturally as possible, and where inso- 
far as possible, he is not ‘aware’ of what is 
going on" (p. 597). Salzinger (1959), in discuss- 
ing the verbal conditioning literature, con- 
dudes that, ^Very little deviation from the 
approach in animal work has been necessary 
so far to obtain lawful data in verbal behavior” 
(p. 70). But those accustomed to wonder about 
reinforcing mechanisms may think these find- 
ings antecedently improbable. Umhmm under 


1 This study was supported by grants from the 
National Science Foundation and the United States 
Public Health Service. Richard Erkes collected the 
lata of Experiment I; Janet Pierce and Margo Tuite 
ollected the data of Experiment II and assisted with 
tatistical computations. à 
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several circumstances could be a pleasant thing 
to hear, but itis unclear how a buzzer or a flash 
of a 75-watt bulb would activate any of the 
processes variously assumed to be the basis for 
a law of effect that is more than empirical. The 
improbable is not to be dismissed, certainly, 
but it does invite an examination of alterna- 
tive explanations—in this case the possibility 
that the operant conditioning effect was medi- 
ated by some kind of verbal control—hypothe- 
ses and self-instructional sets, perhaps together 
with the transfer of prior habits, 

There are hints of some alternative in the 
greater effectiveness of these contingent stim- 
uli in tasks that are puzzling enough to arouse 
problem solving. When subjects in our labora- 
tory course were asked to say words, and were 
permitted to express their hypotheses through- 
out, they seemed to grasp at any figural stimu- 
lus, even a light or buzzer, as a sign that would 
make sense of a silly, meaningless task. Sidow- 
ski (1954) found that uninstructed subjects 
conditioned as well as those advised that the 
light could be made to blink, and he had reason 
to believe that the “uninstructed” were self- 
instructed. With the more meaningful task of 
telling stories, Ball (1952) found a light to be 
ineffective as a reinforcer, Neither did a light 
influence response to a self-acceptance inven- 
tory (Nuthmann, 1957) or selection of pro- 
nouns while the subject formed complete sen- 
tences (Taffel, 1955), And Hildum and Brown 
(1956) found that Umhmm did not modify 
response to an attitude questionnaire, again a 
task the subject usually accepts at face value. 

_In the first experiment subjects are asked to 
say words and plural nouns are followed by 
Umhmm. A number of questions are asked and 
the subjects are divided into groups depending 
on what they say that they were supposed to 
do in the experiment. We should like to know 
whether these reports of behavioral hypotheses 
are related to selection of plural nouns as we 
would in theory expect such hypotheses to be. 
Theory in this case is not formal and well- 
developed, but merely a set of theoretical 
propositions of long standing—that subjects 
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under selective reinforcement tend to form be- 
havioral hypotheses, that these hypotheses 
tend to be accompanied by corresponding self- 
instructional sets (or “‘intentions”), and that 
these in turn lead to selection of the correspond- 
ing response class? Thus, with the additional 
hypothesis that subjects tend to report their 
hypotheses when questioned, subjects who re- 
port that they are supposed to say plural nouns 
should say them. But other behavioral hy- 
potheses as well must be examined for the pos- 
sibility that they are “correlated hypotheses” 
(Adams, 1957). Selection of another response 
class might, because of the subjects’ prior or- 
ganization of habits, also entail selection of 
plural nouns. If any report of a behavioral 
hypothesis is found to be related to selection of 
plural nouns, we shall, in a second experiment, 
instruct subjects to use the reported response 
class, Instruction, too, is commonly hypothe- 
sized to produce sets, and sets, in turn, to result 
in response selection, The term, “verbal con- 
trol," summarizes the set of theoretical prop- 
ositions involving behavioral hypotheses and 
instructional sets (self or social). It is used 
here because it suggests the reporting and ma- 
nipulating operations used and directs atten- 
tion to the subjects’ verbal processes, The hy- 
potheses and sets under consideration, might, 
however, be verbal, partially verbal, or en- 
tirely cognitive-neural—though in this situa- 
tion, one would think, verbalizable.* 


EXPERIMENT I 
Method 


Subjects. The subjects were 60 male undergraduates 
in the introductory psychology course, 43 experi- 
mentals and 17 controls. —— 

Setting and procedure. The experimental room, simi- 
lar to that described by the other experimenters, was 
8' X 6' X 10/, illuminated by a single hanging bulb, 
and bare except for two chairs and a table with a 


2 The hypothesis of association of behavioral hy- 
potheses and self-instructional sets probably requires 
the assumption of a fair amount of task motivation 
on the part of the subjects. 

* William James’ classical point that language may 
be inadequate for conveying complex awareness has 
been applied by Eriksen (1958) to the use of retro- 
spective criteria in studies of this type. The point is 
most troublesome for claims of learning without aware- 
ness. In this case, however, “I am supposed to say 
plural nouns" seems simple to articulate, and failure 
to do so seems strongly to imply lack of awareness. 
When reports of awareness are found to be related to 
behavior in the way awareness should be, the point 
loses some force. 


‘that you were supposed to say in order to be correct?” 
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microphone and tape recorder. The experimenter was 
a male senior in psychology who was mature in manner 
and appeared to be in his early twenties. He sat behind 
the subject and recorded his responses unobserved. 
After a few minutes of conversation to establish 
rapport, the experimenter gave the following instruc- 
tions taken from Greenspoon (1955): “What I want 
you to dois to say all the words you can think of. Do not 
use any sentences or phrases. Do not count. Please 
continue until I say stop. Go ahead.” A few, either 
baffled or perverse, had to be reminded several times 
not to give phrases. 
The four published experiments of this type have | 
varied in length and manner of scoring. Greenspoon 
(1955) solicited 50 minutes of responding and plural 
nouns were tallied by 5-minute periods. In the other 
three experiments subjects spoke some total number of 
words, and correct responses were scored by blocks of 
all responses. The present experiment required five 
blocks of 55 words of the subject, a chore that took the 
typical subject about 50 minutes. Scoring by blocks of 
all words has the advantage of reducing the large 
variance between subjects in output of plural nouns 
by eliminating the effect of wide individual differences 
in overall rate of uttering any words. 
For the experimental group the experimenter 
murmured “Umhmm” after each plural noun in the 
last four blocks of 55 responses. The intonation of | 
Umhmm varied too much for a meaningful phonetic 
description, but it could be characterized as non- 
committal to warm. While the controls were saying 
words, the experimenter remained silent. After each 
block of 55 responses, all subjects were asked, “What do 
you think the experiment is all about?” At the end of 
the session the experimenter asked that question again 
and several others: “Did you notice whether I said 
anything?” (“If so, what?” and “What do you think 
the significance of that was?”) “Was there anything 


The entire experimental session was recorded on tape. | 

‘Two people independently examined the reports— . 
and, of course, independently of any knowledge of the 
subject's performance. Their specific object was to 
sort the subjects into groups on the basis of their re- 
ported behavioral hypotheses, with a remaining cate- 
gory for those who reported nothing relevant. Sorting i 
by this single principle proved easier than expected 
and the two sorts perfectly agreed. 


Results and Discussion 


! 
Manipulations and performance. First, the 
performance of the entire experimental group 
was compared with that of the controls (see 
Figure 1). There was no need to eliminate any 
subjects from this analysis because none called f 
plural nouns “correct,” stated the purpose of | 
the experiment, or said that Umhmm was con- 
tingent on plural nouns, the usual criteria of § 
full “awareness” and basis for exclusion. No 
two of the other studies report the same 
analysis or the same learning curve, so it 15 
possible to replicate their findings here only in 
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the general sense of showing that output of 
plural nouns came under “control” of the con- 
tingent stimulus. An experimental effect is 
shown most simply by observing that for 35 of 
43 experimental subjects, mean frequency of 
plural nouns, averaged over the four rein- 
forcement blocks, was greater than frequency 
of plural nouns at the first, nonreinforcement 
l block; and that only 8 of 17 controls show the 
| Same increase. A chi square for unrelated 

groups of 5.48 with a p < .02 shows that this 
Increase is significantly associated with experi- 
mental treatments. 

That the effect does not register as consis- 
tent differences in trend over all five blocks 
can be seen from the insignificant group differ- 
ences in linear and quadratic components of 
trend (Lines B and C of Table 1) in Grant’s 
(1956) extension of the Alexander (1946) 
: analysis,‘ Apparently the effect of the rein- 

forcement is to support output of plurals at 
Some phase of reinforcement, but individual 
fluctuations over the four reinforcement blocks 
ate too irregular and too heterogeneous for 
Significant group differences in trend over all 
five blocks, either linear or curvilinear. This 
Breat heterogeneity must be in response to the 
i reinforcement because the term for linear slopes 
i between individuals is highly significant for the 
experimental group (F — 2.96, 41 and 123 df, 

; n 0005 and insignificant for the controls 

(F = 1.35, 15 and 45 df, p > .10) when each 
355 tested against its own term for individual 
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deviations from estimation. With still more 
heterogeneity in bind of individual curvilinear 
slope, which is visible in a graph of individual 
slopes, both the insignificant between-group 
and between-individual curvilinear trend terms 
are consistent with the significant chi square 
for differences in group shift. The lack of an 
overall mean difference, together with that 
chi square, also points to heterogeneity. The 
overall trend, a rise and fall in output of 
plurals, is best described by the second order 
orthogonal component (Line E). 

Subjects’ reports. The subjects’ answers to 
questioning fell into several conspicuous cate- 
gories. To Question 1, “What do you think 
this experiment is all about?," there was a 
scattering of answers such as “To study how I 
think,” “A vocabulary test,” and “I haven't 
the slightest idea”; but only one had a notice- 
able frequency. Thirty-four experimental sub- 
jects—and 11 controls—replied that the ex- 
perimenter must be studying their associations, 
that their task was to run through their asso- 
ciations. In each case the same answer had 
been given after one or more of the earlier 
blocks. This is a likely guess, since, as Bousfield 
(1953) has shown, a subject obliged to produce 
very many disconnected words usually runs 
through associated pools; and this is all of 
much consequence the subject sees himself 
doing. 

All subjects replied to Question 2 that the 
experimenter said “‘Umhmm” and were there- 
fore asked, “What do you think the significance 
of that was?” They suggested variously that 
Umhmm was an “encouragement to con- 
tinue,” “A distraction,” or “Of no signifi- 
cance,” and again only one reply was at all 
common. Eleven of the above 34 reported that 
they were supposed to associate in a series or 
in the same category whenever the experi- 
menter said “Umhmm.” These 11 subjects 
were therefore separable from the others. 
Typical replies were “I noticed that you ap- 
pear to say ‘umhmm’ when I seem to follow a 
line of more or less synonyms.... Your 
‘umhmms’ show me an indication that I’m on 
the right track. ... You seemed to be in favor 
of the more direct associations”; “I think the 
only thing is you pick at random, like I picked 
‘vegetables,’ and you wanted me to talk about 
that and every time I mentioned some vege- 
table you'd say 'umhmm' ”; ^I think you're 
trying to make me use the same—ehr—stay in 
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TABLE 1 
SUMMARY OF TREND ANALYSES 
i Reinforcement for dative Hyothesi No Associative 
ve icc mtv Axe eni Hye, 
oi 
E df | MS F df| MS F dj | MS F df| MS F 
4 nsf | 1| 24.19| 0.12 1/824.85| 4.78* 1| 11.04/0.07 1/538.69/8.01** 
s [Ede cui n g | 1|74.83| 1.82 1/582.99]11.46*** | 1| 1.35/0.05 1| 11.05/0.47 
l 
o RCM h| 1|33.85| 1.54 1| 16.60| 0.53 1| 11.94/0.51 1| 60.73/2.81 
quadratics 
i all linear sl i 1| 41.08) 0.86 1/207.43| 4.08 1| 54.02/1.88 1| 5.85/0.25 
E Overall quedimues h | © 1/203.06] 9.22*** | 11163.29| 5.20* 1/230.14/9,90*** | 1| 96.07 4. a 
F. Between-individual |i | 58203.7210.61****26/172.61| 6.80****| 38/150.32/8. 58****/24| 67.22 4.50 
G. Betveen individual i | 58| 48.03) 2.50****06| 50.87] 2.00* 38) 28.771.55 24| 23.47/1.57 
linear slopes 
H. Between individual i | 58} 22.02) 1.15 26) 31.43| 1.24 38) 23.25/1.25 24) 21.64/1.45 
quadratics 
I. Individual deviations | |174| 19.20 78| 25.39 114| 18.56) 72| 14.93 
from estimate 
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the same category . . . you arbitrarily pick sub- 
jects and try to make me say what you wanted 
me to say”; “Well, I think that you’re saying 
‘umhmm’ when I say a series of words that are 
connected together . . . well, like I said before, 
the only thing I could think of was Pd start 
relating words to a definite subject and you'd 
encourage me to do that." Most of these sub- 
jects also stated the implied converse: that no 
Umhmm signaled that they should change 
categories. None of the controls reported this 
hypothesis. 

Question 3, “Was there anything that you 
were supposed to say in order to be correct?" 
was comparatively uninformative, mainly be- 
cause it had already been answered. Eight of 
the above 11 restated the hypothesis of “rein- 
forcement for association" and 3 answered, 
quite consistently, that no particular words or 
associations were correct. The others replied 
“No” or that if there were a correct response, 
they did not know what it was. 

Reports and performance. The experimental 
subjects, then, divide most naturally into 
three groups: “reinforcement for association,” 
“associative hypothesis alone,” and “no asso- 
ciative hypothesis,” They are graphed along 
with controls in Figure 2. The great rise of the 
“reinforcement for association” group is re- 

flected in the highly significant F for between- 


group linear slopes (Line B of Table 1), The 
“associative hypothesis alone” group is hardly 
distinguishable from the control group and 
neither of the between group trend terms is sig- 
nificant. There are no significant trend differ- 
ences between the controls and the “no asso- 
ciative hypothesis” subjects, but this group is 
significantly below the controls in mean output 
of plurals (Line A), This is a little puzzling, 
but a group low in plural output could be 
selected by their failure to form the associative 
hypothesis if the Umhmm, which did follow 
plurals often associatively rendered, had any 
tole in suggesting that the experimenter was 
interested in associations. 

This analysis is obviously post hoc in some 
sense, but it does not thereby capitalize We 
chance differences, The likelihood of accepting 
a chance difference is no greater when separat- 
ing subjects on some principle before inspect- 
ing their performance than when separating 
subjects on some theoretical hypothesis before 
conducting the experiment. It is only necessary 
to identify the groups in some clear way inde- 
pendent of their performance. There does re- 
main some capitalization on chance, but it is by 
virtue of multiple comparisons. Only three 
groups are compared with controls, though, 
and the one significant difference in group 
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slope is very greatly so.5 The theoretical inter- 
Pretation of the difference is another matter. 

A post hoc hypothesis. In sum, subjects saying 
nothing that classifies them as “aware” in the 
usual senses show an acquisition effect in 
aggregate; those who say that they are to asso- 
ciate in a series when the experimenter says 

Umhmm” show a considerable acquisition 
effect; and the others show none at all. This 
result is unexpected because unlike the “cor- 
rect” hypothesis, the “reinforcement for asso- 
cation" (RFA) hypothesis has no immediately 
obvious telation to saying plural nouns. But 
let us imagine an experiment in which subjects 
Say words while we reinforce in first one seman- 
tic category, then another, tallying as correct 
all responses falling in the semantic category 
that is for the moment reinforced. For the RFA 
Subjects this was the perceived pattern of rein- 


though”) S'stematic replications are now available, 
une y vary in ways that could have affected the 
aie T of subjects reporting the RFA hypothesis. In 
Ghee ieee 8 of 63 report the hypothesis; in the 
uh x4 35. In both, the RFA subjects perform 

Won lo these and differ dramatically from controls. 
cantly ene do other subjects differ signifi- 
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forcement, and the experimenter defined a 
semantic category as correct just as long as 
plural nouns were forthcoming. It is the tally 
that differed. If a subject of the imaginary 
experiment hypothesized that he was to asso- 
ciate in the category of the moment when the 
reinforcement was “on” and to scan for a new 
category when it was “off,” he should be very 
well able to increase his tally of correct re- 
sponses. Association, we know from common 
sense and an inspection of the Minnesota 
norms (Russell & Jenkins, 1954), tends to pro- 
duce semantically related words; and with in- 
struction to associate in the “same category," 
the tendency should clearly be great. Now it 
might be that with the same instruction, 
stimulus and response items would tend also to 
be grammatically related—to be of the same 
part of speech or linguistic form class, Words 
that can take the same position in an utterance 
conceivably could be linked in some way that 
would be manifest associatively. The sug- 
gestion is that with instruction to associate in 
the same semantic category, semantic and 
grammatical associative responses might be 
correlated—that the RFA hypothesis is a cor- 
related hypothesis. 

Subjects of the RFA group report that they 
are to associate in a series upon presentation of 
the “Umhmm” and to scan fora new category 
upon its omission. And a reported hypothesis 
should tend to be coupled with some self- 
instructional set. Since the reinforcement in 
this experiment followed plural nouns we would 
then infer an associative set after plurals and a 
nonassociative set after other words. There 
was no report of this kind from controls and no 
reason to expect the same alignment of sets for 
them. In short, it may be that frequency of 
plural response to prior plural is related to an 
associative as opposed to a nonassociative set. 
“Diamonds,” when one continues a series, 
should bring “rubies” or "pearls"—plurals. 
Scan for a new category and “telephone” might 
come as easily as “cigarettes.” This post hoc 
hypothesis, to be tested in Experiment II, 
would make sense of the great increase in 
plural nouns for the RFA group, relative to 
controls, and the operant conditioning of 
plural nouns could be attributed to hypotheses 
which when acted upon are the occasion for 
manifesting prior verbal habits. 

Interpretation of the subjects’ reports. We 
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should first consider a possible objection that 
the questions asked might have suggested the 
reports obtained. The results lose generality if 
the RFA hypothesis would not have been 
formed without the early and repeated ques- 
tion, “What do you think the experiment is all 
about now?” It seems unlikely, though, that an 
adult brought to a small room and asked to say 
words, no phrases or sentences please, would 
not be provoked to wonder as much. And if 
this general question could make one hypothe- 
sis more likely, it probably would have done 
the same for the plural noun hypothesis; yet 
no one called plural nouns correct. Moreover, 
Sidowski (1954) has reported, and others have 
implied, that their subjects adopted question- 
ing, problem solving sets without special in- 
structions. The thought that the RFA hypothe- 
sis could have been suggested entirely by 
questioning at the end is embarrassed by the 
coincidence that only the RFA group shows an 
increase in plurals, Whether or not subjects in 
other studies, and how many, formed the RFA 
hypothesis is, of course, unknown; but, in any 
case, subjects who failed to report that hy- 
pothesis in the present experiment do not repli- 
cate the verbal conditioning effect. 

The present research does not accord un- 
questioned validity to the subjects’ reports or 
in any way rely upon a phenomenological data 
language. The concern is to determine whether 
the subject’s report of a behavioral hypothesis 
is related to response selection in ways that we 
would in theory expect his hypothesis as con- 
Struct to be related to response selection, We 
have considered that a behavioral hypothesis is 
related to response selection through a self- 
instructional set. The hypothesis of valid re- 
port is but one more Proposition within the 
theoretical network to be evaluated by the 
data. And these propositions, together with the 
post hoc hypothesis, predict the obtained rela- 
tion of report to response selection. Moreover, 
on the common hypothesis that instruction to 
perform a response class may arouse a set to do 
so, it is also important to learn whether the 
relation of instruction to response selection is 
like that found for the corresponding report 
and response selection, The hypothesis of valid 
report will be further supported if the sets to be 
induced by the experimenter's instructions in 
Experiment II are found to be related to re- 
sponse selection in the same way found for the 
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reports from which self-instructional sets were 
inferred in Experiment I. In short, the subjects? 
reports are interpretable if they behave as ex- 
pected from the network of hypothesized rela- 
tions among instructions, hypotheses, sets, re- | 
ports, and response selection. The logic of 
science embodied in discussions of a nomologi- 
cal network and construct validity (Carnap, 
1956; Cronbach & Meehl, 1955; Hempel, 1952; 
Sellars, 1948; Spence, 1958) may be appropri- 
ate, not only for the subject’s response to a 
personality questionnaire and extended per- 
sonal history, but also for his response to 
questions and more immediate experimental 
history. 


EXPERIMENT II 


This experiment tests the post hoc hypothesis - 
of Experiment I, that frequency of responding | 
with plural nouns to plural nouns is related to 
an associative as opposed to a nonassociative 
set. It also tests the corresponding hypothesis - 
for Other Words, that frequency of responding - 
with Other Words to Other Words is related in 
the same way to those sets, a possibility that | 
has implications for Greenspoon's (1955) re- - 
ported conditioning of Other Words. In Experi- 
ment I, subjects hypothesized what was ex- - 
pected of them; those hypotheses are the basis 
for instructing subjects in what is expected of 
them in Experiment II. By inducing sets in 
this way, the suspected verbal habits might be 
manifest in the absence of verbal reinforce- 
ment. 


Method 


One hundred plural nouns and 100 Other Words 
were drawn at random from the protocols of the um 
experiment and were used as stimulus words in a wor 
association test. Within each block of 50 stimulus words 
presented, 25 plural nouns and 25 Other Words were 
randomly intermixed. The subject responded to € 
stimulus word in the first and third blocks of 50 words 
with an associative set and to words of the second an 
fourth blocks with a nonassociative set. Just before 
the first and third blocks of words were presented, the 
experimenter read the following instructions to induce 
an associative set: “This is a study of word habits: 
Now listen to each word I read and then tell me the 
first word you think of that is in the same eed 
that is related—that would continue in a series.” 10 
induce the nonassociative set prior to the second i 
fourth blocks the experimenter said, “Now listen, 
each word I read and tell me the first word you t 4 
of that is mot in the same category, that is unrelate 
that would not continue in a series.” The experimentet, 
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a female graduate student, read each word aloud and 
recorded the subject's responses. Subjects were run in 
the same experimental setting as that of Experiment 
I. At the end of the session the experimenter asked, 
“Was there anything you were supposed to say in order 
to be correct?," and the subject's responses were tape 
recorded. 

Subjects of the primary sample were seven male 
undergraduates, presumably representative of the 
same population as those of Experiment I. Two others 
were dropped from the experiment when they failed to 
follow instructions, in one case by persistently giving 
simple associations that were not “in a series" (for 
example, blue-bird rather than blue-green or yellow), in 
the other case by failure to shift to a nonassociative 
set. In both cases the session was halted and question- 
ing revealed that the subject had not understood the 
instructions. It is often difficult to judge for a particular 
response whether or not a subject has followed instruc- 
tions; it is not so difficult after a series of responses to 
judge that he consistently does not. An additional five 
high school students and five graduate students in 
psychology were also run as a. partial check on the gen- 
erality of the habit. None knew the hypothesis under 
investigation or the identity of the investigator for 
Whom the assistant collected the data. When ques- 
tioned none mentioned grammatical categories or re- 
plied that a particular kind of response was correct. 
The graduate students accepted the procedures as an 
assessment of verbal habits, of a character unknown to 
them, and expressed some surprise upon being told 
later that the experimenter was interested in gram- 
matical rather than semantic categories. Most of the 
others submitted themselves to a “personality test.” 


Resulis and Discussion 


Table 2 presents mean frequency of plural 
and other responses, for both samples, in the 
Several experimental conditions, These values 
Summarize the effects obtained ; tests of signifi- 
cance are based on the analysis of individual 
Contingency tables. Phi coefficients and chi 
Squares for the relation of frequency of plural 
response to plural stimulus (plural vs. other 
Tesponse) with associative vs. nonassociative 
Sets are given in Table 3. Yates’ correction was 
applied to the phi formula when expected fre- 
quencies were less than five. All phi coefficients 
ftom the undergraduate subjects are positive 
and all converted chi squares are highly signifi- 
cant. Data from the high school and graduate 
Students (Table 3) are consistent with this, 
Contrary to common misunderstanding of the 

independence” requirement, phi and chi 
Square are appropriately computed upon a 
ke individual's responses. When significant 
- Cy imply replicability for a particular sub- 
Ject. Generality across subjects can be inferred 
tom 16 individual replications, which, ignoring 
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TABLE 2 
MEAN FREQUENCIES OF PLURAL RESPONSE AND OTHER 
RESPONSE TO PLURAL STIMULI AND OTHER STIMULI 
UNDER ASSOCIATIVE AND NONASSOCIATIVE SETS 


Plural stimuli | Other stimuli 
Subjects Nonas- Nonas- 
dative soci | | soca 
xij set "t set 
College undergraduates 
Plural R 32.3 | 11.6] 1.7] 4.7 
Other R 17.7 | 38.4 | 48.3 | 45.3 
High school and gradu- 
ate students 
Plural R 35.8 | 13.2 | 3.6 | 6.4 
Other R 14.2 | 36.8 | 46.4 | 43.6 
All subjects 
Plural R 34.4 | 12.5| 2.9] 5.7 
Other R 15.7 | 37.5 | 47.3 | 44.3 
se ESE SE ee TL Qi Tp COUTE 
TABLE 3 


ASSOCIATION OF PLURAL RESPONSE AND OTHER 
RESPONSE WITH ASSOCIATIVE AND NONASSOCIATIVE 
SETS ror PLURAL STIMULI AND OTHER 


ST™ULI 

co 

Plural stimuli Other stimuli 

Subjects $| 1 P<| @ |»xj« 

College undergraduates 

1 «27| 7.3 | 01 405 | 0.3) 
2 +40] 16.2*| .001 | .09 | 0.9) 

3 -40| 15.8") .001 | ..19 | 3.4| .10 
4 -64| 41.2 | .001 | 13 | 1.6 
5 +65] 42.7 | .001 | .08 | 0.7, 
6 +28] 8.1 | .01 16 | 2.6] 
7 +33) 10.9 | .001 07 | 0.5 

High school and graduate 
students 

1HS -17| 3.0 | 10 | .05 | 0.3 
2HS +22) 4.8 | .05 AL | 1.3 
3HS 442, 17.4 | .001 |. .11 | 1.3 

4HS -31| 9.7 | .005 | .19 | 3.5| .10 
SHS +56) 31.6 | .001 |—.08 | 0.7) 
6G +70) 49.6 | .001 | .05 | 0.3] 
7G «52| 26.5 | .001 | .15 | 2.3) 

8G 38| 14.4 | .001 | .00 

9G «82| 67.1 | .001 | .09 | 0.9) 

10G +57] 32.6 | .001 | .23 | 5.4| .02 


^ Phi's to 4 decimal places, which differed before rounding, 
were used in the computation of chi square. 


size of effects, has a chance probability of 277, 
The results clearly support the post hoc hy- 
pothesis of a relation of plural response to 
plural stimulus with a set to associate in a 
series as opposed to a nonassociative set. They 
suggest a basis for the increase in plural nouns 
of the RFA group in Experiment I. 1 
"Table 3 also presents analogous information 
Íor the association of frequency of Other re- 
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sponse to Other stimulus with the two induced 
sets, The tests reported come from contingency 
tables representing the association of other re- 
sponse vs. plural response with associative vs. 
nonassociative sets. These phi’s tend toward 
zero or low positive relation. Taking both 
samples, only two of the chi squares for this 
relation have p values lower than .10, and only 
one lower than .02. But 15 of the 17 phi's are 
positive, and by the sign test this outcome has 
a chance probability of less than .01 (two- 
sided). The data at least support the possi- 
bility that hypotheses and associative habits 
could have mediated the operant conditioning 
of Other Words reported by Greenspoon 
(1955). None of these effects is large, but then 
Greenspoon (1955) reports a comparatively 
weaker conditioning effect for Other Words— 
an F at the .05 level for the three conditions 
Umhmm, Control, and Uh-huh. 

The latter contingency tables can, of course, 
just as well be interpreted as showing a positive 
relation between plural response to Other 
Words and a nonassociative set. Viewed in this 
way, they suggest that scanning for a new 
category after Other Words, reported by 
subjects of Experiment I, might also have in- 
creased the tendency toward production of 
plural nouns, 

Since categorization may follow many pos- 
sible attributes, it would be difficult for an ob- 
server to judge with great accuracy whether or 
not the subject’s responses were in every 
instance “in the same category” as the subject 
would categorize. However, it was only neces- 
sary to observe that the associative and non- 
associative instructions were generally fol- 
lowed in order to interpret the data of Table 3 
as evidence for the correlation of two Tesponse 
classes, responses matched in grammatical 
category to stimulus and responses matched in 
semantic category to stimulus. 

The verbal association experiment permits 
greater control over the manipulation of sets, 
as well as continuity with a large literature. It 
does not permit a clear comparison of the 
magnitude of effects with that of the free re- 
sponse procedure. Nevertheless, the procedure 
demonstrates the kind of effect hypothesized 
and shows it to be appreciable under fairly 

conservative conditions. The number of stimuli 
is fixed in Experiment II, and it does not pro- 
vide for cumulative effects of the type possible 
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in Experiment I. Moreover, in the free re- 
sponse procedure, the associative response is 
often given after a series of two or more plurals 
and the availability of another plural ought to 
be even greater in that case than after a single 
plural. The hypothesis only requires that we 
find the same kind of relation of instructional 
sets to behavior that we have found for 
reports and behavior, reports from which self- 
instructional sets were inferred. The quantita- 
tive effects of instructions uniformly adminis- 
tered and self-instructions irregularly emerging 
are not fairly compared in any obvious way. 

It is worth noting that the grammatical as- 
sociative habits transferred without reports by 
the subjects that they were supposed to pro- 
duce plurals or Other Words on plural and 
Other stimulus. This finding agrees with the 
common view that transfer may occur without 
awareness even if acquisition may not. And 
finally, the experiment again calls to mind (cf. 
Watt, 1905) the importance of instructional 
sets in the selection of associative habits to be 
manifest. 


GENERAL DISCUSSION 


Experiment II supports the interpretation 
placed on Experiment I. If RFA subjects acted 
on the hypotheses they reported, they should 
have increased their output of plural nouns. An 
associative set after plural nouns should have 
increased the rate of plural nouns, and s0 
Should have a nonassociative set after Other 
Words, though much less so. Taken together, 
the experiments suggest a mechanism of verb 
control that may have generality. Subjects 
may hypothesize what is expected of them and 
instruct themselves to act accordingly. Acting 
accordingly may constitute a response or oper- 
ation as relevant cues are encountered, and the 
execution of correct responses as the direct 
manifestation of prior habits. In Experiment I, 
hypotheses are reported, and we infer cor ' 
responding self-instructional sets to associate ID 
series on cue of the reinforcement and to sca? 
for a new category on cue of its omission. Wit 
this alignment of sets, to judge from Experi 
ment II, grammatical associative habits at 
manifest in plural response to prior words A 
uttered. " i 

Relation to other findings. If impressive s: | 
dence of verbal operant conditioning withow 
awareness, beyond the effect of Umhmm 0? 
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plural nouns and Other Words, remains un- 
touched by the implications of this study, then 
the present findings of course have less signifi- 
cance. Sidowski (1954) reports that a light was 
effective reinforcement for the conditioning of 
plural nouns and Greenspoon (1951) reports 
the same for both a light and a buzzer. Umhmm 
rather than a light or buzzer was used in the 
present study not only because it has been used 
more often, but also for the greater generality 
of any finding that the contingent stimulus 
acts as a cue, a source of hypotheses and an 
occasion for acting on them, rather than as a 
gratifying social acknowledgment that rein- 
forces without awareness, A light or a buzzer 
ought to be a less gratifying acknowledgment 
and a more figural cue. 

Greenspoon (1955) also found that Huh-uh 
decreased the frequency of plural nouns, 
Though plausible, it would strain a point to 
Suppose that Huh-uh could have signaled 
“change categories,” a hypothesis which Ex- 
periment II suggests would depress the fre- 
quency of plurals. In any case, the significant 
effect reported comes from an F for three con- 
ditions—Umhmm, Control, and Huh-uh—and 
the significance of the variance might be due to 
the effect of Umhmm: the Umhmm group 
differs by ¢ test from controls at all four rein- 
forcement blocks, the Huh-uh group at only 
one block. Wilson and Verplanck (1956) add 
that Umhmm or Good increased the rate of 
Saying adverbs in six of seven cases, Though it 
Should be documented, analogous adverb-to- 
adverb association habits are a strong possi- 
bility, and given the same task used in the 
present study, their subjects seem no less likely 
to form the RFA hypothesis. Furthermore, 
controls were not used and the overall decline 
in frequency of plural nouns produced by con- 
trols here suggests that adverbs might increase 
In rate without reinforcement. A striking cumu- 
lative Curve for frequency of pecking with 
application of a contingent stimulus is com- 
Pelling evidence for the functional control of 
that stimulus, A Significant shift in output of 
many human response classes may not be. 

u, Ce areas of semantic content such as 
, travel words” or "living thing words" are re- 
Md (Wilson & Verplanck, 1956) the RFA 
SM diesis should be no less likely, and a set to 

Ssociate in the same category should obviously 
Produce more words in that category. Wilson 
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and Verplanck do acknowledge that many of 
their subjects stated “correlated hypotheses" 
(see Adams, 1957), for example, “geographic 
locations” for “travel words,” but they report 
that these subjects performed no differently 
than did subjects reporting nothing of rele- 
vance. Krasner (1958) and Adams (1957) have 
reviewed a number of other studies, employing 
procedures other than the free operant pro- 
duction of single words, in which learning with- 
out awareness is reported. Whatever the signifi- 
cance or deficiency of these studies, they are 
not discussed here beyond observing that both 
reviewers question the adequacy of criteria for 
awareness and the exclusion of correlated hy- 
potheses. 

The question of learning without awareness. 
Does Experiment I show learning without 
awareness? Even to ask the question requires 
that we specify some conception of awareness. 
Certainly there are many objects of awareness 
and the question is not whether unconscious 
subjects learned. There is no obvious solution 
in the literature, where investigators describe a 
number of awarenesses their subjects are said 
to learn without. But the matter should not be 
entirely arbitrary if we are interested in other 
than learning without this-that-and-the-other 
awareness variously revealed. 

Questions of theoretical significance require 
that we specify some awareness that is theo- 
retically relevant. First of all, we wish a theo- 
retical term for awareness that will enter into 
theoretical propositions relating it to response 
selection. A "correct behavioral hypothesis"— 
the hypothesis that a correct response class is 
correct—is such a term. It is related to response 
selection within the theoretical network de- 
scribed. Verbalization of the contingency of the 
reinforcement on the correct response class, the 
most favored conception, does not so clearly 
predict selection of correct responses because 
it may leave the subject wondering what 
Umhmm was for and what he was supposed to 
do. Nor is there a clear inference of verbal 
antecedents in any number of acceptable state- 
ments of the purpose of an experiment or a re- 
port that learning occurred. 

But the matter is additionally complicated 
by Adams’ (1957) important suggestion that 
"correlated hypotheses" might account for 
many of the effects reported. By "correlated 
hypothesis" Adams seems to mean any aware- 
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ness other than of the correct response class 
that could account for better than chance per- 
formance. But this suggestion is not useful 
enough for prediction and too useful for post 
hoc explanation. The concept begs redefinition 
by some other specifiable relation to the cor- 
rect response class. Let us consider instead that 
a hypothesis is a “correlated hypothesis” if the 
response class it calls correct is correlated with 
the response class the experimenter calls cor- 
rect. Any hypothesis naming some incorrect 
response class as correct may be related within 
the theoretical network to selection of the re- 
sponse class it names, If that response class and 
the correct response class are correlated, then 
such an hypothesis becomes an antecedent 
from which to predict selection of the correct 
response class. To identify a correlated hy- 
pothesis, we might instruct the occurrence, 
then nonoccurrence, of the response class it 
calls correct, and observe a correlation of 
presence and absence of those responses with 
presence and absence of the responses the 
experimenter calls correct. Or where the 
hypothesized response class is identified with 
limited reliability, as is “associating in a series,” 
we might observe a correlation of those in- 
structions with presence and absence of mem- 
bers of the correct response class. This alterna- 
tive follows on the hypotheses that instruction 
may induce the hypothesis-related set and re- 
sponse class. On the evidence of Experiment II, 
then, “I am supposed to associate in a series 
when you say ‘Umhmm’ ” was identified as a 
correlated hypothesis for “I am supposed to 
say plural nouns." 

Thus, either a correct or correlated hypothe- 
Sis—as redefined—should in theory lead to 
selection of the correct response class, and to- 
gether they provide a conception of awareness 
relevant to a second theoretical concern. Since 
"learning without awareness" is commonly 
interpreted as supporting a theory of response 
selection by automatic action of aftereffects 
(Thorndike, 1933), we should also like a con- 
ception of awareness relevant to that theory. 
The classical theory of automatic strengthen- 
ing, unqualified by further assumptions, clearly 
implies that learning should occur in the 
absence of either a correct or a correlated hy- 
pothesis. Any such evidence would tend to 
support that theory as opposed to some alterna- 
tive of verbal control. However, with report of 
a correct or correlated hypothesis as the indi- 
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cator of awareness, Experiment I presents no 
evidence for learning without awareness, 

But is there a way to recognize that learning 
occurred only with reports of awareness and 
still hold that it occurred by automatic 
strengthening? The problem is to account for 
the relation of reports to performance in a way 
consistent with that theory. 

1. Could the hypotheses and report some- 
how be in response to the increased usage of 
plural nouns or to the process of automatic 
strengthening itself? Perhaps subjects rational- 
ize their plural output as “associating in a 
series after "Umhmm,' ” or the hypothesis and _ 
report might be a phenomenal and verbal 
emergent of the process of automatic strength- 
ening. These assumptions lack plausibility and - 
generality. With no principle to link selection 
of plurals with this particular hypothesis and ~ 
report, we are left to wonder why subjects did © 
not report the opposite, or what a subject 
would report if we automatically strengthened 
his adjectives or ear-tugging. 

2. There is a more serious variant of this set 
of assumptions. Suppose that whatever re- 
sponse class is automatically selected, a hy- 
pothesis or report of that response class follows 
as a rationalization or as a phenomenal or 
verbal emergent. And suppose it was associat- 
ing in a series after Umhmm that was the re- 
sponse class automatically strengthened. This 
is probably more consistency than a theory of 
rationalization can manage—would all subjects 
who learn need to rationalize their behavior? 
Or it would require radical augmentation of à 
classical theory of automatic strengthening. To 
assume that automatic strengthening entails 
phenomenal or verbal emergents is, in fact, @ 
new and radical theory, and should be brought 
forward strongly, if at all, so that it might be 
evaluated. 

3. Most appealing is the thought that both 
awareness—the RFA hypothesis—and auto 
matic strengthening of “associating in a series 
might be consequences of the reinforcement. 
Perhaps learning occurred only with awareness 
because enough reinforcement to produc 
learning is enough to produce awareness. But 
this assumption must be supported with others: - 
Number of reinforcements was not system 
atically varied, but programed alike for all, an! 
those who learned show no initial advantage êt 
the first block which could bring them mor? 


reinforcements thereafter (Mara = >” 
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Motter = 5.9). Still we might assume that the 
Umhmm was at the same time made more re- 
warding and more figural for some, and those 
subjects learned and reported the correlated 
hypothesis. Or some unknown kinds of indi- 
vidual receptivities to reinforcement and dis- 
positions to awareness might be correlated. 

Explanations 2 and 3 would still require an 
hypothesis of correlated response classes in 
order to account for the coincident selection of 
plural nouns, Explanation 2 does not suggest 
why some learned automatically and some not. 
Neither explanation would account for other 
findings that some subjects report a nonrein- 
forced response class as correct and in fact 
select it (Dulany, 1960). And, of course, they 
leave the effects of instruction in Experiment 
II to be explained in some other way. In sum- 
mary, a theory of automatic strengthening by 
aftereffects apparently does not account for 
these findings without auxiliary assumptions or 
radical augmentation. A little thought will 
usually supply assumptions to brace a chal- 
lenged theory and it is rarely disproved. But 
the result does not seem a better theory than a 
theory that subjects report processes prior to 
and instrumental in response selection. It 
Seems better to interpret the results within a 
network of simple hypotheses that appear 
sufficient to account for our findings, and that 
bring within the same formulation the com- 
mon relation of reports and of instruction to 
response selection. 

The question of operant conditioning. Neither 
reports of behavioral hypotheses nor the 
mechanism of verbal control outlined here enter 
into the language of operant conditioning as 
described by Skinner (1938) and extended by 
many (for example, Salzinger, 1959) to the 
Verbal conditioning literature. Operant con- 
ditioning is said to occur when rate of response 
's brought under the functional control of a 
Contingent stimulus (Skinner, 1938, 1953). 
How well do empirical operant principles de- 
Scribe the data of subjects in Experiment I? 
There are, of course, many possible abstractions 
Upon the data, and the fit of empirical operant 
Principles is no less good for uncovering medi- 
id mechanisms. A free operant came under 
the functional control of a contingent stimulus. 

pirical operant principles will describe the 
Performance of subjects with the RFA hy- 
P and loosely fit the aggregate behavior 
€ total experimental group. But the de- 
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scriptive language of operant conditioning gives 
no account of the difference in performance of 
those reporting and those not reporting the 
critical hypothesis. With attention to the sub- 
jects’ reports there is less mystery that some 
subjects “condition” and some do not, If there 
are other resources within Skinner's (1953, 
1957) extended system to account for such 
findings, an accumulation of similar findings 
would provide a challenge to call upon them. 
No formulation is obliged to account for ir- 
relevant findings or to answer irrelevant 
questions. But verbal conditioners, too, ap- 
parently see awareness and possible verbal 
control as critical to their position, and it is 
they who have raised the question, A lack of 
awareness has been set as a condition for 
specifying the laws relating empirical rein- 
forcers and response classes (Verplanck, 1955, 
p. 597). And most investigators are at pains to 
report that verbal conditioning occurred with- 
out, or unrelated to, reports of some kind of 
awareness, The languages of operant con- 
ditioning and verbal control are alternative 
formulations in the sense that each may have 
heuristic value. Wherever, there is evidence of 
a relation of reported behavioral hypotheses or 
of behavioral instructions to response selection, 
they would not seem to be equally adequate 
accounts of the data. 

Still to be considered is the point that, apart 
from the question of awareness or verbal con- 
trol, operant conditioning procedures have 
been shown to “identify” plural nouns as a re- 
sponse class. Identification of response classes 
is, of course, a clearly stated and fundamental 
aim of operant analysis, and in the verbal 
operant conditioning literature this identifica- 
tion is made through the manipulation of a 
contingent stimulus. Verplanck (1955) re- 
phrases Skinner's (1938) definition of a re- 
sponse class in this way: “‘...a part of be- 
havior (a) that is recurrently identifiable and 
hence enumerable; and (b) whose rate of oc- 
currence can be determined as a systematic 
function of certain classes of environmental 
variables" (p. 595). Salzinger (1959) concurs. 
By the definition given, however, plural nouns 
ought to be identified as a response class when 
subjects are instructed to utter them and do 
so—voluntarily. That they will is easily shown, 
and this method of identification has been a 
routine part of exploratory work preceding this 
and other of our experiments on the selection 
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of various response classes (Dulany, 1960). It 
clearly answers the question whether a col- 
lection of possible responses will function as a 
behavioral variable by showing their rate of 
occurrence to be a function of presence and 
absence of instruction, 

The empirical relation of plural nouns to a 
contingent stimulus is interesting and valuable 
from many standpoints. But though manipula- 
tion of a contingent stimulus is a sensible re- 
course with prelinguistic organisms, for the 
many human response classes accessible to 
awareness and capable of verbal control, in- 
struction should be a simpler and more fruitful 
way of identifying functional response classes. 
We should not then have to depend upon the 
emergence of awareness or, alternatively, upon 
a fragile and disputable finding of “condition- 
ing without awareness.” The unsettled em- 
pirical question is whether response classes 
identified so simply will behave as response 
classes under selective reinforcement in the 
absence of awareness and verbal control. The 
present experiments offer no assurance that 
plural nouns will ——— 


SUMMARY 


Tt is not certain that “verbal operant con- 
ditioning” occurs without awareness or is ade- 
quately described in the language of operant 
conditioning. 

In the first experiment, subjects said words, 
and plural nouns were reinforced with Umhmm, 
Tn the aggregate, subjects showed a significant 
shift to plural nouns when compared with 
controls. Upon questioning, approximately 
25% of the experimental subjects reported the 
hypothesis that whenever the experimenter 
said “Umhmm” they were to associate in series 
and that no acknowledgment meant that they 
were to change semantic categories. This group 
produced a highly significant acquisition effect, 
and the remaining subjects none. With moti- 
vated subjects these hypotheses should be ac- 
companied by self-instructional sets. For the 
successful experimental subjects, then, we infer 
that a “series associative set” tended to follow 
plural nouns and a “nonassociative set” tended 
to follow Other Words. None of the controls 
reported the critical hypothesis. If response 
after plural nouns brings more plural nouns 
with a set to continue a series than with a set 
to find a new category, the present verbal con- 
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ditioning effect might be ascribed to the medi- 


ation of hypotheses, sets, and the transfer of 
prior verbal habits. 

Experiment II presented a word association 
test with verbal reinforcement excluded. Fre- 
quency of plural nouns in response to plural 
nouns was significantly associated with a set 
to associate in series as opposed to a set to 
change categories. Other response to Other 
Words showed the same, though weaker rela- 
tion to these sets. 

_ The paper discusses the implications of these 
findings for some of the reports of verbal 
operant conditioning, for the question of learn- 
ing without awareness, and for the adequacy 
of operant conditioning principles to describe 
the data obtained, making the following 
principal points: 

1. The “conditioning” of plural nouns in 
the present study was mediated by a mecha- 
nism of verbal control. Subjects may hypothe- 
size that some response class is correct and 
instruct themselves to respond accordingly: 
Response to relevant cues as encountered 
may yield correct responses as the manifesta- 
tion of prior habits. 

2. This mechanism provides a possible ac- 
count of the verbal conditioning effects ob- 
tained in several often-cited experiments. f 

3. The concept of “correlated hypothesis 
requires further explication lest it become too 
convenient a post hoc explanation. To identify 
a correlated hypothesis we may instruct the 
occurrence and nonoccurrence of the response 
class it calls correct and observe a correlation 
of these instructions (or of the instructed 
response class) with presence and absence 0 
members of the response class the exper 
menter calls correct. 


4. The question of learning or conditioning . 


without awareness is made more meaning 
by specifying an awareness more clearly 
relevant to theory. On common theoretic 
propositions, one kind of awareness—a correct 
or a correlated hypothesis—should lead t0 
selection of the correct response class. 
a theory of automatic action of aftereffects 
implies that learning should occur in the ab 
sence of that kind of awareness. 4 
5. With report of the correct or a correlate 
response class as the criterion of awareness 
Experiment I presents no evidence for learning 
or conditioning without awareness. 


i 
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6. The language of operant conditioning, as 
it has been extended to the verbal conditioning 
experiment, does not describe the differences 
in performance of those reporting and those 
not reporting the critical hypotheses. 

7. The usefulness of operant conditioning 
procedures for "identifying" many functional 
response classes at the human level should be 
evaluated against the relative ease of identi- 
fying response classes by using instructions. 

8. The methodological strategy illustrated 
by these experiments—the joint use of reports 


and instructions—may be useful in the analysis 


of verbal control in other learning and condi- 
tioning paradigms. 
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been a central problem in psycho- 

pathology. The elusive resolution of this 
problem is equaled in the clinical sphere only 
by that of the unsettled etiology of the func- 
tional disorders. The comparable lack of 
knowledge concerning prognosis and etiology 
may be consequent to an inherent relation 
between the outcome of a disorder and the 
nature of the disorder. Apart from such theo- 
retical considerations, the problem of prognosis 
is also of concern to the practitioner because 
of its obvious practical import. In view of the 
limited resources of the typical hospital setting, 
the ability to predict outcome successfully 
would be of considerable aid in making ad- 
ministrative decisions and optimally utilizing 
treatment resources. 

A number of reviewers have examined the 
literature concerned with prognosis (Bellak, 
1948; Blair, 1940; Chase & Silverman, 1941; 
Huston & Pepernick, 1958; Langfeldt, 1937; 
Mayer-Gross & Moore, 1944; Phillips, 1949; 
Zubin & Windle, 1954), Certain investigators 
(Pascal, Swensen, Feldman, Cole, & Bayard, 
1953) have been impressed with the plethora 
of studies done on prognosis, while others 
(Schofield, Hathaway, Hastings, & Bell, 1954) 
have been distressed by the dearth of such 
inquiries. There appears to be almost universal 
agreement, however, (Huston & Pepernick, 
1958; Langfeldt, 1937; Malamud & Render, 
1939; Mayer-Gross & Moore, 1944; Pascal 
et al., 1953; Schofield et al., 1954; Zubin & 
Windle, 1954), that studies done in this area 
have produced remarkably few definitive 
findings. One reason for the lack of such 
findings is the failure to meet the methodologi- 
cal problems involved in prognostic Tesearch, 
e.g., Operational definitions of both predictor 
and criterion variables, utilization of an ap- 


F ROM Kraeplin onward, prognosis has 
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propriate time interval in the assessment of 
outcome, control of significant variables not. 
employed as predictors, etc. A comprehensive 
list of such methodological difficulties, ac 
companied by a warning concerning the need 
for their effective handling, was presented by 
Malamud and Render (1939). Evidence that 
this warning failed to deter the production of 
a number of methodologically unsound studies” 
can be found in reviews by Zubin and Windle | 
(1954) and Huston and Pepernick (1958), 
These authors, writing more than 15 years 
after Malamud and Render, advance many 
of the same methodological criticisms noted 
earlier. 

A number of studies have attempted to 
conform to sound methodological require- 
ments, but they have lacked a theoretical 
framework (Bayard & Pascal, 1954; Cole, 
Swensen, & Pascal, 1954; Dunham & Meltzer, 
1946; Feldman, Pascal, & Swensen, 1954; 
Malamud & Render, 1939; Pascal et al., 1953; 
Schofield et al., 1954; Swensen & Pascal, 
1954a, 1954b). This empirical emphasis char- 
acterizes the literature on prognosis and has 
been especially evident in the search for prog- 
nostic predictors among case history data. 
Although the present authors are impressed in | 
principle with the merits of the actuarial 
approach (Meehl, 1954), such empiricism has 
in this instance given rise to a multitude 0 
diverse and incongruent findings. Inconsistent 
and frequently contradictory findings have 
been reported on the relationship of prognosis 
to such variables as age (Chase & Silverman) 
1941; Dunham & Meltzer, 1946; Guttmann, 
Mayer-Gross, & Slater, 1939; Huston f 
Pepernick, 1958; Langfeldt, 1937; Malamu 
& Render, 1939; Pascal et al., 1953; Rennie, 
1939; Rupp & Fletcher, 1940; Schofield et aly 
1954; Silverman, 1941; Stalker, 1939), mari 
status (Chase & Silverman, 1941; e 
& Meltzer, 1946; Huston & Pepernick, 195 ji 
Malamud & Render, 1939; Pascal et 8b 
1953; Schofield et al., 1954), occupation ani 
employment history (Dunham & Metz 
1946; Hunt & Appel, 1936; Huston & Pepe 
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nick, 1958; Kant, 1944; Pascal et al., 1953; 
Schofield et al., 1954; Stalker, 1939), and 
education and intelligence (Bellak, 1948; 
Bowman & Raymond, 1929; Chase & Silver- 
man, 1941; Dewan, 1948; Dunham & Meltzer, 
1946; Felner & Weil, 1955; Huston & 
Pepernick, 1958; Jacob, 1940; Langfeldt, 1937; 
Lewis & Blanchard, 1931; Malamud & Render, 
1939; Malamud, Sands, Malamud, & Powers, 
1949; Pascal et al., 1953; Rennie, 1939, Scho- 
field et al., 1954; Stalker, 1939). In view of the 
observed ambiguity of the relation of these 
variables to prognosis, it is not surprising to 
find that many investigators follow either of 
two paths: They draw their conclusions con- 
cerning the importance of such variables by 
simply counting the observations pro and con, 
or they completely discount the significance 
of such variables as predictors of outcome in 
favor of more dynamic considerations (Blair, 
1940; French & Kasanin, 1941; Strecker & 
Willey, 1928; Sullivan, 1928). In sum, little 
in the way of reliable conclusions can be 
drawn from the extensive literature on prog- 
nosis. 

Continued piecemeal and empirical investi- 
gation of case history items offers little of 
heuristic value toward an understanding of 
Prognosis, What appears to be needed is a 
theoretical framework which can meaningfully 
include such biographical items and thus pro- 
vide them with a conceptual foundation. 
In a series of studies reported earlier (Phillips, 
Kaden, & Waldman, 1959; Phillips & Rabino- 
Vitch, 1958; Phillips & Zigler, 1961; Zigler & 
Phillips, 1960, 1961), the present authors and 
their colleagues have suggested the central role 
of personal and social maturity and have made 
Systematic use of a developmental approach 
to psychopathology. We propose that this 
theoretical orientation can be fruitfully applied 
in the study of prognosis. 

Central to the developmental approach is 
the view that the individual progresses through 
Successive stages or levels of maturity and that 
individuals differ in the final level of maturation 
attained. At each level of maturation, society 
Presents a complex of tasks with which the 
ndividual May cope successfully or deal with 
_ appropriately. That is, for every maturity 

evel there is a normal pattern of adaptation 
ae as a pathological deviation from this 
attern, Psychopathology then represents 


e 
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various forms of inappropriate resolution, The 
various pathologies (syndromes) may be con- 
ceptually ordered and viewed as representing 
inadequate resolutions at successive stages of 
Social maturation (Phillips, 1960; Zigler & 
Phillips, 1960, 1961). 

This position negates the view that 
normality is the absence of pathology and 
suggests instead that pathology can best be 
understood within the context of normal 
maturation. Thus it places major emphasis on 
the adaptive potential of the individual. The 
major implication of this position is that 
remission from the various pathologies is repre- 
sented by the establishment of a successful 
resolution of the individual's adaptive diffi- 
culties at a level appropriate to him rather 
than at some ideal end-state. That is, normality 
per se is not considered identical to the highest 
level of maturity. Increasing maturity, how- 
ever, should allow the individual to bring 
greater resources to bear on the mastery of 
those tasks set by society. Thus in pathological 
individuals a higher degree of maturity should 
imply a greater potential for undoing in- 
appropriate solutions to these tasks. Therefore 
maturity level should bear a definite relation 
to prognosis. 

The present investigators feel that it is pre- 
mature to attempt the construction of a com- 
prehensive index of maturity. Since a measure 
of this sort is lacking, they have employed 
social competence, defined by the variables of 
age, intelligence, education, occupation, em- 
ployment history, and marital status as an 
approximation of personal and social maturity. 
The concept of social competence differs from 
the narrower constructs of social class, skill 
level, or social adjustment. While it may sub- 
sume these narrower measures, it is most 
appropriately viewed as a broad, multidefined 
variable which reflects the developmental level 
of the individual, i.e., the level of maturation 
attained. Evidence that social competence 
is a valid indicator of the individual’s over-all 
level of maturity may be found in several 
studies (Grace, 1956; Lane, 1955; Smith & 
Phillips, 1959) that have reported a positive 
relationship between social effectiveness and 
often employed perceptual indices of develop- 
mental level. Earlier studies have confirmed 
the investigators’ position that a relationship 
exists between social competence and the 


266 


incidence and form of mental disorders 
(Phillips & Zigler, 1961; Zigler & Phillips, 
1960, 1961). It is the purpose here to test 
the proposition that prognosis is also related 
to social competence. 


Stupy I 


Method 


Subjects. This study was based on the examination 
of the case histories of 251 patients admitted to Wor- 
cester State Hospital during a 9-year period (1945- 
1954). The particular case histories employed were 
those of patients who had no previous admissions to.a 
psychiatric hospital, were referred to the hospital psy- 
chology department for psychological appraisal, were 
diagnosed as suffering from a functional disorder, and 
were committed to the hospital following the 40-day 
observation period. Although this sample includes a 
wide variety of cases, extremely deteriorated or very 
agitated patients are seldom referred for psychological 
evaluation and are not adequately represented in this 
sample. 

Social competence score. The variables of age, intelli- 
gence, education, occupation, employment history, and 
marital status were used as social competence indices. 
Each variable was divided into three categories with 
each category conceptualized as representing a step 
along a social competence continuum. The rationale em- 
ployed for ordering the categories of each variable was 
the same as that used earlier (Zigler & Phillips, 1960). 
"The categories of each variable and their order from low 
to high are presented below: 

1. Age: 24 and below; 25-44; 45 years and above. 

2. Intelligence: IQs obtained on a standard intelli- 
gence test of 84 or less; 85-115; 116 and above. 

$. Education: none or some grades including un- 
graded or special classes; finished grade school, 
some high school, or high school; some college or 
more 

4. Occupation: The Dictionary of Occupational Titles 
(United States Government Printing Office, 1949) 
was employed to place each occupation into the 
categories of unskilled or semiskilled, skilled and 
service, or clerical and sales or professional and 
managerial. 

5. Employment history: usually unemployed; sea- 
sonal, fluctuating, frequent shifts, or part-time 
employment; regularly employed. 

6. Marital status: single; separated, divorced, re- 
married, or widowed; single continuous marriage. 

The case history of each individual was examined in 
order to assign him a score on each of the six social com- 
petence indices. For any of the indices, placement in 
the lowest category resulted in being given a score of 0 
for that index. Similarly, assignment to the middle cate- 
gory resulted in a score of 1 and to the highest category, 
a score of 2. The over-all social competence score for 
each person was the mean of the scores obtained on the 
individual indices. This averaging procedure was re- 
quired because data were not available for every indi- 
vidual on all six variables. Thus the final social com- 
petence score for any patient could range from 0 (the 
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lowest category on every variable for which informa- 
tion was available) to 2 (the highest category on every 
variable for which information was available). The dis- 
tribution of social competence scores of the 251 patients 
was cut as close as possible to the median to form High 
and Low social competence groups. 

Outcome measures. The outcome measures employed 
were length of institutionalization; following discharge 
from the hospital, the patient's readmission to any 
mental hospital in Massachusetts; and for those pa- 
tients readmitted, the time interval between discharge 
from the first admission and the second admission. 

Although these measures have the advantage of being 
operational, they can only be considered gross indi- 
cators of prognosis. The length of hospitalization meas- 
ure is especially vulnerable to contamination because of 
its extreme sensitivity to changes in the administrative 
policies of the hospital. The shortcomings of this meas- 
ure are balanced somewhat by the readmission meas- 
ures. These latter indices appear to be more sensitive 
indicators of an individual’s adaptive resources, 


Results 


The High and Low competence groups were 
initially compared on the number of patients 
in each group who were still in the hospital as 
opposed to the number who had been released. 
The results of this 2 X 2 contingency table 
(à = 4.52, p < .025) indicated that a higher 
proportion of patients in the Low as compared 
to the High group had remained in the hospital. 
(One-tailed tests of significance were used 
throughout this investigation whenever tests 
were made of the general prediction that an 
individual’s premorbid social competence was 
positively related to outcome in psychiatric 
disorder.) y ; 

A second evaluation of the relationship 
between social competence score and length of 
institutionalization was computed. The dis- 
tribution of scores on this latter measure Was. 
divided at the median, and a 2 X 2 contingency 


table was set up. The findings (ê = 11.26, 
p < .001) indicated that patients in the Hig 
] ri 


group remained in the hospital for a sho 
length of time than did patients in the Low 
group. In order to assess how much of thi 
difference was due to the time scores of indi- 
viduals who remained in the hospital, t 
analysis was redone, excluding such patients 
from consideration. This analysis reve 
that of the patients who had been releas 
from the hospital, those in the High group hi 
been released aíter'a shorter time than thos? 
in the Low group (2 = 7.53, p < .005). | 
An analysis was then performed on th? 
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number of patients in each of the groups who 
had been rehospitalized versus the number who 
had remained out. This analysis (x? = 4.00, 
p < .025) revealed that a larger proportion of 
the Low as opposed to the High group had 
been rehospitalized. The final analysis was 
conducted on the relationship of social compe- 
tence to the time interval between discharge 
and readmission. The distribution of these 
time intervals was split at the median, and a 
social competence group X time interval con- 
tingency table was set up. The findings associ- 
ated with this table (x? = 1.52, p < .20) 
gave no evidence of a significant relationship 
between premorbid social competence and the 
length of time between release and readmission 
for patients readmitted to a hospital. 


Stupy II 


The findings reported above indicate that 
patients who have manifested a relatively good 
level of social competence are more likely to 
have been released from the hospital, spend a 
Shorter period of hospitalization, and are less 
likely to be rehospitalized than patients whose 
premorbid history was poor. However, the 
nature of the relationship between social 
competence and outcome is still not clear. As 
reported earlier (Zigler & Phillips, 1961), an 
individual’s social competence is related to 
the diagnosis he receives; Therefore, the dif- 
ferences in outcome for the two competence 
groups may be due to differences in their 
diagnostic composition rather than being 
directly related to social competence, 

3 Furthermore, patients high on the particular 
indices employed to measure social compe- 
tence, eg., intelligence, may receive more 
intensive psychiatric treatment. It could be 
argued that the better outcome in the High 
competence group was the result of such 
Superior treatment. A final question can be 
raised concerning the purity of each of the two 
groups on the competence variable. An earlier 
Study (Zigler & Phillips, 1961) revealed that 
all hospitalized patients tend to be drawn 
rom the less socially competent segment of 
the population, The averaging procedure em- 
ployed in Study I and the nature of the dis- 
tribution of competence scores were such that 
2 patient high on but one or two competence 
Indices could be included in the High group. 
These considerations, in conjunction with 
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the finding of Study I, suggested that a more 
refined study was in order. Employing the out- 
come measures used in Study I, groups that 
were clearly high or low on the competence 
variable were compared, special attention being 
given to the diagnostic and treatment factors, 


Method 


High and low social. competence groups. In the present 
study, any patient who fell above the median of the 
distribution on at least four of the six social competence 
indices wasplaced in the High social competence group. 
Any patient who fell below the median on at least four 
of the six indices was placed in the Low social com- 
petence group. This procedure resulted in 30 and 36 
patients being assigned to the High and Low social 
competence groups, respectively, The remaining. pa- 
tients who did not meet one of these two criteria were 
removed from further consideration, The diagnostic 
composition of the two groups appeared to be fairly 
comparable. The High group was composed of 16 indi- 
viduals diagnosed as schizophrenic, 7 manic depressives, 
and 7 psychoneurotics. The Low group was composed of 
25 schizophrenics, 4 manic depressives, 3 psycho- 
neurotics, and 4 individuals diagnosed as suffering from 
a character disorder. An examination of the case his- 
tories of the two groups revealed no systematic differ- 
ences between the groups on psychiatric treatment re- 
ceived. In the High group, 15 individuals received some 
form of convulsive therapy, 5 received psychotherapy, 7 
received a combination of convulsive therapy and 
psychotherapy, with the remaining 3 patients receiving 
routine hospital care. In the Low group, 17 individuals 
received some form of convulsive therapy, 7 received 
psychotherapy, 8 received a combination of convulsive 
therapy and psychotherapy, with the remaining 4 pa- 
tients receiving routine hospital care. 


Results i 


The two groups were initially compared on 
the time interval which had elapsed between 
admission to the hospital and the carrying out 
of the present study. The mean time intervals 
for the Low and High groups were 9.6 and 
10.5 years, respectively. This difference was 
not a statistically significant one (¢ < 1.00). 

The first outcome measure to be analyzed 
was length of institutionalization. Seven in- 
dividuals in the Low and one individual in the 
High group had not been discharged from the 
hospital since their original admission, The 
length of institutionalization ascribed to each 
of these patients was that time interval be- 
tween their admission and the initiation of 
the present study. The mean lengths of insti- 
tutionalization for individuals in the Low and 
High groups were 47.6 and 22.7 months, 
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respectively. This difference is a highly sig- 
nificant one (/ = 3.15, p < .005). In order to 
assess whether this difference was due only to 
the extreme scores of individuals who have 
remained in the hospital, a second analysis 
was done, excluding such subjects from con- 
sideration. Without the seven subjects in the 
Low and the one subject in the High group, 
the mean lengths of institutionalization for 
subjects in the Low and High groups were 
30.9 and 19.6 months, respectively. Although a 
conservative estimate of the difference be- 
tween the two groups, this difference remains 
a highly significant one (¢ = 2.76, p < .005). 
Since the two groups were not perfectly 
matched on diagnosis, an analysis was done 
on length of institutionalization of only the 
schizophrenics in the High and Low groups. 
The mean length of institutionalization for the 
schizophrenics in the Low and High groups 
were 46.4 and 26.2 months, respectively. This 
difference is also a significant one (¢ = 1.85, 
p < 05). 

The next outcome measure to be analyzed 
was return to the hospital. Of the 29 patients 
discharged in the High group, 4 were read- 
mitted to a psychiatric setting. Of the 29 
patients discharged in the Low group, 11 were 
readmitted. This difference is a significant one 
(x? = 4.40, p < .02). Fisher’s exact method was 
employed to see whether this difference ob- 
tained for the schizophrenics alone. Of the 20 
discharged schizophrenics in the Low group, 
6 were rehospitalized, whereas only 1 of the 15 
discharged schizophrenics in the High group 
was rehospitalized. This difference is also a 
significant one (p < .05). 

In regard to the time interval between dis- 
charge and readmission, no significant differ- 
ence was found between the Low (M = 21.6 
months) and High (M = 22.8 months) indi- 
viduals (/ < 1.00). A final analysis was done 
which combined both the length of institu- 
tionalization and the readmission measures. 
Of the 30 patients in the High group, 5 were 
either still in or had returned to a hospital. Of 
the 36 patients in the Low group, 18 were 
either still in or had returned to a hospital. 
The difference is significant (x2 = 844, p < 
.005), It may be concluded that individuals 
in the Low group have both a longer period of 
institutionalization and a greater likelihood 
of rehospitalization than individuals in the 
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High group. Thus the findings of the second 
study confirm those reported in the first. 


Discussion 
Findings of the present study support the 


hypothesis that a positive relationship exists — 


between premorbid social competence and 
prognosis. These findings within the schizo- 
phrenic portion of the sample employed are 
in agreement with earlier reported findings 
(Farina & Webb, 1956; Henderson & Gillespie, 
1936; Hunt & Appel, 1936; Mauz, 1930; 
Phillips, 1953; Stalker, 1939; Strauss, 1931) 
that a relationship obtains between premorbid 
adequacy and prognosis. It should be noted 
that in the study reported here, groups that 
were almost perfectly matched in regard to 
type of psychiatric treatment differed sig- 
nificantly on the outcome measures. It would 
thus appear that premorbid social competence 
is a more significant variable in determining 
outcome in a psychiatric disorder than is 
treatment received. 

Previous studies (Grace, 1956; Lane, 1955; 
Smith & Phillips, 1959) have related social 
competence to performance on Rorschach 
measures of perceptual development. Based 
on the use of these indices, Levine (1959) re 
ported a relationship between length of hos- 
pitalization and developmental Rorschach 
indices. The results of the present study, i 
conjunction with the findings of these earlier 
studies, support the view that a theoretically 
oriented, global approach to the problem of 
prognosis is superior to an atomistic approach 
employing isolated variables. 

Although there is clear evidence that a rela- 
tionship exists between maturity and outcome 
further work must be done in order to isolate 
those specific components of higher develop: 
ment that eventuate in a good prognosis. There 
is a certain face validity in the view that the 
greater the psychological resources of the 1 
dividual the better is his prognosis. Howevet 
it is the present investigators’ belief that ? 
specific delineation of those factors related t0 
a better prognosis is needed. 

Several hypotheses suggest themselves. Tht 
first of these is that a more mature person ca? 
more adequately utilize the opportunities lor 
change afforded by a psychotherapeutic rela 
tionship. Such a view would appear to undetile 
the selection of patients for psychoanalyt 
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and, to a lesser extent, other forms of psycho- 
therapy. Furthermore, it is possible also that 
an interaction effect exists between the effec- 
tiveness of the organic therapies and personal 
maturity. 

A second hypothesis is suggested by the view 
of Phillips and Rabinovitch (1958) that the 
more mature individual is more likely to have 
incorporated the values of society and displays 
considerable guilt and anxiety when he does 
not successfully meet these values. Therefore, 
it may be hypothesized that a maladaptive, 
pathological solution to life’s problems is much 
less acceptable to the developmentally high 
than to the developmentally low individual. 
This unacceptability of a pathological solution 
should result in an improved prognosis. 

A final hypothesis may be related to the 
nature of the problems which precipitate the 
mental disorder. As earlier studies have re- 
ported, in individuals who have displayed 
good premorbid social adequacy, there is a 
tendency for a specific traumatic event to 
precipitate the disorder (Becker, 1956; 
Garmezy & Rodnick, 1959; Kantor, Wallner, 
& Winder, 1953; Phillips, 1953; Weiner, 1958; 
Zimet & Fine, 1959). Antithetically, in indi- 
viduals with poor premorbid social adequacy, 
there is a tendency for the disorder not to be 
preceded by such events. Instead, these indi- 
viduals show an insidious history of in- 
appropriate socialization. Although higher 
levels of maturation correspond to specific 
Problems unique to those levels, there are 
certain basic problems of socialization which 
must be mastered by all individuals, Thus it 
may be hypothesized that the developmentally 
low individual does not have the resources to 
master successfully these basic problems, and 
this lack mitigates against a good prognosis. 
On the other hand, the developmentally high 
individual has mastered these basic problems 
and must only deal with later problems, corre- 
sponding to his higher level of maturity. Since 
these real problems of adulthood are more 
circumscribed, there is a greater likelihood that 
they can eventually be mastered thus reversing 
the pathological process. 

; The findings of the present study offer 
urther evidence that the developmental 
Approach has considerable value for our under- 
Standing of psychopathology. In conjunction 
With the results of earlier studies (Phillips & 
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Zigler, 1961; Zigler & Phillips, 1960, 1961), 
it may be concluded that level of maturity as 
defined by social competence indices is related 
to the incidence of mental disorder, symptoms 
manifested, diagnosis received, and prognosis. 

This body of work is in essential agreement 
with the position that mental disorders are 
continuing processes in which the premorbid, 
initial, middle, and ultimate stages are inter- 
related (e.g., Zubin & Windle, 1954). The 
developmental position advanced in this and 
earlier studies appears capable of providing a 
conceptual framework within which such inter- 
relationships become meaningful. 

This position suggests that psychopathology 
can best be conceptualized as a unitary phe- 
nomenon within which the various types of 
mental disorders are also meaningfully related, 
rather than as a collection of discrete entities, 
Clearest evidence for the validity of this view 
would be the isolation and utilization of dimen- 
sions whose applicability transcends individual 
disorders, being applicable instead to all of 
psychopathology. One such dimension is that 
of social competence as employed in this study. 
'The applicability of this dimension to all of 
psychopathology is demonstrated by the find- 
ing of the present study that the relationship 
between social competence and prognosis is 
found for both schizophrenic and nonschizo- 
phrenic groups. 

This complex of good premorbid adjustment 
and good prognosis has, in combination with 
type of onset, been employed to differentiate 
the reactive from the process type of schizo- 
phrenia. This process-reactive distinction has 
in turn been related to an organic-functional 
dichotomy in etiology (Becker, 1956; Bellak, 
1948; Brackbill 1956; Finkelstein, 1953; 
Kantor et al., 1953). The finding that similar 
complexes of premorbid social competence 
and prognosis may be found within diagostic 
groups whose etiology has conventionally been 
considered nonorganic, e.g., psychoneurotics 
and character disorders, calls into question 
the view that such complexes mirror the 
presence or absence of organic involvement. 
This view is further called into question by 
the finding that type of onset, one variable 
employed in making the process-reactive dis- 
tinction, is also related to the premorbid social 
adequacy (Phillips, 1953). The findings re- 
ported here thus suggest tbat the process- 
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reactive distinction is not unique to schizo- 
phrenia and may be entirely reducible to the 
social competence dimension. 


SUMMARY 


Two studies testing the hypothesis that a 
relationship exists between a patient’s pre- 
morbid social competence and the outcome of 
his disorder were reported. The case histories of 
251 patients were examined in order to estab- 
lish High and Low social competence groups. 
The variables of age, intelligence, education, 
occupation, employment history, and marital 
status were employed as indices of social 
competence. It was found that patients in the 
Low group have both a longer period of institu- 
tionalization and a greater likelihood of re- 
hospitalization than patients in the High 
group. These findings were discussed within the 
framework of the developmental approach to 
psychopathology. 


REFERENCES 


Bayar, J., & Pascat, G. Studies of prognostic criteria 
in the case records of hospitalized mental patients. 
J. consult. Psychol., 1954, 18, 122-126. 

Becker, W. A genetic approach to the interpretation 
and evaluation of the process-reactive distinction 
in schizophrenia. J. abnorm. soc. Psychol., 1956, 58, 
229-236. 

BELLAK, L. Dementia praecox. New York: Grune & 
Stratton, 1948, 

Bram, D, Prognosis in Schizophrenia. J. ment. Sci., 
1940, 86, 378-477. 

Bowman, K., & RAYMOND, A. Physical findings in 
schizophrenia. Amer. J. Psychiat., 1929, 8, 901-913, 

BRACKBILL, G. Studies of brain dysfunction in schizo- 
phrenia. Psychol, Bull., 1956, 58, 210-226. 

CHASE, L. S., & Si.vERMAN, S. Prognostic criteria in 
Eu mireris Amer. J. Psychiat., 1941, 98, 360— 

Core, M., SwENsEN, C., & Pascat, G. Prognostic sig- 
nificance of precipitating stress in mental illness. 
J. consult. Psychol., 1954, 18, 171-175, 

Dewan, J. Intelligence and emotional stability. Amer, 
J. Psychiat., 1948, 104, 548-554. 

pics & UR B. Predicting length of hos- 
pita! ion of mental patients. Amer. J. i 
1946, 62, 123-131. > M so 

eps. A,& ee W. Premorbid adjustment and 
subsequent di: e. J. nerv, ment. Dis. 
124, 612-613, tiec] 

FELDMAN, D., Pascat, G., & Swensen, C. Direction of 
aggression as a prognostic variable in mental ill- 
ness. J. consult. Psychol., 1954, 18, 167-170. 

FELLNER, C., & Wen, P. Low normal intelligence and 
Sehizontirenia, Amer. J. Psychiat., 1955, 112, 349- 

3. 


EDWARD ZIGLER AND LESLIE PHILLIPS 


FINKELSTEIN, Z. A study in schizophreniform psy- 
choses, Acta psychiat. neurol. Scand., Kbh., 1953, 
28, 45. 

Frencu, T., & KasaurN, J. A psychodynamic study of 
the recovery of two schizophrenic cases. Psycho- 
anal. Quart., 1941, 10, 21-22. 

Garmezy, N., & RopNick, E. Premorbid adjustment 
and performance in schizophrenia. J. nero. ment, 
Dis., 1959, 129, 450-466. 

Grace, N. A developmental comparison of word usage 
with structural aspects of perception and social 
adjustment. Unpublished doctoral dissertation, 
Duke University, 1956. 

GurrMANN, E., Mayer-Gross, W., & SLATER, E. 
Short distance prognosis in schizophrenia. J. 
neurol. Psychiat., 1939, 2, 25-34. 

Henverson, D., & GILLESPIE, R. A textbook of psy- 
chiatry, London: Oxford Univer. Press, 1936. 
Huwr, R., & Appet, K. Prognosis in the psychoses 
lying midway between schizophrenia and manic- 
depressive psychoses. Amer. J. Psychiat, 1936, 

93, 313-339. 

Huston, P., & Perernicx, M. Prognosis in schizo- 
phrenia. In L. Bellak (Ed.), Schizophrenia, New 
York: Logos, 1958. Pp. 531-546. 

Jacos, J. Prediction of outcome-on-furlough of demen- 
tia praecox patients. Genet. psychol. Monogr., 1940, 
22, 425-453. 2 

KaNr, O. Evaluation of prognostic criteria in schizo- 
phrenia. J. nerv. ment. Dis., 1944, 100, 598-605. 

KANTOR, R., WALLNER, J., & Winer, C. Process and 
reactive schizophrenia. J. consult. Psychol., 1953, 
17, 157-162. 

Lanz, J. Social effectiveness and developmental level. 
J. Pers., 1955, 28, 274-284, 

Laneretpr, G. The prognosis in schizophrenia and the 
factors influencing the course of the disease. London: 
Humphrey Milford, 1937. y 

Leve, D. Rorschach genetic level and mental dis- 
order. J. proj. Tech., 1959, 28, 436-439. 

Lewis, N., & Brancuarp, E. Clinical findings in “re- 
covered” cases of schizophrenia. Amer. J. Psychiahy 
1931, 11, 481-492. e 

Matamun, W., & RENDER, N. Course of prognosis in 
schizophrenia. Amer. J. Psychiat., 1939, 95, 1039- 
1057. 

Matamop, W., Sanps, S., MArAMUD, I., & POWERS, P 
The involutional psychoses: A socio-psychlatr 
follow-up study. Amer. J. Psychiat., 1949, 105, 
567-572. B. 

Mavz, F. Die Prognostik der engogen Psychosen. Leipzig: 
George Thieme, 1930. rid 

Maver-Gross, W., & Moore, N. P. Schizophrenia: 
Recent progress in psychiatry. J. ment. Sci, 1995, 
90, 231-251. « 5 

MErHL, P. Clinical versus actuarial prediction. Min 
neapolis: Univer. Minnesota Press, 1954. & 

Pascat, G., SWENSEN, C., FELDMAN, D., Cote, M, a 
Bavanp, J. Prognostic criteria in the case T 
of hospitalized mental patients. J. consult. Psychol» 
1953, 17, 163-171. "ET 

Puntes, L. Personality factors and prognosis : 
schizophrenia. Unpublished doctoral dissertatioD: 
University of Chicago, 1949, 


SOCIAL COMPETENCE AND OUTCOME IN PSYCHIATRIC DISORDER 


Pups, L. Case history data and prognosis in schizo- 
phrenia. J. nerv. ment. Dis., 1953, 117, 515-525. 

Puiturs, L. Studies in social competence. Paper read 
at Eastern Psychological Association, New York, 
April 1960. 

Puis, L., KADEN, S., & WatpMan, M. Rorschach 
indices of developmental level. J. genet. Psychol., 
1959, 94, 267-285. 

Puis, L., & RABrINovrrCH, M. Social role and pat- 
terns of symptomatic behaviors. J. abmorm. soc. 
Psychol., 1958, 57, 181-186. 

Purus, L., & ZIGLER, E. Social comptence: The 
action-thought parameter and vicariousness in 
normal and pathological behaviors. J. abnorm. soc. 
Psychol., 1961, 63, 137—146. 

Renn, T. Follow-up study of 500 patients with 
schizophrenia admitted to the hospital from 1913 
to 1923. Arch. Neurol. Psychiat., 1939, 42, 877—891. 

Rupr, C., & FLETCHER, E. Five to ten years follow-up 
study of 641 schizophrenic cases. Amer. J. Psy- 
chiat., 1940, 96, 877-888. 

Scuorretp, W., HaTHAwaAy, S., Hastincs, D., & BELL, 
D. Prognostic factors in schizophrenia. J. consult. 
Psychol., 1954, 18, 155-166. 

Stverman, D. Prognosis in schizophrenia: A study of 
271 cases. Psychiat. Quart., 1941, 16, 477—493. 
Surra, L., & PuiLLrPs, L. Social effectiveness and de- 
velopmental level in adolescence. J. Pers., 1959, 27, 

239-249, 

Srarxer, H. The prognosis in schizophrenia. J. ment. 
Sci., 1939, 85, 1227-1240. 

Srrauss, E. Some principles underlying prognosis in 
schizophrenia. Proc. Roy. Soc. Med., Sect. Psy- 
chiat., 1931, 24, 27-32. 


271 


SrRECKER, E., & Wey, G. Prognosis in schizo- 
phrenia. Proc. Assoc. Res. Nerv. Ment. Dis. 1928, 
b, 403-431. 

SuLLIVAN, H. Tentative criteria of malignancy in 
E Amer. J. Psychiat., 1928, 74, 759- 

87. 

SwENSEN, C., & Pascat, G. Duration of illness as a 
prognostic indicator in mental illness. J. consult. 
Psychol., 1954, 18, 363-365. (a) 

SWENSEN, C., & Pascat, G. Prognostic significance of 
type of onset of mental illness. J. consult. Psychol., 
1954, 18, 127-130. (b) 

Unirep STATES GOVERNMENT PRINTING OFFICE. The 
dictionary of occupational titles, Washington: 
USGPO, 1949. 

Weer, H. Diagnosis and symptomatology. In L, 
Bellak (Ed.), Schizophrenia. New York: Logos, 
1958. Pp. 107-173. 

ZIGLER, E, & Pures, L. Social effectiveness and 
symptomatic behaviors. J. abnorm. soc. Psychol., 
1960, 61, 231-238. 

ZiGLER, E., & Puurs, L. Case history data and psy- 
chiatric diagnosis. J. consult. Psychol., 1961, 25, 
458. 

ZiwET, C., & Five, H. Perceptual differentiation and 
two dimensions of schizophrenia. J. nerv, ment, Dis., 
1959, 129, 435-441. 

ZUBIN, J., & WiNDLE, C. Psychological prognosis of 
outcome in the mental disorders. J. abnorm. soc. 
Psychol., 1954, 49, 272-281. 


(Received August 18, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 63, No. 2, 272-282 


PATTERNS OF CARDIAC AROUSAL DURING COMPLEX 
MENTAL ACTIVITY 


SIDNEY J. BLATT! 
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fields as motivational theory, experi- 

mental psychology, and psychoanalytic 
ego psychology have reflected converging 
emphasis on the organism’s capacity and 
desire to interact effectively with its environ- 
ment. This adaptive motivational force was 
noticed early in animal research in the animal’s 
exploration, search, and observation of its 
surroundings, The adaptive aspect of this 
behavior has been demonstrated by the re- 
search on latent learning and incidental learn- 
ing. Extensive recent research on exploration, 
curiosity, activity, and the need for novelty in 
animals has been well summarized (Berlyne, 
1960; Hebb, 1955; White, 1959) and indicates 
the basic need of the organism to explore a 
novel situation or to create a stimulus change 
in a repetitive and bland environment (Zim- 
bardo & Miller, 1958). These findings, as well 
as the research on the severe effects of sensory 
deprivation (Hebb, 1958; Lilly, 1956) have 
compelled many theorists to formulate the 
organism’s need for effective interaction with 
its environment as a motivational force that is 
relatively independent of any primary drive 
or tissue need (White, 1959; Woodworth, 
1958). 

A similar trend and emphasis has also been 
apparent in the development of psycho- 
analytic theory (Gill 1959), where the ego 
theorists all have highlighted the autonomous 
ego functions: those aspects of the personality 
Structure concerned with adaptation? which 
are functionally independent of instinctual 
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* Adaptation as used in this paper is more than just 
a passive conformity, but rather includes an active, 
creative, constructive coping and interaction with the 

environment, 


I: recent years, trends in such divergent 


drives and derive in part from an independent 
source. Hendrick (1942) also wrote of an 
“instinct to master," an “inborn drive to do 
and to learn how to do,” a “pleasure in ex- 
ercising a function successfully regardless of 
its sensual value.” Fenichel (1945) wrote of 
mastery as a “general aim of every organism 
but not a specific instinct,” ^a pleasure of 
enjoying one’s abilities” that derives from the 
Pleasure of functioning without anxiety. 
Actually this motivational force for exploration 
and adaptive functioning was discussed by 
Bühler (1930) as the Funktionlust and by 
Freud (1927, 1949) when he alluded to the 
fact that ego functions are supplied with their 
own energy independent of instincts, and that 
there is pleasure in their exercise. Intrinsic 
satisfaction in functioning and the desire for 
stimulation and growth are seen in many 
personality theories (Goldstein, 1939; Maslow, 
1954; Rogers, 1951) as the sine qua non of 
mental health and psychological maturity. It 
is these factors that are impaired in pathology, 
distorted by conflict, and thwarted by de 
fensive maneuvers. For some theorists, such as 
Woodworth (1958) and. White (1959), coping 
or dealing with the environment is viewed as 4 
fundamental element in motivation. 
Cognitive processes are an essential com- 
ponent of adaptive functioning. Curiosity, 
exploration, and the need for novelty all 
imply a need for intellectual stimulation. 
Playful exploration, the desire to effect à 
stimulus change, the enjoyment of work and of 
novelty all stress intrinsic satisfaction M 
cognitive functioning. As Hebb (1955) has 
insistently pointed out, we “underestimate the 
human need of intellectual activity” and the 
degree to which man’s activity is spent 2 
raising the level of stimulation and excitement 
Hebb (1955) has conceptualized excitement 
and exploration in physiological terms an 
views it as serving an arousal or vigilance 
function that establishes a level of cortic 
excitation without which learning could not 
occur. Arousal serves as a drive, as an enet- 
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gizer, and efficient learning or functioning is 
only possible when the level of arousal is high. 
In extreme situations, however, such as those 
provoking paralysis from terror or fright, 
excessive arousal interferes with functioning. 
Such an "inverted U shaped" relationship 
between arousal (as measured by autonomic 
variables) and level of performance has been 
frequently demonstrated (Duffy, 1951, 1957; 
Freeman, 1940; Malmo, 1957). There is an 
optimal level of arousal for efficient perform- 
ance, and levels of arousal either below or (in 
extreme situations) above this optimal level 
interfere with effective functioning. The 
inverted U shaped curve, however, has been 
demonstrated mainly in the relationship be- 
tween arousal and functioning in relatively 
simple tasks such as reaction time, rotary 
pursuit, and mirror tracing. Scant data are 
available about the relationship between 
arousal and efficiency of thought in complex 
problems. From the conceptualization of the 
arousal continuum, similar relationships should 
exist between autonomic arousal and complex 
problem solving. The purpose of the present 
study is to explore the relationships between a 
Measure of autonomic arousal (cardiac rate) 
and the efficiency of complex mental activity. 
„Duffy (1951, 1957) proposes two major 
dimensions of arousal: intensity and direction. 
Direction, expressed in dynamic character- 
istics of the behavior that occur concomitant 
with arousal, indicates the degree to which the 
arousal and behavior are goal directed. Thus 
not only should autonomic arousal be high in 
efficient functioning, but the arousal reactions 
should also have direction and should occur in 
Tesponse to important points in the behavior or 
thought process. 

In prior research (Blatt, 1958; Blatt & 
Stein, 1959) a model was developed of the 
Process by which subjects solve a complex 
logical problem. In this model of the problem 
Solving process two crucial points were identi- 
fied: where the subject had available, im- 
plicitly at least, the necessary and sufficient 
information for solution, and where the sub- 
Jects’ predominant activity shifts from ques- 
tions of analysis about one-to-one relationships 
to the more complex questions of synthesis 
that attempt to organize and integrate in- 
ormation to achieve solution (analysis- 
Synthesis shift). These two points, which 
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subjects are not able to report, delineate three 
phases of the problem solving process. The 
initial phase extends from the beginning of the 
problem to the point at which the subject has 
available implicitly the necessary and sufficient 
information for solution. This is followed by 
the lag phase, in which the subject gathers 
additional information prior to shifting from 
analysis to synthesis. In the synthesis phase 
the subject’s behavior reflects primary concern 
with coordinating the information he has 
obtained. 

The body of research and theory reviewed 
at the outset leads to the expectation that 
efficiency in complex mental activity should 
be characterized by heightened arousal, which 
should occur, in part, at important points in 
the thought process. The following hypotheses 
were the focus of the present study: 

1. Efficient problem solvers have a higher 
level of cardiac rate and a greater variability 
of cardiac rate than inefficient ones during 
complex mental activity. 

2. Among efficient problem solvers, eleva- 
tions in the cardiac rate occur at those points 
in the thought process at which necessary and 
sufficient information for solution has become 
available, at which the predominant activity 
has changed from analysis to synthesis, and at 
solution. 


METHOD 


Concurrent recordings of cardiac rate were obtained 
as subjects attempted to solve problems on the John- 
Rimoldi Problem Solving Apparatus (PSI). Since a 
more detailed description of the PSI apparatus is avail- 
able (Blatt & Stein, 1959; John, 1957), only a brief 
description of the apparatus is presented here. A 
diagrammatic representation of the demonstration 
problem and its solution are presented in Figure 1. 

On the apparatus the subject is presented with a 
panel containing a circular array of nine lights, plus a 
center light. Next to each of the nine outer lights there 
is a button, which, when pressed, lights up its cor- 
responding light. Some of the lights are interrelated, so 
that when Button A is pressed, Light A comes on, 
followed in the next time cycle (3 seconds later) by 
Light B, with which it was related. The existence of a 
relationship between the lights is indicated by arrows 
on a removable disc, and there are different discs for 
each problem, An arrow indicates one of three types of 

į ips: 
ok direct one-to-one effect—such that the 
activation of A causes B to light in the next time 

n al. 

E. facilitatory or combiner effect—such that A 

plus another light, X, which also has an arrow going 
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Fic. 1. Diagrammatic representation of the disc and the solution sequence of the demonstration problem. 


to B, can light B if, and only if, A and X are lit 

simultaneously. 

3. A blocking effect—such that A prevents X 

(which has an arrow to B) from lighting B, i.e., X 

can light B only if A has not been activated. The 

subject is instructed about the types of relationships 
that exist on the apparatus, but he is left to discover 
or infer the specific relationships within each problem. 

In each problem the subject's task is to discover the 
one correct sequence of the three buttons at the bottom 
of the circle that will light the center light, which has no 
activating button. The subject may use any of the 
buttons that he wishes to discover relationships, but 
may use only the three buttons at the bottom in the 
final solution. 

Thus on the apparatus the subject must press 
buttons to gather information about the logical rela- 
tionships within the problem and must also press but- 
tons when testing his various attempts to integrate this 
information. All button presses are automatically re- 


corded, creating a complete record of the problem solv- 
ing performance. The subject's problem solving process 
can be reconstructed sequentially, step by step; and 
each of the subject’s responses identified as necessary 
for solution or not. Each sequence of interacting button 
presses is regarded as putting a question to the ap- 
paratus and the number of unnecessary questions the 
subject asks while solving the problem measures is 
problem solving efficiency. Also in the sequential 
analysis of the responses, the crucial points of nec 
essary and sufficient information and analysis-Syn- 
thesis shift can be identifed for each subject. TM 
analysis-synthesis shift point is the point that bes 
meets the following two conditions: most of E 
analytic questions have already been asked, and s 
of the synthetic questions still remain to be asked. 
Analytic questions are defined as those in which one 
side of the question is unity and the question is ask 

to find the constituent parts of that unity (Duncker, 
1945): there is one cause and one effect, or one cause 
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and multiple effects, or multiple causes with a single 
effect. Synthetic questions are those in which there are 
multiple causes and multiple effects, where the ques- 
tion involves an attempt to integrate information. The 
analysis-synthesis shift point is located where the 
difference between the percentage of analytic questions 
asked and the percentage of synthetic questions asked 
is maximum (a/A— s/S = Maximum). (a and s are, 
respectively, the number of preceding analytic and 
synthetic questions.) This maximum value also in- 
dicates the degree to which the problem solving process 
was organized into these two phases. For example, if 
all the analytic questions had already occurred before 
a specific point (a/A = 100) and all the synthesis 
questions were still to be asked (s/S = 0), then the sepa- 
ration of the phases would be perfect or 100. A less clear 
Separation between the two phases would be cor- 
respondingly indicated by smaller values. 

Cardiac rate was recorded by chest leads throughout 
the entire experimental procedure of instructions, 
practice problem (presented to the subjects as first 
experimental problem), experimental problem, and four 
interposed rest periods (10 minutes each). Cardiac rate 
was measured for each 30-second interval during the 
entire experiment, the measure of heart rate being 
prorated to 1(o beat at each end of these intervals. The 
mean and standard deviation of these values were 
obtained for each of the experimental periods. Though 
the issue of response specificity (Lacey, 1950) arises, 
recent research (Schnore, 1959) indicates that heart 
Tate reflects most consistently the general level of 
arousal, Cardiac rate was selected as the physiological 
measure of arousal in the present study also because it 
18S among the more reliable autonomic measures, re- 
Sponds rapidly, and can be recorded continuously with- 
Maret limiting the subject's activity and move- 

Eighteen first- and second-year male graduate 
Students who had volunteered to take part in a study 
of problem solving served as subjects. They were paid 
$6.00 each for participating in the study, which at the 
maximum required 3 hours. After the experimental 
Procedures the subjects were asked for retrospective 
See of their problem solving process, their con- 
te pastor of the problem, the methods by which 
E i empted to solve it, their awareness of any 

a moments, their feelings about the problem, and 
E: gree of arousal they experienced during the 
urse of the experimental problem. 


RESULTS 
Efficient vs, T. nefficient Problem Solvers 


the basis of the problem solving per- 
ae on the experimental problem, the 
AES Se were divided into two groups of 
E: ds cient? (11-50 unnecessary questions) 
E inefficient” subjects (73-161 unneces- 
ES mesi. All subjects completed the 
j ia igo 2 of the 9 inefficient subjects 
WEE the experimental problem in the 
ed time of 1 hour. 
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TABLE 1 
ANALYSIS OF VARIANCE OF MEAN CARDIAC RATE FOR 
EFFICIENT AND INEFFICIENT SUBJECTS 


Subjects Source MS F ? 


Total (N — 18) | Groups 445.41 | 3.05 | .10 


Error term 145.96 
Occasions 43.79 |10.85 |«.001 
Group X Occasion 18.68 | 4.63 |<.001 
Error term 4.04 
Efficient Occasion 28.22 | 7.68 |<.001 
(N = 9) Error term. 3.67 
Inefficient Occasion 34.25 | 7,78 |<.001 
(N =9) Error term 4,40 
TABLE 2 


Mean CARDIAC RATE OF EFFICIENT AND 
“INEFFICIENT SUBJECTS 


Efficient 

ion subjects s ? (one- 
dm OPES DES taie) 

Tnitial rest (A) 87.32 84.18 ns 

Instructions 88.60 — 85.26 ns 

Rest (B) 88.18 | 83.82 ns 
Practice problem 96.24 87.92 <.10 
Rest (C) 89.28 80.82 <.05 
Experimental problem 94.16 80.80 «.01 
Rest (D) 87.50 75.84 <.01 


The mean and the standard deviation of 
cardiac rate for each of the seven experimental 
occasions were obtained from the 30 second 
measures of cardiac rate. Table 1 presents a 
comparison of the mean cardiac rates, using a 
Lindquist (1953) Type I analysis of variance. 
The total between-group difference in cardiac 
rate for the entire experimental procedure 
approached significance at the .10 level, 
suggesting that the efficient group had a 
higher over-all cardiac rate. There was a 
significant total between-occasion difference 
(<.001), indicating that the various experi- 
mental occasions evoked different levels of 
cardiac rate. The between-occasion difference 
within each of the groups was highly signifi- 
cant (<.001), indicating that the different 
experimental occasions evoke changes in rate 
for both the efficient and inefficient subjects. 

Table 2 presents the mean cardiac rate for 
the two groups during each experimental 
occasion and a comparison by / test of the 
groups during each of the occasions. The two 
groups did not differ significantly in mean 
cardiac rate during the early experimental 
conditions of initial rest and instructions. 
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After the instructions, however, the cardiac 
tate of the efficient subjects became signifi- 
cantly greater than that of the inefficient 
subjects. The difference between the two 
groups reached statistical significance during 
the practice problem and continued significant 
for the remainder of the experiment, being 
particularly marked during the experimental 
problem. 

To test the differences in level of cardiac 
rate within each group between the different 
occasions, the Lindquist (1953) procedure for 
testing simple effects in Type I analysis of 
variance with significant Group X Occasion 
interaction was used. Within the efficient 
group the cardiac rate during the practice and 
experimental problems was significantly 
greater (<.05) than it was in the other con- 
ditions. Within the inefficient groups the mean 
cardiac rate during the practice problem was 
significantly greater (<.05) than during all 
the other occasions except the initial rest and 
the instructions. However, the cardiac rate of 
the inefficient group during the experimental 
problem was significantly lower (<.05) than 
it was during the initial rest, instructions, and 
practice problem. 

In comparing the variability of heart rate 
of the two groups during the seven occasions, 
the same statistical procedures were employed. 
The standard deviation of each subject’s 
cardiac rate was obtained for each of the 
experimental occasions from the 30-second 
measures of cardiac rate. Table 3 presents a 
comparison of these standard deviations of 
cardiac rate, using a Lindquist Type I analysis 
of variance. The total between-group difference 
was significant at less than the .025 level, 
indicating a greater over-all intraindividual 
variability of cardiac rate for the efficient 
group. There was a significant total between- 
occasion difference (<.001), indicating that 
the various experimental occasions evoked 
different levels of variability. The between- 
occasion difference within each group was 
obtained since there was a significant Group 
X Occasion interaction. The between-occasion 
difference within the efficient group was 
highly significant (<.001); however, the 
between-occasion difference for the inefficient 
group was not significant and in fact, ap- 
proached zero. 
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TABLE 3 
ANALYSIS OF VARIANCE OF STANDARD DEVIATION oF 
CARDIAC RATE FOR EFFICIENT AND 
INEFFICIENT SUBJECTS 


Subjects Source d| MS| FR p] 

Total (V = 18) Groups 1 [13.71 | 7.01 |<.025 
Error term 6 | 1.96 
Occasions 6 | 3.01 | 5.28 |<.01 
Group X Occasion | 6 | 2.85 | 5.00 |<.001 
Error term 96) .57 

Efücient (N — 9) | Occasion 6 | 5.69 | 5.95 |<.001 
Error term 48| .96 

Inefficient (V = 9)| Occasion 6| .16]| .88| ns 
Error term 48| .18 

TABLE 4 


MEAN STANDARD DEVIATION OF CARDIAC RATE FOR 
EFFICIENT AND INEFFICIENT SUBJECTS 


Efficient Inefficient 

Occasion subjecta teles 2 (one 
Initial rest (A) 3.628 3.380 ns 
Instructions 3.850 3.366 ns 
Rest (B) 4.112 3.882 ns 
Practice problem 7.530 3.866 <.01 
Rest (C) 4.274 3.678 ns 
Experimental problem 6.794 3.180 .005 
Rest (D) 3.934 3.534 ns 

TABLE 5 


ANALYSIS OF VARIANCE OF CARDIAC RATE OF 
EFFICIENT AND INEFFICIENT SUBJECTS 
DURING Six OCCASIONS OF THE 
PROBLEM SOLVING PROCESS 
(Experimental problem only) 


Subjects Source díj| MS | F 

Total (N = 18) | Groups 1 |2002.26 |10.04 | -005 
Error term 16 | 199.41 

Occasions 5| 46.32 | 4.68 |<.00l 

Group X Occasion | 5| 44.09 | 4.46 |<-005 
Error term 80| 9.89 

Efficient Occasion s| 89.20 | 5.42 |<.00! 
W = 9) Error term 40 | 16.45 

Inefficient Occasion 5| 1.21] .36] 9 
(N = 9) Error term 40| 3.32 


Table 4 presents the mean standard devi 
ation of cardiac rate for the two groups during 
each of the experimental occasions and 4 
comparison by # test of the groups during each 
of the occasions. Though efficient subjects wer? 
significantly more variable in cardiac rate 
over the entire experiment, this difference 
resulted mainly from the significant difference 
in variability between the two groups on thé 
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Fro. 2. Cardiac rate of efficient and inefficient groups during the three phases and three points of the experi- 


mental problem, 


practice problem (<.0 i 
E M VAS 1) and the experimental 

To test the differences in variability of 
cardiac rate between the different occasions 
S each of the groups, the Lindquist 
^ 3) procedure for testing simple effects in 
E i analysis of variance with significant 
5 En. Occasion interaction was used. Within 
o €Ucient group the same pattern was found 
3 pou of cardiac rate as was observed 
B oe of the levels of cardiac rate: 
um wed was significantly greater (<.05) 
fan B ie practice and experimental problems 
With uring the other experimental occasions. 
Eis. the inefficient group there were no 

cant differences in variability between 
Br the occasions. 

5 mue thus far support the hypothesis 
ore i problem Solvers are significantly 
E. = Ims variable than inefficient sub- 
CN rdiac rate while attempting to cope 

a complex cognitive problem. 


Cardiac Arousal at Crucial Points in Problem 
Solving 

The second hypothesis states that the 
cardiac arousal of efficient subjects should 
occur at crucial points in the problem solving 
process, The three crucial points and the 
three phases of the problem solving process 
that they demark, identify six occasions for 
analysis. By a Lindquist (1953) Type I 
analysis of variance, the cardiac rates for the 
two groups of subjects during each of the six 
occasions were compared (Table 5)? The 


3 A comparison of the intraindividual variation of 
cardiac rate during these six conditions of the experi- 
mental problem was not possible since the measure of 
cardiac rate at the three points of the process is based 
on heart rate during one 30-second interval measure, 
In each of the three phases of the process however, 
efficient subjects had significantly greater variability 
of cardiac rate than inefficient subjects and there were 
no significant within-group differences in variability 
for the three phases. 
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Fic. 3, Cardiac rate of two efficient subjects during the experimental problem. 


total between-group difference of cardiac rate tion of the between-occasion differences 2 
during the experimental problem was sig- each group separately showed them to 1 
nificant (<.005), and as was indicated earlier, almost wholly attributable to the B 
the efficient subjects had a higher cardiac group. These findings are presented graphically 
rate. There was also a significant total be- in Figure 2. Educ 
tween-occasion difference (<.001). Examina- Using a / test for correlated distribution 
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Fic. 4. Cardiac rate of two inefficient subjects during the experimental problem. 


(McNemar, 1955), the cardiac rate at the 
Points of necessary and sufficient information, 
analysis-synthesis shift, and solution were 
“pared with the mean cardiac rate of the 
ee Md after each of these points. 
Š indicated in Figure 2, efficient subjects had 
evations in cardiac rate at all three points, 
which were significantly greater than the 
i rate of the phase that preceded 
e Ae A phase that followed each point. For 
ee i us group, on the other hand, only 
mi m cant difference was noted, between 
SE of necessary and sufficient information 
es mean value of the lag phase. Figure 
dr b. individual records of the two effi- 
tha de jects who most clearly demonstrate 
pes ME Figure 4 presents individual 
[beg RG inefficient subjects. These data 
Bids e hypothesis that elevations in 
ER da of efficient subjects accompany 
Biens opments in their problem solving 
Eu ga end of the experimental problem 
ject was asked for a retrospective 


report of his problem solving. None of the 
subjects was aware of, or conceptualized, the 
crucial points of the process. Subjects were 
also asked for a description of their feelings 
while solving the problem and to rate, on a 
six-point scale, their degree of “arousal, 
anxiety, or tension’ while solving the experi- 
mental problem.‘ With the score of 6 indicating 
maximum arousal, anxiety, or tension, the 
mean score for the efficient group was 2.71, 
lower, though insignificantly so, than the 
mean score of 3.57 for the inefficient group. 


DISCUSSION 


Arousal, as measured by elevations in cardiac 
rate, is clearly related to the efficiency with 
which subjects solve the complex logical 
problems of the present study. Efficient sub- 
jects had significant increases in cardiac rate 
and variability of cardiac rate during problem 


4 Four of the 18 subjects were not given the ques- 
tionnaire about their feelings of arousal during the 
experiment; for this analysis there are 7 rather than 9 


subjects in each group. 
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solving both in comparison to their own resting 
baseline and in comparison to the pattern of 
cardiac response of inefficient subjects. These 
findings support the hypothesis that arousal is 
characteristic of efficient functioning and that 
arousal seems to have an adaptive or facili- 
tating effect. The facilitating effect of arousal 
may be, as Hebb (1955) postulates, “to tone 
up the cortex with a background supporting 
action that is completely necessary if messages 
are to have their effect"; for “without a 
foundation of arousal the cue (or learning) 
function cannot exist." In a recent exploratory 
study, Beckman and Stein (1961) report that 
efficiency on the PSI apparatus correlates 
significantly with reduced amounts of alpha 
in resting EEG records. Their findings of a 
general "cortical excitation" in the efficient 
subject are consistent with those of the present 
study and tend to support the interpretation 
that autonomic arousal during efficient func- 
tioning reflects the general tendency toward 
higher levels of cortical excitation. 

The results of the present study also in- 
dicate that arousal is not a total reaction but, 
rather, occurs differentially and, in part, at 
crucial points in the problem solving process, 
Efficient subjects had significant elevations of 
cardiac rate at three crucial points in the 
problem solving process even though they 
could not identify or conceptualize such 
points or stages in retrospective report. 
Arousal at these crucial points in the problem 
solving process may not only be indicative of 
the implicit understanding that efficient 
subjects have of the problem, but may also 
Serve as a stimulus through which the subject 
begins to make more explicit his implicit 
conceptualization of the problem. Thus in 
addition to facilitating the cue or learning 
function, arousal, or variations in arousal, 
may also have cue properties. The stimulus 
properties of autonomic responses have been 
extensively discussed by Lacey and Lacey 
(1958). 

In the relationship between problem solving 
efficiency and autonomic arousal, the vari- 
ability of autonomic response seems to be as 


5 Cardiac arousal at these crucial points in the 
thought process which are not reported by subjects has 
been discussed in an earlier paper (Blatt, 1960) as the 
intuitive, sensitive, affect-laden functioning frequently 
conceptualized as preconscious. 
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important as the level of arousal. Lacey and | 
Lacey (1958) have discussed the degree of | 
autonomic fluctuation as a reliable individual | 
characteristic. They found that fluctuations 
in heart rate and skin resistance during a 
resting state were significantly related to 
motor impulsivity and to the length of time 
an individual could maintain maximal readi- 
ness for response. They concluded that though | 
autonomic variability may have disruptive | 
effects in motor tasks, there are also indica- 
tions that it has a facilitatory effect on re- 
ceptor-cortical functions. In a variable arousal 
pattern, subjects can respond differentially to | 
various aspects and elements within a situ- 
ation, and these differential reactions may 
Serve to facilitate functioning. If the facili- 
tating effect of arousal is to “tone up the 
cortex” (Hebb, 1955), a constant state of 
arousal, regardless of level, may reach habitu- 
ation. Changes and variations in arousal level 
may be more effective in maintaining a general | 
level of cortical excitation. At the extreme 
ends of the arousal continua. where arousal is 
exceedingly low or exceedingly high, there is 
relatively little variability of response. It i$ 
only in the middle ranges of the arousal 
continua where reactions can vary and fluc- 
tuate. Therefore, this capacity for differential 
reaction and response may be the important 
facilitating effect rather than the absolute level 
of the arousal. 4 
Some of the psychological and motivational 
counterparts of autonomic arousal seem to be 
the positive attraction of a complex and 
challenging task. In response to a question 
naire about their feelings during the exper 
mental problem, efficient subjects frequently 
commented in retrospect about the “satis 
faction of working out the problem—it h^ 
like trying to play a musical instrument; 
“elation as solution was reached—the problem 
was rather stimulating,” “the problem Wa 
lots of fun, I enjoyed it.” Inefficient subjects 
frequently wrote of “frustration, but I knew 
would solve it if I kept thinking in terms 9' 
it," "annoyance that you must go along only 
one path and continually get stuck at the 
same point along that path,” “elation whet 
pressing a button caused a reaction whit 
fitted into my preconceived ideas, but de 
termination when I saw the end in sight, 
"interesting, required concentration ® 
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throughout.” Though the cardiac arousal and 
comments of intrigue, elation, stimulation, 
and excitement of the efficient subjects can be 
interpreted as indicative of a greater level of 
involvement, motivation, or a feeling of 
success, in many ways inefficient subjects 
seemed equally well motivated. Subjects had 
no idea of their relative degree of efficiency 
for no information was given them through 
which they could judge their performance 
other than having been told of the time limit 
and that this was a “reasonable amount of 
time.” On reaching solution, inefficient sub- 
jects frequently expressed great satisfaction at 
having solved the problem. In terms of the 
output of physical energy, inefficient subjects 
pressed more buttons at a much faster rate 
than the efficient subjects, and in this sense, 
worked harder. Inefficient subjects also 
tended to report a higher degree of experienced 
arousal or tension. 

Thus the difference in arousal patterns of 
efficient and inefficient subjects does not seem 
to be simply an issue of the degree of motiva- 
tion, but rather more one of the type of 
Motivation. As suggested in their spontaneous 
comments, efficient subjects seemed freer from 
internal needs and pressures and were better 
able to attend to and appreciate the nuances 
and subtleties of the problem and to see it as 
an exciting and intriguing puzzle. Inefficient 
subjects seem to express in their comments a 
heed to master, control, or impose their own 
preconceived structure on the problem rather 
pn naturally following the leads that evolve 
ni the problem. The playful exploration and 

eelings of discovery of the efficient subjects 
EDS more conducive to an appreciation of the 
saad a and complexities of the problem 
: © more efficient functioning. In earlier 
esearch (Blatt & Stein, 1959) efficiency on the 
I was found to have a highly significant 
aud correlation with the esthetic value of 
he dune Vemon-Lindzey Scale of Values. 
iL etic value, with its emphasis on form. 
armony, seems to typify this playful 
exploration and the fi : 
D he freedom to attend to and 
ate the enjoyable and exciting aspects 
on ex environment. 
€ Importance of playful exploration and 
p 
Eje See” exercise of ego functions 
deis vues a major focus of recent develop- 
experimental psychology and in 
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personality theories. In developmental proc- 
esses the role of play is seen as a crucial 
aspect for the objective recognition of the 
environment. As Schachtel (1954) states, it is 


-.. only thought which is sufficiently free from urgent 
needs or fears that can contemplate its objects fully 
and recognize it in relative independence from the 
thinker’s needs and fears—that is, as something ob- 
jective. In thinking about a problem one is usually 
successful only if one does not press too hard for solu- 
tion; that is, one is more likely to be successful if the 
thought is truly object-centered, free to contemplate 
the object from all sides, than if the thought is goal- 
centered, under the pressure of having to produce a 
solution immediately. 


It is this playful exploration, this exciting 
appreciation of the problem from all sides, 
and the intrinsic satisfaction in functioning, 
which seems to be the psychological counter- 
part of the adaptive arousal indicated by the 
elevations in cardiac rate during efficient 
problem solving. 


SUMMARY 


Concomitant recordings of heart rate were 
obtained from 18 male young adult subjects 
during complex problem solving on the John- 
Rimoldi PSI apparatus. The level of cardiac 
rate and its variability were compared for rela- 
tively efficient and inefficient subjects. Though 
the two groups of efficient and inefficient 
subjects were initially similar in cardiac 
patterns, there was a highly significant in- 
crease in cardiac rate and variability in the 
efficient subjects while they were attempting 
to solve the problems. These increases in 
cardiac rate and variability of cardiac rate of 
efficient subjects were significantly greater 
than their own initial resting baseline as well 
as being significantly greater than the changes 
in cardiac patterns of the inefficient subjects. 
The elevations of cardiac rate of efficient 
subjects occurred, in part, at crucial moments 
in the thought process: where necessary and 
sufficient information for solution was avail- 
able, where the subject’s predominant activity 
changed from analysis to synthesis, and at 
solution. 

These findings were discussed in terms of 
the role of autonomic arousal in efficient 
functioning and in terms of some of the possi- 
ble psychological or motivational counterparts. 
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PRAISE AND CENSURE AS MOTIVATING VARIABLES IN THE 
MOTOR BEHAVIOR AND LEARNING OF SCHIZOPHRENICS! 


ROY C. LONG? 
University of Texas 


established the performance deficit and 
learning impairment of schizophrenics in 
general (Babcock, 1933; Boring, 1913; Hull, 
1917; Kent, 1911). More recent experimental 
work, preceded by extensive clinical observa- 
tion, has also demonstrated the hypersensi- 
tivity of schizophrenics and the detrimental 
effect of censure upon their performance in a 
variety of tasks, whereas praise seems to evoke 
responses similar to those of normals (Bleke, 
1955; Garmezy, 1952; Hahn, 1956; Webb, 
1955). Olson (1958) found that while a negative 
experimental condition (representing social 
punishment) may not be deterimental to the 
performance of schizophrenics, it has less 
effect than a positive condition in causing them 
to better their performances. A heightened 
generalization gradient has been shown to char- 
acterize the schizophrenic patient (Dunn, 1954 - 
Garmezy, 1952; Mednick, 1955), apparently 
linked to the learning impairment and reaction 
to censure, Similarly, Mednick (1958) has 
differentiated high drive schizophrenics 
a from low drive schizophrenics (chron- 
At thus appears that acutes may be easily 
stimulated, particularly by censure, because 
ego defenses have not been tightened suffi- 
py at the early stage of the disorder to 
ale high drive associated with threat of dis- 
ees Clinical observations have generally 
m ps the converse to be true of the chronically 
Ims ey Bs more rigid, with concretized de- 
Boat nn are usually quite difficult to arouse 
Pd eir state of seclusiveness. An attitude 
des Pproval from those in their environment 
ee taken its toll with a resultant 
a drive level. Fromm-Reichmann (1950) 
ascribed the seclusiveness of the severe 
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Í js now several decades since investigators 


Schizophrenic patient to the wish to avoid 
“another rebuke in a long row of thwarting re- 
buffs which the schizophrenic has experienced. 
in childhood and conditioned him to expect in 
repetition." 

Several investigators (Farber & Spence, 


‘1953; Spence & Taylor, 1953; Taylor, 1956; 


Taylor & Spence, 1954) have tested predictions 
derived from learning theory about perform- 
ances in simple and complex tasks under differ- 
ing drive intensities. Interfering response tend- 
encies under high drive have been considered 
to impair efficiency on complex tasks, whereas 
high drive facilitates performance on simple 
conditioning tasks. The present study at- 
tempted to extend the assumptions of Hull 
(1943) about drive level to encompass the type 
of social situation (censure or praise) in which 
the schizophrenic typically manifests perform- 
ance deficit. 

The social condition of censure was con- 
sidered as a threatening, drive arousing situa- 
tion which further increases the individual's 
general drive state. On the other hand, it was 
assumed that a praise condition could be reas- 
suring, modifying a subject's need to achieve 
and decreasing his drive state. A neutral or 
nonevaluative condition would seem to have a 
motivational effect somewhere between that of 
praise and that of censure. The design provided 
tasks whereby a subject could, for example, 
demonstrate increased drive level on a second 
task, if he had been censured during the first 
task, by increasing his output or efficiency on 
the second task and gain a’ tangible reward 
(cigarettes). The effects of praise and of non- 
evaluation during the first task, both immedi- 
ate and in carrying over to the second task, 
could be determined in the same manner. 

The primary purpose of the study was to in- 
vestigate what effect the over-all drive state 
of the schizophrenic, produced through the 
interaction of initial drive level matched with 
social condition, has in influencing his gross 
output (operationally defined as the number of 
trials a subject took during a task). Major 
Hypotheses 1 and 2 pertained to this question, 
and secondary Hypotheses 3, 4, 5, and 6 were 
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concerned with what the gross output accom- 
plishes either in immediate change in per- 
formance on a simple discrimination task (A) 
or in efficiency on the second and more complex 
learning task (B), involving motor responses 
in a dominant-nondominant multiple-choice 
situation. (Efficiency on Task B was defined as 
the number of correct responses among all 
those which the subject made on the task.) 
The interaction between socially rewarding, 
neutral, or socially censuring stimuli and the 
original drive level of the individual seemed 
attainable by imposing one of the three types 
of social condition upon one of the two degrees 
of drive level, thereby creating a “set” under 
which the individual would work. Specific 
hypotheses are elaborated in the presentation 
of the results. 


METHOD 


Subjects. Seventy-two adult male patients from a 
Veterans Administration neuropsychiatric hospital 
served as subjects, Age ranged from 20-60. Thirty-six 
had been psychiatrically diagnosed as being in the acute 
(high drive) stage of schizophrenia; the remaining 36 
were classified as chronics (low drive). The usual sub- 
types of schizophrenia were included. No subjects with 
organic brain involvement or with signs of mental 
deficiency were used. All acute subjects were of similar 
age, intellectual function, educational attainment, and 
approximate length of illness; the same was true for 
chronics. It was not the intent of this study to match 
acutes with chronics but to investigate the distinct 
responsiveness of both groups. Matching would involve 
interference with the differentiating characteristics. 
Twelve of the acutes and 12 of the chronics were 
randomly assigned to one of the three social condition 
subgroups (praise, neutrality, censure). Everyone was 
tested individually on both tasks, 

Task A. The first task was a visual-motor discrimina- 
tion situation, the primary purpose of which was to 
Serve as a means of subjecting a subject to praise or 
censure. It required him to transfer colored marbles as 
fast as he could from a bin on the left-hand side of a 
table to appropriate holes in a panel set flush with the 
table surface on the right-hand side. A bin beneath the 
holed panel caught the inserted marbles, but only after 
they had interrupted an electronic beam. As each 
marble broke the beam, this activated a Veeder counter, 
and the total number of marbles inserted during the 
time period of 7 minutes was recorded. 

The counter reading at the end of the first minute 
of Task A represented the subject’s base rate. No 
censure or praise was given up to this time. In evaluat- 
ing later performance changes, the rate for the first 
minute was used as a base for comparison, though it 
was not included in the time period (second through 
seventh minute) during which the effect of the condi- 
tions was intended to apply. A pilot study proved 

fatigue to be a negligible factor during this time interval. 
Total performance scores, inclusive of base rate, were 
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computed for all six of the experimental groups. 
differences between type of subject, as well as | 
conditions, could be given an over-all evaluatio 
was also done for the base rates to see how 
acute subgroups, as well as chronic subgroup 
matched. Since information as to the effect of. 
censure or praise on performance rate in compa 
with the base rate (whether high or low) w 
primary aim, computation to supply this wa: 
each individual prior to statistical analysis. The: 
used to derive a measure of the increase or 
performance (mean number of marbles i 
minute) is shown below: 


198.6 Number of marbles inserted di 
entire 7 minutes 
265 Minus the Base Rate 
6)172.1 Total marbles inserted during. 
to seventh minutes : 
28.7 Mean number of marbles/minute du 
second to seventh minutes — — 
26.5 ^ Comparison with Base Rate 
T2 Change (increase) in number of m 


minute 
(It can be seen that a negative number of 
minute would indicate a decrease.) : 
‘The general instructions given for Task A 
a demonstration of how the colored marl 
inserted one at a time into a hole of mai 
When this testing period was ended, it was 
only that the subject slide his chair slightly 
move from the table and place himself before: 
board of Task B, which was sitting just to 
of the table. 

Task B. The second piece of equipment c 
of four red, jeweled reflectors, illuminated by 
pilot bulbs arranged horizontally on a panel 
push button was situated 3.5" directly below! T 
Illumination of the reflectors was made in 
order, and each one stayed lighted for a p 
seconds unless it was turned off by the subject's 
the correct button to extinguish it. If the correc! 
was not pressed, the reflector remained lighted 
full 7 seconds, followed by a 2-second interval 
which none of the reflectors were lighted; then 
one in random order was turned on aul 
through the use of appropriate electrical con! 
revolving drum. A chute and transpare 
receptacle was provided, into which a cigarette | 
whenever the subject extinguished a light by 
the correct button, Since the subject a 
tangible reward being given for his effort, 
assumed that achievement need could be sai 
accumulated these valued objects, even 
verbal comment was made to him. 

In Task B, initially dominant (pre-e: pei 
acquired) position habits were arbitrarily € 
as either correct or incorrect. Morin and Gran 
reported findings in line with the assumption 
conditions in which the various lights are S! 
activated, the initially dominant response 
response element which is in direct spatial 
ence, i.e., directly underneath. Half of the tod 
S-R combinations in the present situation 
1 and 4) involved the initially dominant 
whereas the remaining half (Numbers 2 and ) 


PRAISE AND CENSURE WITH SCHIZOPHRENICS 


The effects of the statements (praise, neutrality, 
and censure) used in the present study would be 
expected to interact with the particular S-R combina- 
tion involved. For those combinations in which the 
initially dominant response was the correct one, drive 
heightening conditions should increase the efficiency 
of performance, but should impair efficiency where the 
dominant is not the correct response. Drive reducing 
conditions should work conversely in facilitating per- 
formance efficiency for nondominant-correct response 
combinations more than for dominant-correct response 
combinations, 

Task B was designed not only to indicate the effi- 
ciency of the subject's performance in this alternate- 
Choice motor-learning problem, but primarily to obtain 
a measure of drive level, so the number of trials taken 
was left up to him. The only verbalization required 
throughout the testing session was that he indicate 
when he was ready to quit and leave with the number 
of cigarettes he had won. The number of trials he took 
on the task was used as a measure of drive output. A 
Secondary measure of learning efficiency was gained by 
setting an arbitrary criterion of eight correct responses 
(e, four dominant and four nondominant correct 
responses in a row). 

The instructions given for this task emphasized 
that the pushing of the “correct” key would bring a 
Mec reward and that the subject should say when 
0 stop. 

j Experimental social conditions. The attempt to 
instill the attitude of success (praise) or failure (censure) 
regarding performance on Task A was done by making 
a statement to each subject after the first minute he 
Worked on the task and again after each minute through 
the sixth, No comments were made at all during the 
neutral condition. After the seventh minute, the subject 
Was stopped, and instructions were immediately given 
for Task B. 
m uos which were made to the subjects 
thet ed the only way they had of knowing whether 
nul performance was considered good or bad; no 
adt was provided. The following comments 
Praise condition. “That’s fine," *You're doing 

Very well" “Very good," and “You really know 

how to do this very well." 

MM condition. ."You're not doing well," 

jm i Bn ” “That isn't good,” “You don't know 

Riders ues very well,” and “You aren't very 


e * dg A 
same statement was never given twice in succession. 


RESULTS 


à a Ypothesis 1. Acute (high drive) schizo- 
Cn Show greater output on Task B if they 
E een censured beforehand than if they 

Ve been praised or had nothing said to them. 
Wins acute group, as predicted, made signifi- 
iow Breater output (p > .01) on Task B fol- 
SAER censure on A, thus performing in a 
ae. i Similar to that which previous studies 
ie ‘own fornormals. Themedian chi square 
as employed to cover the wide variability 
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TABLE 1 
Ovreut on Task B 
Success Neutral Failure Total 
Acutes 
(Median — 35.0) 
Above median 6 2 10 18 
Below median 6 10 2 18 
"Total 12 12 12 36 
dj=2 x? = 10,67* 
Chronics 
(Median = 22.0) 
Above median 8 6 3 17 
Below median 4 6 9 19 
Total 12 12 12 36 


* Significant at .01 level. 


of subjects, and the distribution and values 
are shown in Table 1. 

Hypothesis 2. Because of low drive responsi- 
tivity, chronic schizophrenics cannot be moti- 
vated enough by censure to change Task B 
output for it to differ significantly from that 
following praise or silence. 

The initial drive level of the chronics proved 
to be too low, as predicted, for there to bea sig- 
nificant difference (p < .12) in their output on 
Task B according to whatever change in drive 
state the experimental social conditions might 
have provoked. The results were worth noting, 
despite not reaching significance, because the 
chronics seemed to increase output when 
praised or had nothing said to them and did 
little when censured. This was the reverse of 
how acutes responded. 

Hypothesis 3. Under censure conditions on 
Task A, acutes show significantly better im- 
mediate performance than under praise. 

Hypothesis 4, Chronic schizophrenics possess 
such an initially low drive that their immediate 
performance upon Task A is not affected differ- 
entially by any imposed social condition. 

The effect of censure, neutrality, or praise 
upon the immediate performance changes of 
either acutes or chronics on simple Task A 
was not significantly demonstrated, although 
acute subjects showed some disposition to 
behave in the predicted direction for acutes, 
and the same reverse order was noted for the 
chronics as was found with output on Task B. 

Hypothesis 5. Acutes’ efficiency on Task B is 
less following censure because of aroused ir- 
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TABLE 2 
PERCENTAGES FOR TvPE OF CORRECT 
RESPONSE ON Task B 
Praise Neutral Censure 
i | Non- i | Non- i- | Non- 
Domi- ;. | Domi- = | Domi- domi: 
BARE MES nant | Come | nant | Tome 
Acutes 47% | 16% | 48% | 11% | 41% | 16% 
(63%) (59%) (57%) 
Chronics | 35% | 4% | 49% | 0%} 48% | 4% 
(39%) (49%) (47%) 
TABLE 3 
Socrat DATA on SUBJECTS 
aaa Mi 
= ean 
Con- | Average | iilos. | S SS lieti of 
ditions (Years) tual fune-| S Bg | illness 
en tioning | 32:5 | (Years) 
GO) Falls te 
a 
Acutes | Praise 33.7 | 104.6 | 10.5 8.2 
Neutral | 31.9 99.6 | 10.8 7.3 
Censure | 31.3 | 100.0 8.9 1.4 
X. 32.3 101.4 10.1 7.6 
Chronics | Praise 41.8 77.3 9.5 | 16.3 
Neutral | 40,9 85.4 9.3 | 13.6 
Censure | 36.9. 87.3 99 | 12.6 
Xp 39.8 83.4 9.6 14.2 
Xa - Xp 7.5 18.0 0.5 6.6 
Significance of ¢ p= .01 | p= .01 — »-.00 
POPE S pd UT do 


relevant response tendencies than it is follow- 
ing a praise or neutral condition. 

It was considered that censure might help 
chronics slowly reach a more “optimum” drive 
State, i.e., somewhat more responsive but not 
Such that numerous irrelevant response tend- 
encies overwhelm them. The next prediction 
was made to test this idea. 

Hypothesis 6. Following censure on Task A, 
chronics’ efficiency on B is greater than when 
neutral or praise conditions are employed. 

The experimental conditions did not signifi- 
cantly affect learning efficiency of acutes or 
chronics for Task B. This is shown in detail 
on Table 2. By reading across the rows, it may 
be seen that similar Percentages of dominant- 
correct responses were made by acutes accord- 
ing to condition, and by chronics according to 
condition. There was also Similarity in the per- 
centage of nondominant-correct Tesponses 
made within these patient subgroups. 

An analysis of variance confirmed that there 
was a highly significant difference (6 > .001) 
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nant) given by all subjects (F = 96.41; df = 
1). Both acutes and chronics were most efficient 
when a dominant response was the correct one, 

Since the variables of age, IQ, length of ill 
ness, and educational attainment had been 
held constant for the three acute subgroups, 
as well as for the chronic subgroups, a sideline 


in type of response (dominant vs. nondomi- 


comparison between acutes and chronic | 


proved interesting. Acutes were not found to be 
significantly superior to chronics on Task A, 
although their superior performance was ex- 
pected. On the other hand, 12 of the 36 acutes 
reached the arbitrary learning criterion on 
"Task B of eight consecutive correct responses, 
whereas none of the chronics learned the task. 
Acutes also made a higher percentage of the 


more difficult nondominant correct responses 


Variance analysis showed not only that there — 


was a difference between dominant and non- 
dominant response type (D), but also that 
acutes and chronics differed significantly (F = 
31.77; df = 1) in making these responses. In- 
spection of Table 2 reveals that the percentage 
of nondominant-correct responses was 16 for 
acutes under both censure and praise condi- 
tions in contrast to only 4 for chronics. Neutral 
acutes also were obviously better than neutral 
chronics by a difference of 10 percentage points. 

The social data collected on the subjects 
and compiled in Table 3 offers an explanation 
for the acutes’ superior performance on Tas 
B. The differences in age, IQ, and length of 
illness between the acute and chronic groups 
are all significant at the .01 level. However, it 
must be noted that there is very little differ 
ence in educational achievement between the 
two types of subjects. This latter finding sus 
gests that the premorbid intellectual potent! | 
of all 72 subjects was quite similar, but that 
the seven more years of illness in the chronit - 
group, mostly spent in a hospital setting, may 
account for the IQ differences. Such differences 
between the acutes and chronics apparently 
determined the differential efficiency of Pe” 
formance on the task. 


Discussion 


Tt was shown that an attitude established 
on one task could differentially evidence its 
by carrying over or generalizing to a seco"? 
task. This result is in line with previous r 
search which established the heightened 89 
eralization gradient as characteristic of 


Sa 


à 
' schizophrenic individual. The output differ- 
ences for the acute subgroups were as expected 
— from learning theory extensions. The marked 

increase under the failure condition can be 

——explained by the upsurge in drive state of the 
subject and his efforts to handle the perceived 
failure and unsatisfied achievement need by 

putting out more work and thereby winning 
the reward of cigarettes. This did not occur 
when drive reducing conditions (praise and 
neutral) were employed. The individual's drive 
state was apparently lowered and his achieve- 
ment need was satisfied either by the praise or, 

_ Seemingly, even better by accepting the notion 
that “no news is good news,” i.e., the neutral 
condition. 

The prediction that chronic subjects could 
Not be motivated enough by social condition to 
cause differential output on Task B was con- 
firmed. Nothing said to them resulted in any 
significant change. The trends obtained, while 
Not significant (p < .12), were reversed to 

lose presented by acutes and may warrant 
Turther investigation. 

— With regard to the secondary area encom- 
passed by the study, there were also trends in 
the predicted direction for acutes as to their 
mean change in performance on Task A, as 
Well as the reverse trends for the chronics, al- 
though Statistical significance was again not 
teached. The marked variability of the sub- 
Jects Was clearly a cause for this. Failure to 

differences in acutes’ or chronics’ efficiency 
Performance on Task B according to whether 
censure or praise was employed warrants more 
sought, Since there was a significant differ- 
Bm the type of response given, i.e., 
E (um more dominant than nondominant 
C acutes and approximately 9-10 times 
i Erit than nondominant for the 
Mics E ere is little doubt that thenondomi- 
allel ections were much more difficult for 
n ise to make. Vet they were equally 
fore mp the three conditions. There- 

7 taere seemed to be no increase or de- 
oen p response tendencies as a 
B redictions at was said to a subject, and the 
iba made from theoretical extensions 

Epear to be in error, 

Tis here about this may be considered: 
that the Ti y the findings regarding output 
acute subi ements were having an effect on 
on deci jects. Perhaps the focus of attention 
! deciding whether or not to remain at the 
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task carried more weight than concentrating 
on the type of response. Also, the majority of 
the subjects gave a predominantly larger num- 


‘ber of dominant responses. When a dominant 


response was the correct one to make, as it was 
50% of the time, the subject was reinforced by 
winning a cigarette. If drive level could be re- 
duced and achievement need satisfied by win- 
ning a cigarette every second or even third 
trial, the motivation might be toward sitting 
through trials and not being overly concerned 
about nondominance failures. Drive state 
could be evidenced merely by output or num- 
ber of trials taken. In this connection, the fund 
of knowledge about the higher resistance to 
extinction of partially reinforced learning is 
apropos. Subjects were certain to make some 
of the easy dominant-correct responses. Domi- 
nants were correct half the time and were ap- 
parently continued because of the partial rein- 
forcement. This could have been operative in 
determining the number of trials taken as well. 
It does not provide a complete answer, how- 
ever, because the percentage of dominant-cor- 
rect responses was similar for all subgroups 
although there were significant differences in 
number of trials taken by acutes. 

The over-all answer may be that neither 
neutrality nor a statement of praise lowered 
the drive state of the acute enough for him to 
overcome the irrelevant response tendencies 
and “solve” the task better than when he was 
censured, whereas the chronics’ drive state 
was never increased to the point where he 
would make other than the easy response, and 
his lowered intellectual efficiency prevented 
his learning the more difficult one. 


SUMMARY 


In this study concerning the motivation of 
schizophrenics, Hullian assumptions regard- 
ing drive level were extended to encompass 
social situations in which schizophrenics have 
been shown to manifest performance changes, 
and to differentiate acute schizophrenics from 
chronic schizophrenics on the basis of high vs. 
low drive level. j 

Three experimental social conditions, drive 
arousing censure, drive decreasing praise, and 
a neutral (silent) condition were used with 72 
adult male schizophrenics. Thirty-six acutes 
were of similar intellectual function and ap- 
proximately equal in age, educational attain- 
ment, and length of illness. Thirty-six chronics 
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were also intermatched on these variables. The 
design of the study permitted a subject to 
demonstrate changes in drive level on a motor 
learning task (B) according to whether he had 
been censured or praised previously on a sim- 
ple psychomotor task (A). 

The generalizing effect of the social condi- 
tions (censure, neutral, praise) from one task 
to the other was found. 

The acute group demonstrated significantly 
(p > .01) greater output on Task B following 
censure on Task A than following praise or neu- 
tral conditions on Task A, and as had been 
hypothesized, performed more as previous 
studies have shown typical for normals. 

As predicted, the initial low drive chronics 
could not be motivated to show significantly 
different response (p < .12) to the social situa- 
tions. 

The effect of censure, neutrality, or praise 
upon the learning efficiency on Task B of both 
acutes and chronics or upon their immediate 
performance during Task A was not signifi- 
cantly demonstrated. 

Acutes were found to be significantly su- 
perior to chronics only on Task B. This could 
be attributed to the acutes’ youth, higher in- 
tellectual function, and to their shorter length 
of illness, which was half that of chronics. 
Educational attainment was almost identical 
for acutes and chronics, however, suggesting 
that their premorbid mental potential was 
nearly equal. 

Tn general, the primary thesis of the study 
regarding the effect upon the effort or gross 
output made by schizophrenics on a task, fol- 
lowing previous treatment of drive level by 
censure or praise, was supported. The sec- 
ondary hypotheses dealing with the efficiency 
of this output could not be supported. The re- 
sults suggested some genuine differential moti- 
vating effects of the social conditions and gave 
Support to the employment of the drive level 
concept in further study with pathological 
groups. 


Roy C. Lone 
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scoop and Luria (1954) reported “A 
O Blind Analysis of a Multiple Person- 

ality” in this journal. They used the 
Semantic Differential as a clinical test and ob- 
tained provocative results. Since the findings 
were based upon a single clinical case, however, 
the authors were limited to a discussion of in- 
teresting observations and speculations about 
the patient’s personality dynamics. The pres- 
ent study takes advantage of the clinical ap- 
plicability of the same instrument, but expands 
the field of observation to 120 subjects. Its 
Purpose is to test some commonly accepted 
hypotheses about the role of identification and 
the nature of word meaning in psycho- 
pathology. 

According to psychoanalytic theory one of 
the crucial developmental steps in the forma- 
tion of personality is the identification of the 
child with his like-sexed parent. The father is 
regarded as a “behavior model” whom the 
healthy boy uses for a self-pattern. The 
ae schizophrenic is supposed to have 
i ed to use his father as a model for identifica- 
lon and to tend to identify too closely with 

mother. Other hypotheses about sexual 
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identification in paranoid schizophrenia sup- 
pose, on the contrary, a general lack of identi- 
fication with both parental figures. Some of 
these hypotheses about pathological sexual 
identification are tested in this study. A second 
series of hypotheses explores the variability of 
word meaning as related to psychopathology. 


METHOD 


Instrument. A form of the Semantic Differential was 
used as a measure of identification. The intimate rela- 
tionship between word meaning, Semantic Differential 
responses, and identification has been discussed by 
Lazowick (1955) who provides empirical evidence sup- 
porting the validity of the Semantic Differential as a 
measure of identification. Three of the Semantic Differ- 
ential scales used in the study were selected from 
among Osgood’s (Osgood & Suci, 1955) three main 
factors of meaning: good-bad, strong-weak, active- 
passive. The other seven scales were chosen for their 
particular relevance to personal adjustment and inter- 
personal relationships: personal-impersonal, masculine- 
feminine, carefree-careful, firm-flexible, proud-humble, 
familiar-unfamiliar, solitary-sociable. Thirty stimulus 
words were rated on this form of the test by each sub- 
ject. Among these stimuli were included such words as 
ME, FATHER, and MOTHER, which were used to build 
models of hypotheses about identification. 

Subjecis. Subjects were chosen from paranoid schizo- 
phrenic and anxiety neurotic populations since problems 
of identification have been assumed to play important 
roles in both types of disturbance. The first experi- 
mental group consisted of 30 hospitalized paranoid 
schizophrenics. These and all other subjects were 
Caucasian males between the ages of 18 and 48. Age, 
education, and occupational level were controlled. The 
first control group consisted of 30 hospitalized tuber- 
culosis patients, chosen because tuberculosis patients 
are one of the few groups of chronically hospitalized 
patients other than psychiatric patients. Tuberculosis 
patients have been observed to be somewhat immature 
and dependent; however, this observation seems to be 
more a function of their prolonged hospitalization than 
of prior personality configuration (Dorken, 1951; Ellis 
& Brown, 1950; Hand, 1952; Seidenfeld, 1944; Stanton, 
1939; Stewart & Vineberg, 1955; Wechsberg & Sparer, 
1948). This is exactly the control effect desired in this 
study. Psychological test reports, interviews, and social 
history information were used to rule out obviously 
emotionally disturbed tuberculosis patients. In addition 
patients who had multiple tuberculosis hospitalizations 
were eliminated since this implies an immature inability 
to follow simple medical directions (Stewart & 
Vinel 1955). 

rupe experimental group was composed of 
30 anxiety neurotic patients from an outpatient 


p 


290 


Veterans Administration mental hygiene clinic. Control 
subjects for this group were selected from among the 
employees of several Los Angeles business firms. Since 
these subjects were selected from the community with- 
out the convenient (but biasing) recruiting effect of 
university or hospital affiliation, the use of clinical 
interviews or testing as a method of screening was not 
feasible. The testing was done on employer time on the 
business or plant premises, Anonymous reports by 
the control subjects that they had never had psychiatric 
treatment or hospitalization were used as the only 
safeguard against the presence of emotional disturbance. 
Any contamination of this control group with neurotic 
subjects should militate against finding significant 
differences in comparison with the neurotic group. 

Measurement, Semantic Differential data have been 
conceived as generating profiles characteristic of the 
meaning of the rated stimulus words. A given profile 
determines a single point in a multidimensional 
semantic space. The nature of these dimensions depends 
upon the bipolar scales used in constructing the test. 
Since all stimulus words are rated on the same set of 
scales the distance between two points in the same 
multidimensional space provides an index of the simi- 
larity or difference of meaning of the two stimulus 
words. The following formula permits the calculation 
of the distance between any pair of points in semantic 
space. 


D=/(%— Yy t] 


Where: D — distance 

X; = the scalar score for a given stimulus word 
on the ith scale. 

Y; — the scalar score for a second stimulus word 
on the ith scale. 

Actually a more efficient formula involves the use 
of the inverse of the covariance matrices of the differ- 
ences between scales (X; — Y;) and (X; — Y;). This 
expression corresponds to the exponent of a multivariate 
normal frequency function for correlated variables. 
Since the actual correlation between scales in this study 
is low (algebraic mean = .21), only a negligible decrease 
in efficiency results from the use of the simpler distance 
measure given in the formula. 

Hypotheses, A special series of validity hypotheses 
are included to insure the general validity of the test 
instrument for these groups and to determine whether 
the psychotic subjects were able to understand the test. 
Each of five validity hypotheses predicted that a pair 
of words similar in meaning would generate semantic 
profiles that would be more similar than a second pair 
of words opposed in meaning. For example, Hypothesis 
I predicts that the words man and rATHER should 
generate profiles more similar than are generated by 
the words HERO and FAILURE (see Table 1). 

The next series of word models tests the hypothesis 
that idiosyncratic word meaning is an integral part of 
psychopathology. The experimental groups are expected 
to agree less than the control groups as to the meaning 
of five words randomly selected from among the 30 
concepts used. Variability in word meaning was meas- 

ured in terms of the distance between a subject's 
profile for a given word and the mean group profile for 
that word. For example, a mean profile for the word 
CHILD was calculated for the paranoid schizophrenic 
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TABLE 1 
VALIDITY HYPOTHESES 


Results of sign test 


Word pairs n 
Plus Minus 
D (MAN, FATHER) —D (HERO, FAILURE) 16 104* 
D (HIT, ANGER) —D (LOVE, KILL) 14 106* 
D (sty, curr) —D (ANGER, CHILD) 41 T9* 
D (Love, KISS) —D (KILL, TREE) 14 106* 
D (HERO, success) —D (FATHER, FEMALE) 25 95* 


Note.—D stands for “the distance between... .”” 
* Significant at or beyond .05 level. 


group and the distance was measured between this 
mean profile and the individual profiles on the word 
CHILD for each paranoid schizophrenic subject. This 
hypothesis is relevant to the clinical observation that 
communication with schizophrenic patients is very 
difficult; it tests the inference that this difficulty is 
related to a lack of agreement on the very meaning of 
words. It is also relevant to Johnson's exposition of the 
“general semanticists" view that both neurotic and 
psychotic disorders are due to disturbances in word 
meaning (Hayakawa, 1949; Johnson, 1946). 

In Table 2 the exact form of the hypotheses about 
identification appear. Hypothesis I may be read as 
follows: When the semantic distance between the points 
representing the words me and MOTHER is subtracted 
from the corresponding distance between me and 
FATHER, a larger algebraic difference should result for 
the experimental subjects than for the control subjects. 
That is, for each control subject the words ME and 
FATHER should have many common elements of meaning 
as a result of a healthy masculine identification, and 
therefore should determine points relatively adjacent 
in semantic space. On the other hand we expect 4 
rather distinct differentiation in meanings betwee? 
the words me and morner for a control subject. Accord- 
ing to the hypothesis that confusion in sexual identifica- 
tion is an essential feature of the process resulting 0 
paranoid schizophrenia, the results of these measures 
should be significantly different from those of the 
paranoid schizophrenic subjects. 

Hypothesis II is the operational form of the hypoth- 
esis that the basic problem in paranoid schizophrenlé 
is a lack of identification with both parental figures: 
Thus the distance between the words me and FATHER 
is added to the distance between the words ME all 
MOTHER, the resulting sum being hypothetically smallet 
for the control groups than for the experimental groups. 
Hypothesis III tests a variation of the first hypothesi 
about sexual identification, Without tapping parent 
identification directly measurements were obtain 
which are relevant to sexual identification by us%8 
the words ME, MAN and ME, FEMALES Hypothesis 


* A more logically satisfying model of this hypothest 
might have used the antonyms (MALE, FEMALE) ^ 
(MAN, WOMAN). These pairs of stimulus words were a 
available in this study because it is part of a larger study 
in which the number of stimulus words was limit n A 
the memory capacity of the IBM 650 Electronic Da 
Processing System, 


Worp MEANING AND SEXUAL IDENTIFICATION IN PARANOIDS 291 
TABLE 2 
PARENTAL AND/OR SEXUAL IDENTIFICATION: CONCEPT OF PARENTS 
i Sum of ranks, Mann-Whitney U tests 
Ede Word pairs Predicted difference ERU euim 7 
schizophrenic! (TB). | nearste | Control 
T D (ME, FATHER) EC 871 959 1050 780* 
— D (ME, MOTHER) 
II D (ME, FATHER) E>C 1062 768* 1030 800* 
-+D (ME, MOTHER) 
TII D (ME, FEMALE) CoE 921 909 739 1091* 
—D (ME, MAN) 
IV D (ME, MAN) E>C 986 844 1084 746* 
V D (MOTHER, SIN) C>E 939 891 914 916 
—D (MoTHER, Love) 
VI D (FATHER,.SIN) C>E 942 888 771 1059* 
—D (FATHER, LOVE) 


à * Where E stands for experimental groups and C stands for control groups. Thus for Hypothesis I it is predicted that the in- 
icated difference between distances will be greater for subjects in the experimental group than for those in the control group. 


* Significant at or beyond .05 level. 


Predicts a smaller distance between the words ME and 
MAN for the control than for the experimental groups, 
thus examining the identification of the subjects as 
a man" (with all of its implications about adequacy, 

etc.) independent of feminine identification. 
Hypothesis V deals with the subject's picture of his 
da in relation to a highly positive word (LOVE) 
and a highly negative word (sty). The control groups 
are expected to obtain significantly larger scores on this 
ium 9n the basis of the commonly accepted clinical 
mu buon of the mother's key role in the emotional 
s monent of the child. Control subjects should 
Mu orm and rove relatively adjacently in 
à dic UE but MOTHER and SIN at a relatively great 
dae e d as in the other hypotheses, the subject 
M now that these particular stimulus words 
iMm s Pave the measurements are not deter- 
5 HA p liberate, Conscious statements about the 
ii en ‘fides the stimulus words. Hypothesis 
Wikies is to V except that the key stimulus word 
ie m tests the importance of a favorable father 

the emotional development of the child. 


REsurts AND Discussion 


i iu Were punched into IBM cards and 
i ae between words were computed on 
EUM ‘ee Electronic Data Processing 
or each T auxiliary equipment. The results 
meastires 'ypothesis consisted of 30 distance 

ifferenc per group. Since the direction of 

ithe cf was predicted a one-tailed Mann- 
Y U test served as a test of significance. 
im $56 of the validity hypotheses a simple 


Sign test w; 
toe fu Used on the pooled data of all 


The meaningfulness of all other hypotheses 
depends upon the results of the validity hy- 
potheses as reported in Table 1. All of the 
differences are significant. Since a special com- 
parison of the validity measures for the 
paranoid schizophrenic group and its control 
group revealed no significant difference on any 
of the five validity subhypotheses, we may 
conclude that the subjects were able to under- 
stand the task. 

The next set of hypotheses concerns the de- 
gree of agreement or idiosyncrasy of word 
meaning as measured by variability about 
mean profiles. The results indicate a significant 
difference between the paranoid schizophrenic 
group and its control group for four of the five 
words tested (see Table 3). There are also sig- 
nificant differences between the anxiety neu- 
rotic group and its control group for two of the 
five words tested. In every case the differences 
are in the predicted direction. 

"These results support the clinical impression 
that the peculiar use of language by paranoid 
schizophrenic patients is at least partly due to 
a disturbance in word meaning itself. In view 
of the results of the validity hypotheses, this 
distortion evidently occurs even when the 
paranoid schizophrenic patient has a gener- 
ally accurate understanding of a word's mean- 
ing: even when the language of the paranoid 
schizophrenic is apparently coherent, there evi- 
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TABLE 3 
VARIABILITY OR IDIOSYNCRASY OF 
Worp MEANING 
Sum of ranks, Mann-Whitney U test 
Word Paranoid Control Anxi 
schizo | GPB). neurotic Control 
CHILD 1037 793* 875 955 
PENIS 991 839 861 969 
ANGER 1040 790* 868 962 
MOTHER 1027 803* 1079 751* 
ME 1039 791* 1092 738* 


* Significant at or beyond .05 level. 


dently remain hidden blocks to communication. 
For example, the results indicate that “normal” 
subjects agree fairly well on the subtle, con- 
notative aspects of the meaning of the word 
ANGER. Paranoid schizophrenic subjects, on 
the other hand, agree less on the connotative 
meanings of the word ANGER and may, in fact, 
have very different conceptions of “ANGER” 
among themselves. One would then predict 
that in an apparently rational conversation 
with a paranoid schizophrenic there may sud- 
denly come a startling point of disagreement 
due to these subtle differences in word meaning. 
For example, paranoid schizophrenics might 
vary in their understanding of ANGER from such 
concepts as “mild, inoffensive irritation” to 
“world destructive, murderous rage,” whereas 
normal subjects would agree more specifically 
on a meaning somewhere between these ex- 
tremes. 

Some disagreement as to word meaning is re- 
flected in the case of the anxiety neurotic 
group, but the effect is apparently less general. 
We would thus expect some misunderstanding 
and blocking in communication with anxiety 
neurotic patients, but on a less widespread 
basis. These experimental results perhaps re- 
flect the irrationality of neurotic patients in 
certain sensitive conflict areas whereas mis- 
understandings occur in almost any com- 
munication with paranoid schizophrenics, 

; The results of the hypotheses involving 
identification and the parental picture are re- 
ported in Table 2. For the paranoid schizo- 
phrenic group, only one of the six hypotheses is 
supported at a significant level. For the anxiety 
neurotic group, five of the six hypotheses are 
significantly supported. These results give no 
support for the hypothesis that the paranoid 
schizophrenic tends to identify with the wrong 
parental or sexual figure (Hypotheses I and 
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III). Nor is there evidence to suggest a lack of 
identification as a “man” (Hypothesis IV), 
whatever combination of sexual, virile, and 
generic connotations this term may have. The 
only significant difference between the paranoid 
schizophrenic group and its control group ap- 
peared in the extent of combined identification 
with both parental figures (Hypothesis II). 
These results tend to support the position that 
the paranoid schizophrenic, like other schizo- 
phrenics, is extremely isolated from other 
people, that he cannot empathize with them or 
understand them, all of which perhaps roots 
from an original failure to identify with parents 
of either sex. 

The neatness of this explanation is disturbed, 
however, when the results for the neurotic 
group are considered. For the anxiety neurotic 
group, as compared with its control group, five 
of the six hypotheses are supported at a signifi- 
cant level. Included among these is the hy- 
pothesis (II) about combined identification 
with both parental figures. Thus it is found 
that the kind of “distancing” or "estrange- 
ment" from parental figures that is implied by 
this measure is found in anxiety neurotic as 
well as paranoid schizophrenic patients. How- 
ever, both of the hypotheses about confused 
sexual identification (or identification with the 
wrong figure) are significantly supported for 
the anxiety neurotic group although thty 
were not supported for the paranoid schizo- 
phrenics. Thus the results suggest that the 
concept of “confusion in sexual identification’ 
is consistently and meaningfully associated 
with anxiety neurosis, rather than the common 
clinical assumption that confusion in sexual 
identification, or an identification with the 
opposite sexed parent constitutes an essential 
etiological factor in paranoid schizophrenl™ 
The evidence for confusion in sexual identifica 
tion for the anxiety neurotic group exists Wit 
respect to both Hypothesis I (parental models) 
MP III (more general expression of sexua 
role). 

Finally the word rATHER is placed closer 1 
SIN than to Love for the anxiety neurotics 9? 
compared with their control subjects. This indi- 
cates a more negative, less loving picture of tH? 
father figure for the neurotic group. No SU 
negative image seems to surround the moth? 
figure as tested in Hypothesis V. It will be ™ 
membered that all subjects in this study We" 
males. 
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Worp MEANING AND SEXUAL IDENTIFICATION IN PARANOIDS 


The results of this study can be compared 
with work done by Luria and by Lazowick in- 
volving the role of identification in neurotic 
conditions. Different forms of the Semantic 
Differential were used in these studies and 
different measurement operations and subject 
populations were used, but these differences 
provide an acid test of the breadth, consistency, 
and generalizability of findings. Luria tested 
"neurotic" college students (Osgood, Suci, & 
Tannenbaum, 1957) and found no general tend- 
ency among a mixed group of male and female 
students to identify with the parent of the 
wrong sex. Although this is contrary to the 
present findings, a comparable study by Lazo- 
wick (1955) indicates the probable reason for 
this difference. He found that “neurotic” male 
students (as defined by high scores on the 
Taylor Manifest Anxiety scale) perceived sig- 
nificantly more semantic similarity between the 
words (FATHER, MYSELF) as compared to 
(MOTHER, MYSELF) but the reverse did not hold 
true for the female subjects. He discussed these 
findings in terms of the relatively masculine 
goals of college women. Since the present study 
included only male subjects, its results are con- 
sistent with the study of Lazowick and the 
negative findings of Luria may well be due to 

€ presence of female college students in the 
Sample. The inclusion of the female subjects 
may also explain the fact that Luria’s analysis 
ie the responses on the evaluative scales for 
the words MOTHER and FATHER indicated that 
the neurotic group tended to “vilify” both 
(res figures. as hypothesized by Mowrer 
es ). This is in agreement with the results 
a ypothesis V (involving the words MOTHER, 
a eee but is not in agreement with the 
alg Hypothesis VI (involving the words 
De nd LOVE). This inconsistency may 
jit ii from the inclusion of female sub- 
sively ae s study as opposed to the exclu- 

€ population used in the present one. 


SUMMARY 


form of the Semantic Differential was used 


A 
® study differences between hospitalized 
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paranoid schizophrenic subjects and a control 
group, and anxiety neurotic subjects and a 
second control group. Widespread areas of 
variability in word meaning for paranoid 
schizophrenic subjects, and limited areas of 
variability in word meaning for anxiety neu- 
rotic subjects were found. The paranoid schizo- 
phrenic group had a generally poor identifica- 
tion with both parental figures, as did the 
anxiety neurotic group. Evidence was found 
for “confusion in sexual identification” for the 
anxiety neurotic group but not, as is commonly 
assumed, for the paranoid schizophrenic group. 
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SOME EFFECTS OF “SUSPICIOUS” VERSUS “TRUSTING” 
TRAINING SCHEDULES! 


HAROLD H. KELLEY anp KENNETH RING 
University of Minnesota 


something, a trainer often uses his ability 
to affect the rewards and punishments 
that person receives. For example, to teach 
her child to wash his hands before each meal, 
a mother may selectively administer rewards 
and punishments, rewarding the child when 
he washes and/or punishing him when he fails 
to do so. In the terms proposed by Thibaut 
and Kelley (1959), the mother has fate 
control over the child—the ability to affect 
his outcomes, his fate, regardless of anything 
he might do. Through selective use of this 
fate control, providing better outcomes when 
the child produces the desired behavior and 
poorer outcomes when he exhibits nondesired 
behavior, behavioral preferences on the part 
of the child can be created. Thibaut and 
Kelley describe this as converting fate control 
to behavior control. 
In the conversion of fate control, the problem 
of monitoring the trainee’s behavior is a 
critical one. If the trainer is to teach the 
trainee to produce a given behavior, it is 
desirable that he consistently reward that 
behavior and, speaking relatively, punish other 
behaviors. Without such consistency, the 
trainee cannot develop a clear idea of what is 
expected of him and will show imperfect 
adherence to the trainer’s prescriptions. 
Trainer consistency requires that he know on 
each behavior occasion what the trainee has 
done so that he may adjust his own behavior 
accordingly. However, it is often true that the 
trainer is unable to monitor all of the occasions 
on which the trainee makes relevant behavior 
choices. Trainers usually have other activities 
and duties that interfere with the maintenance 
of surveillance over any given trainee. On the 
other hand, a trainer need not depend entirely 
upon his own efforts in this respect. It is 
possible for the trainee to contribute to the 
monitoring by bringing his actions to the 
1 This research was supported by the Laboratory for 
Research in Social Relations and by Grant G-5553 to 
the senior author from the National Science Founda- 


tion. We are greatly indebted to John Hatton for his 
assistance in carrying out the experiment. 


T order to teach another person to do 


attention of the trainer and by presenting 
evidence as to his most recent behavioral 
choices. 

The trainee, then, can contribute to his 
own surveillance and thereby increase the 
consistency of the training he receives and his 
mastery of the trainer's demands. Once we 
recognize this possibility, it becomes important 
to ask what factors in the trainer's behavior 
affect the degree to which the trainee aids and 
abets the monitoring process. The general 
hypothesis in the present experiment is that 
an important factor in this respect is the 
action the trainer typically takes in the absence 
of monitoring evidence. 

Let us consider those occasions when the 
trainer knows the trainee has been confronted 
with a pertinent behavior choice but does not 
know what choice he has made. The trainer 
has available several courses of action: he can 
assume the trainee made the correct choice 
and reward him, he can assume he made the 
incorrect choice and punish him, or he can 
disregard the situation and take no action. 
The first course of action is likely to encourage 
concealment of behavior. As long as the 
trainee can obtain reward if the trainer fails to 
see his actions, he will have no reason to 
learn to show them to the trainer. He can 
obtain positive outcomes by making either the 
correct response publicly or the wrong response 
privately. (It is assumed here that learning 
the discrimination desired by the trainer has 
no intrinsic value for the trainee, but serves 
only to help him accommodate to the demands 
of the trainer. This is especially likely to be 
true in situations where the trainee has not 
voluntarily engaged himself in the relationship 
‘with the trainer.) On the other hand, while 
the second approach may seem unnecessariy 
punitive and rather cruel, it is likely to have 
advantages from the point of view of encourag 
ing the trainee to bring his behavior to the 
trainer's attention. The punishment woul 
tend to discourage any tendencies towa" 
concealment the trainee might have. The 
discomfort can be avoided (and rewa" 
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. obtained) with certainty only by making the 
correct choice and presenting evidence of it to 
the trainer. The third course of action is 
intermediate in these respects. The trainer's 
inaction encourages neither hiding nor showing. 
However, insofar as the trainee finds the 
incorrect behavior intrinsically more rewarding 
(and this is true in many of the more important 
training situations), the trainee will find 
concealment to be advantageous. Under these 
conditions, the trainer's inaction in the 
absence of monitoring information would be 
expected to have effects similar to giving 
reward. 

The present experiment compares the first 
two of these courses of action. They are 
referred to, respectively, as a trusting schedule 
and a suspicious schedule. We propose that in 
the absence of knowledge of the trainee’s 
action, the trainer who adopts a training 
Schedule based on a suspicious attitude will 
be more successful than one who adopts a 
schedule based on a trusting attitude. Specifi- 
cally, it is suggested that the suspicious 
schedule will encourage the trainee to aid the 
Monitoring Process to a greater extent than 
will the trusting one. In consequence, the 
moe will tend to learn the trainer's demands 
etter under suspicious treatment and will 
iay feel he has greater mastery over 
aa in the relationship than if he had 
(Co EE with trust. The details of these 
due ized effects will be elaborated. after 
Xpertmental procedure has been described. 


METHOD 
Subjects 


tie 52 male subjects in the experimental groups and 
graduates e s control group were all under- 

„enrolled in introduct chok urses 
at the University of MESSEN REA EC 


Procedure 
pedach subject found himself in a room with the ex- 
cor eun m another “subject” who was actually a 
Tepresented T ye experimenter. The confederate was 
Psychology. Th eing an advanced student in clinical 
interested s ey were told that the experimenter was 
person (hee the effect of opinions of a quali- 
an unquali ned Clinical student) on the performance of 
pendently, th, Person (the subject). Working inde- 
cards on C5 d would be given the same series of 
describing ch of which would appear two statements 
decide wi caological Symptoms. Each man was to 
Beater psycn, atement of each pair “indicates the 
Ychopathology, that is, psychological illness.” 
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They were further instructed to evolve during the 
course of this task “a reasonable, consistent, and justi- 
fiable criterion of psychopathology” to use in judging 
each pair of items. In this process, the subject would 
have available the evaluations of his judgments made 
by the more expert person and the experimenter was 
presumably interested in how the subject would make 
use of these evaluations. 

The items themselves consisted of statements from 
the MMPI and variants thereof, selected on the basis 
of whether they reflected depression (including its 
effects) or anxiety (and its effects). Data were gathered 
prior to the experiment to determine the degree to which 
items conveyed either the one or the other attribute and 
a list of 37 pairs was then drawn up. Each pair always 
consisted of one depression item and one anxiety item. 
For example, on the third card in the series appeared 
the two statements, “I become pretty depressed when I 
fail at something important” and “I have nightmares 
every few nights.” On the twentieth card were the two 
statements “Almost every day something happens to 
frighten me” and “My life has turned out to be more 
wretched than I ever expected.” 

The subject was asked to go through the stack of 
cards, make a choice on each one as to which statement 
of the pair manifested the greater psychopathology, and 
register that choice by turning one of two switches. 
This decision of his, he was told, would appear on a light 
panel next door (where the confederate would ostensibly 
be making his selections) and would be evaluated by the 
“qualified student.” Then the crucial instructions con- 
cerning the subject’s showing or concealing his choices 
were given, 

It was explained that the subject had the option of 
using the graduate student’s evaluations in arriving at 
his own criterion. If the subject wanted to insure 
that the “qualified person” saw his choice on a given 
trial (and since the latter had some vague “additional 
functions” to attend to in the adjoining room, he might 
not always be in a position to observe the panel), he was 
to pulla switch which would “hold” his response on the 
panel until it was sure to be seen. On the other hand, if 
the subject was not particularly eager that his choice 
be seen, he was told to pull another switch which was 
said to terminate his panel light immediately. For each 
trial, then, the subject had /wo decisions to make: 
choosing the item that represented the greater psycho- 
pathology and keeping on or terminating the indication 
of his first choice on the panel in the next room. 

The experimenter further stated that the graduate 
student would be compelled to make some evaluative 
response to each choice of the subject, whether or not he 
had actually seen the subject’s choice on a trial. If the 
qualified person approved of the subject’s choice, he 
would flash on a green light on the subject’s panel. If he 
disapproved, he would give a red light followed by an 
electric shock. (Two brief sample shocks were given the 
subject just before the experiment began.) A neutral re- 
sponse would be indicated by a white light. The subject 
could choose to work with the qualified person (by 
“showing” his responses) or independently (by termi- 
nating his first choice by pulling a second Switch) al- 
though he did not always have to work with or inde- 
pendently of the qualified person. The subject could 
use the latter's evaluations to help in formulating his 
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REWARD PUNISH REWARD PUNISH 
RIGHT, BAN RIGHT, PNE 
SHOW SHOW 

d 

p m 


Fic. 1. Matrices representing the two training 
schedules. 


own criterion of psychopathology or he could disre- 
gard them. The subject could change his criterion at any 
time, His job, he was told (repeatedly), was simply to 
arrive at the best criterion of psychopathology that he 
could, Perhaps the qualified person's evaluations would 
aid him, perhaps he would do better by himself. After 
answering any questions the subject had, the experi- 
menter took the confederate next door and the experi- 
ment began. 


Manipulation of Trainer's Schedule 


"The trainer (confederate) had a panel on which signal 
lights indicated the subject's choice of item and the 
subject's decision to show" or “hide” his choice. Using 
this information, the confederate delivered rewards 
(green lights) or punishments (red light and electric 
shock)! He followed one of two prescribed training 
schedules that may best be summarized by the two 
matrices shown in Figure 1. In these matrices, the sub- 
ject's possible responses are indicated along the left 
and the trainer’s possible responses, across the top. 
"Right" stands for the choice the trainer regarded as 
correct and “wrong,” for the response he considered in- 
correct. Show means that the trainee was willing to 
make sure the trainer saw his choice and hide, that he 
wished to conceal his choice. In the body of the matrix, 
the pluses and minuses refer to the rewards and punish- 
ments dispensed by the trainer (or, the outcomes re- 
ceived by the trainee) and the number in the upper 
right hand portion of each cell indicates the probability 
that the subject would be given the reward or punish- 
ment associated with that cell. 

As these matrices show, the two schedules are identi- 
cal when choices are called to the trainer's attention. In 
both cases, if the subject does the right thing and shows 
it to the trainer, he is rewarded 100% of the time; if he 


does the wrong thing and shows it, he is punished 100% 
of the time. 


? The white “neutral” light was never used. The shock 
delivered ranged from 17 volts on early trials to 30 
volts on later ones. The alternating current was gen- 
erated by a high internal impedence apparatus de- 
signed by David Lykken to whom we are indebted for 
its use. 


Hangorp H. KELLEY AND KENNETH RING 


"The schedules differ with respect to the assumption 
made in the absence of monitoring. The “trusting” 
trainer always assumes the best. Unless he has evidence 
to the contrary, as far as he is concerned the subject 
has performed correctly. Because differences in trainer 
schedules are important when the trainer's ability to 
monitor is low, in both conditions we have instituted a 
20% monitoring rate which means that the trainer 
will be able to observe what the subject has done on only 
20% of the trials on which the subject attempts to hide, 
Of course, the confederate actually knows what the 
subject has chosen every time but he follows a program 
which dictates that he act as if he knew this on only one 
in every five trials on which the subject “hides” (which 
ones being determined randomly). Thus, 20% of the 
time the trusting trainer will “catch” the subject doing 
*wrong-hide" and will punish him for it; the subject 
will “get away with it” 80% of the time, however. When 
the subject is right and hides, he is rewarded 20% of the 
time because the trainer has detected his correctness 
and the other 80% of the time because he has succeeded 
in concealing his choice and the trusting trainer has 
given him the benefit of the doubt. 

Considering now the suspicious trainer we see that 
the subject can never “get away with anything” for this 
trainer always assumes the worst; unless he has evi- 
dence to the contrary, he will assume the subject de- 
serves punishment. Even when the subject is correct, 
if he fails to show his choice he is rewarded only in those 
instances (20%) when the trainer detects his choice. 

Our experimental hypotheses may now be stated 
more precisely. First, there will develop a difference be- 
tween subjects under the two schedules in frequency of 
showing, those in the suspicious condition showing 
more. (Whether the subjects in the trusting condition 
will decline in showing or those in the suspicious condi- 
tion will increase will depend upon the initial level of 
showing created by the general “set” given the subjects 
by the experimental instructions.) Reference to the mar 
trices describing the two conditions shows why this is 8) 
The subject stands to profit under the trusting trainet 
simply by concealing what he does. For even if he re- 
sponds at random in terms of right and wrong, 90% n 
the time he'll be rewarded if he hides, but only 50% if he 
shows. On the other hand, a subject working under i 
suspicious trainer will find that he gets rewarded M 
infrequently when he hides (10%) but substantially 
more often when he shows (50%). He should heo 
ingly learn to show (or continue showing if this is bis 
initial tendency). T 

Second, subjects working with the trusting tuo 
will not learn to make the distinction between right ES. 
wrong as rapidly as will subjects who have the be 
picious trainer. Under either trainer, the subject ud 
tains veridical information about how his choices © 
respond to those of the trainer only when he al 
trainer’s monitoring by showing. If he learns to 8 
moreunder the suspicious trainer (per Hypothesis D, 
subsequent information about his performance WI er 
better and he will be better able to learn the b W 
criterion. Because the subject is encouraged to hi i 
the trusting trainer, discrimination between right 9^ 
wrong is made more difficult and, hence, lea 
retarded. This hypothesis depends upon Hypoth in 
and assumes that the discrimination between son 
and hiding one's responses is learned more quickly 
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the discrimination between what the trainer desires 
from the subject and what he does not. 

Third, if the first two hypotheses be correct, we 
should expect at the end of the training that subjects 
operating under the trusting trainer will have felt less 
responsible for (or control over) the kinds of outcomes 
they received as compared to subjects who have sus- 
picious trainers. These latter should ultimately find 
themselves in a more stable situation of converted fate 
control, with outcomes reliably dependent on what re- 
sponse they themselves make. 

It is unmistakable how closely intertwined these 
three hypotheses are. Each for its confirmation rests 
on the validity of the one(s) which preceded. It should 
also be noted that the two conditions are identical for 
Subjects who always show their choices to the trainer. 
Such subjects do not experience the different schedules 
of the trainer because these schedules specify different 
actions only in the event of hiding. Hence, comparison 
of the two conditions is meaningful only for subjects 
who hide their choices on at least some of the trials. 

In the course of the experiment, the confederate 
trainer used two different criteria of what was right and 
wrong: For half the subjects he consistently chose the 
depression items and for the other half, the anxiety 
Statements, Thus, there were altogether four experi- 
menta] conditions which may be summarized as follows, 
giving first the trainer’s schedule and then his crite- 
Non: suspicious-anxiety (Sus-Anx), suspicious-depression 
(Sus-Dep), trusting-anxiety (Tru-Anx), and trusting-de- 
rim (Tru-Dep). Of the 52 experimental subjects? 

showed their choices on all or all but one trial (9 with 
the depression criterion and 3 with the anxiety criterion) 
and were eliminated from the comparison of the condi- 
ir for the Teason given above. The remaining 40 pro- 
ed 10 ‘subjects in each of the four conditions. Subjects 
Ae i ened to experimental conditions according to 

Otating schedule with some deviations tolerated in 
order to secure equal Ns. 


Control Group Data 


"od group data were needed at the end of the 
Which eee to determine the relative frequency with 

i a two items of each pair were selected when no 
booklets E consequences were involved. Accordingly, 
printed ied the pairs of statements, each pair 

Br 7: à separate page and in the same order as in 
introdu ment, were distributed to male students in an 
lected rage Psychology class. Of 57 men, 41 were se- 
experiment (ae comparable to the subjects used in the 
bd we (in terms of age, class, etc.) and only their 
simply de in establishing norms. Each subject was 
one by on, ed to go through the pairs of statements, 
manifested” and decide which statement of each pair 
logical E eie psychopathology, or psycho- 
reasonable S. Each was instructed to try to evolve a 
Psychopath, consistent, and justifiable criterion of 

Pathology to facilitate his decisions, 


a 

dada Subjects are not included in this total. They 
(six VERON that the trainer was a confederate 

- Case), or coy ailed to understand the instructions (one 


ld not tolerate the shock (one case). 
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TABLE 1 


AVERAGE PERCENTAGE SHOWING IN SUCCESSIVE 
BLOCKS or THE TRAINING 


Trials 
Condition 
112 13-25 26-37, All 
Sus-Dep (N = 10) 7E ITISA ela (3 
Tru-Dep (N = 10) 62 50 4 5 
Sus-Anx (N = 10) 62 66 61 63 
Tru-Anx (N — 10) 78 Slt 47 58 
RESULTS 
Showing and Hiding 


The results on frequency of showing are 
presented in Table 1. The tabled values are 
the averages of the percentages of trials on 
which subjects showed their choice to the 
trainer. Subjects in all conditions begin with 
a moderately high rate of showing, about 70% 
of the time. (It will be remembered that some 
subjects who showed on every trial have 
already been eliminated from the analysis.) 
As expected, there develops a sharp difference 
between the suspicious and trusting conditions. 
The subjects under the suspicious trainer 
maintain the initial level of showing. Those 
under the trusting trainer, although initially 
at about the same level, evince a steady decline. 
An analysis of variance which includes a 
trend analysis shows that there is a significant 
overall downward trend (p < .01) and a 
significant schedule-by-blocks interaction effect 
(p < .01). (Two-tailed tests of significance are 
used throughout this paper.) The latter indi- 
cates that the downward trend is steeper in 
the trusting condition than in the suspicious 
one. In terms of amount of showing over all 
trials, there is a significant schedule effect 
(p < .05). As can be seen in Table 1, the 
suspicious schedule yields more total showing 
than the trusting condition. The effect of the 
suspicious trainer, then, is to prevent a decline 
in subjects bringing choices to his attention, 
the total result being that he is shown more 
of their choices than is the trusting trainer. 

Table 1 also reveals that the Sus-Dep 
subjects have a rather high initial level of 
showing which they maintain. Consequently, 
their overall level is markedly higher than that 
of the Tru-Dep subjects (p < .01). In contrast, 
in the anxiety condition the Tru-Anx subjects 
have a high initial level and even though they 
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drop sharply, their overall level is almost as 
high as that of the Sus-Anx sample (the 
difference for all trials is not significant). 
Although the initial levels of showing are not 
significantly different as between the various 
conditions, it is possible that small differences 
in showing on the early trials have large 
effects on the learning, so it is necessary to 
take account of these initial showing rates in 
the analysis of the learning data, as described 
below. 


Learning the Trainer’s Criterion 


The data on learning the trainer’s criterion 
are presented in Table 2. The average per- 
centage correct for various portions of the 
training are shown for the four experimental 
groups and the control sample. The latter 
data indicate how often the depression items 
(or anxiety items) are chosen in the absence 
of any training. They afford a baseline from 
which learning in the experimental groups can 
be assessed. We observe that the groups in 
which the depression criterion was used lend 
support to the hypothesis: Subjects learn 
better under the suspicious trainer than 
under the trusting one (p < .05). However, 
with the anxiety criterion, the suspicious and 
trusting conditions are not differentiated. 

These differences in learning correspond to 
those in overall showing: A significant trusting- 
suspicious difference is obtained in the depres- 
sion condition where there was a large 
difference in overall level of showing and 
no such difference is obtained in the anxiety 
condition where there was only a marginal 
difference in total showing. And as already 
noted, the overall levels of showing reflect, in 
part, differences in initial rates. This raises 
two problems in evaluating our hypothesis. 
On the one hand, the difference supporting the 
hypothesis in the depression condition may 
simply reflect the accidentally high initial 
level of showing in the suspicious sample. 
On the other hand, the lack of difference in 
the anxiety condition, which is contrary to 
the hypothesis, may reflect the accidentally 
high initial level of showing in the Tru-Anx 
sample. 

To evaluate our hypothesis in a manner 
that rules out possible effects of initial differ- 
ences in showing, a regression analysis was 
made in which amount of showing in the first 
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TABLE 2 


AVERAGE PERCENTAGE CORRECT IN SUCCESSIVE 
BLOCKS OF THE TRAINING 


Trials 
Condition 
1-12 [12-25 26-37 | (AMI. 

Sus-Dep (N — 10) 72 | 80 | 78 | 76% 
Tru-Dep (N — 10) 71 | 59 | 65 | 65° 
ee fro choices) | 67 | 63 | 66 | 65 

N = 41 
Sus-Anx (N = 10) 42 | 46 | 58 | 49> 
Tru-Anx (N = 10) 42 | 54 | 48 | 48> 
Biete end choices) (NV | 33 | 37 | 34 | 35 

= 41 


A Significantly different from each other, p < .05. 
b Significantly different from the control group, ? < .05. 


block of 12 trials was correlated with the 
subsequent amount of learning. As a measure 
of the latter, we used simply the number of 
items correct on the last two blocks (25 trials). 
This seemed warranted because the suspicious 
and trusting samples within each criterion 
condition are almost identical in their initial 
rates of correctness. This measure of learning 
was plotted against frequency of showing on 
the first block of trials. The correlations were 
obtained separately for the two criterion 
conditions! and the regression of subsequent 
learning upon initial showing was calculated 
for each case. Using these regression lines it 
was possible to correct each subject's learning 
score for the amount of early showing he did. 

An analysis of variance of the corrected 
scores yields a significant schedule effect 
(p < .025) with the suspicious schedule 
yielding more correct responses on the lest 
two blocks of trials than the trusting schedule. 
Although most of the schedule effect is due to 
the difference within the depression criterion 
condition, the interaction between schedule 
and criterion is not significant. > 

Thus, even when initial rate of showing 55 
taken into account, it appears that ov! 
hypothesis is supported, clearly so with the 
depression criterion and not so clearly 50 for 
the anxiety criterion. There is greater learning 
under the suspicious trainer than under the 
trusting one. The greater decline in showing 
under the latter’s schedule during the seco" 

* Initial showing was correlated with later perform- 
ance only in the anxiety criterion condition (r — 46,2 
< .05). The correlation was not significant (r = ^ on 
in the depression condition. 


E 


“Suspicious” VERSUS “TRUSTING” TRAINING SCHEDULES 


and third periods (as noted in Table 1) is 
accompanied by a lower level of adherence to 
his criterion. 

Incidentally, another basis for evaluating 
the learning in the Sus-Dep condition is 
provided by nine subjects trained under the 
depression criterion who showed their responses 
all the time. They made 73 % correct responses 
on the first block of the training trials, 7396 
on the second block, and 81% on the last 
block, with an overall rate of 75% correct. 
While this group that received complete 
monitoring learned significantly more (p < 
405) than the control group (no training), it 
— appears they did no better than the partially 
monitored group working under the suspicious 
trainer. 


Feelings of Responsibility and Control 


No differences in these respects were ob- 
tained for subjects whose trainers used the 
“anxiety criterion, but some confirmation for 

_ Our third hypothesis appeared in the results 
for the depression subjects, where, on the 
basis of previously mentioned data, we should 
expect to find support for it. Specifically, on 

i tle posttraining questionnaire, subjects in the 
Sus-Dep condition expressed significantly 
Breater (p < .05) feelings of control in response 
to the question: “How much did you feel 
(the trainer’s) actions were dependent on 
What you did? That is, how much control did 
you Íeel your choices had on (the trainer's) 
actions toward you?" On another question, 

To What extent did you feel responsible for 
the kind of actions taken toward you?" 
E Tesults were similar but failed to reach an 

equate level of statistical significance 
(10 < $ < .20), 

4 everal other items from the posttraining 
q estionnaire give further insight into the 
The foll reactions to the two types of trainers. 
ois “ake dede between the suspi- 
Bethe 0s orate Ing samples were significant 

mo;, Pubjects thought the trusting trainer was 

b irem than the suspicious one. 

E Jects working under the trusting 
Benes ah, More satisfied with their per- 
‘the ies ‘an were those subjects who had 

"m cious trainer, 

Ds ARN thought the suspicious trainer 
itero oncerned for them to adopt his 

i 


ni 
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4. The suspicious trainer was regarded as 
more critical than the trusting trainer (al- 
though this difference was significant only 
with the anxiety criterion). 

It is only fair to add that a number of 
other questions failed to reveal any consistent 
differences between the suspicious and trusting 
samples. These concerned how much the 
subject felt influenced by the trainer's opinions, 
his reactions to being rewarded or punished, 
the perceived consistency of the trainer, and 
the extent to which the subject felt able to 
predict the trainer's reactions. 


Differences between the Criteria 


In terms of both learning and subsequent 
feelings of control, the experimental hypotheses 
were less convincingly confirmed for the 
anxiety criterion than for the depression one. 
Two differences between the criteria are to be 
noted as possibly accounting for this result: 

1. The anxiety criterion is somewhat less 
plausible to our subjects. As can be seen in 
Table 2, the control group shows a 2:1 pref- 
erence for the depression items as indicators 
of psychopathology. Furthermore, of 10 sub- 
jects who at the end of the training indi- 
cated they had rejected the trainer's criterion, 
7 were reacting to the anxiety criterion. 

2. The anxiety criterion seems somewhat 
easier for our subjects to verbalize. There are 
slightly more instances in the anxiety sample 
of correct descriptions of the trainer's criterion 
and fewer instances of confusion between the 
two or describing the criterion by reference to 
the symptoms excluded from it. Furthermore, 
there is a hint in Table 2 that the anxiety 
criterion is learned more rapidly, although 
this difference may merely indicate a ceiling 
effect operating in the depression samples. 

These attributes of the anxiety criterion 
can reasonably be considered as acting to 
reduce the differences between the suspicious 
and trusting schedules. On the one hand, if 
the category is easy to conceptualize, subjects 
may learn it from the trusting trainer before 
they learn to hide their choices from him. 
This would make for good subsequent per- 
formance whether or not they eventually 
learn to hide. On the other hand, some of 
the subjects who quickly learn that the trainer 
is using an implausible criterion are likely to 
reject it and show poor subsequent perform- 
ance even though they know full well what his 
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criterion is. Insofar as learning is quicker under 
the suspicious schedule, these persons would 
sooner manifest their rejection of the criterion 
in this condition and, in terms of our present 
hypotheses, would be deviant cases. However, 
they would really be exemplifying the superior 
learning that the suspicious schedule makes 
possible. If we assume that a trainee has to 
have a fairly clear conception of the criterion 
before he can decide to accept or reject it, it 
would seem reasonable that the condition 
that leads him to show more will be superior 
not only in producing better learning of 
fairly acceptable trainer demands, but in 
hastening rejection of unacceptable ones. 

Several exceptional cases in the anxiety 
condition do, in fact, illustrate these effects, 
but they do not, of course, provide conclusive 
evidence as to the nature of the differential 
effects of the two criteria. 


Discussion 


This experiment was designed to shed light 
on the situation where a trainee can act 
either to abet the trainer’s monitoring of his 
behavior or to defeat it. In everyday life 
continual surveillance by a trainer (parent, 
teacher, boss) of his trainee (child, student, 
employee) is usually not possible and probably 
not desirable. The trainee, then, is often faced 
with the same option that our subjects were 
confronted with: to help the trainer in his 
monitoring task or to make it difficult for him. 

In one condition, where a suspicious trainer 
was used, the subject had to show if he was 

to avoid the shock; here the trainer always 
assumed a mistake had been made unless the 
trainee himself made sure that the former 
actually saw his choice. Once the trainee had 
learned to show, he had a good chance of 
discerning the trainer’s criterion and, thus, of 
assuring himself of a certain invariance in 
outcomes. He could learn the ‘matching 
rule” that the trainer was using and the 
latter’s fate control could become behavior 
control. By virtue of bringing order into the 
situation the trainee could feel a kind of 
security and could regard himselí—to some 
extent at least—as responsible for the kind 
of outcomes he experienced. 

Contrast this condition with the other that 
made use of a trusting trainer. Here there is 
no compulsion to help the monitoring system 

for the trainer is an "old softie," an indulgent, 
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benevolent soul who thinks the best of his 


charges. Trainees are always innocent until 


proven guilty. Such a trainer retards the 
learning of his trainees by actually making it 
profitable for them to avoid "showing." 
Since he will sometimes reward erroneous 
choices he will end up by confusing those he is 
trying to teach as to what exactly he is trying 
to teach. He may then give the impression of 
arbitrariness and capriciousness as well as of 
indulgence and lack of concern about the 
training. Accordingly, trainees may feel less 
control over what happens to them. 

It is to be emphasized that we do not 
regard the trusting schedule as universally 
inferior to the suspicious one. In this experi- 
ment we have tried to create the conditions 
that work especially to the disadvantage of 
the trusting trainer. This disadvantage exists 
when he is unable to maintain complete 
surveillance over the trainee and when the 
latter is able to aid the monitoring by bringing 
his behavior to the trainer's attention. Within 
this situation, we have focused on the depend- 
ent variables of showing/hiding and learning 
and have only partially explored other conse- 
quences of the two schedules such as their 
effects on attitudes. Some of these effects are 
more positive for the trusting trainer: he is 
regarded as more generous and subjects are 
more satisfied with their performance. In any 
long-term relationship, “morale” considera- 
tions such as these may prove of greater 
consequence than the negative effects we have 
emphasized. Therefore, we would certainly not 
use these results as a basis for advocating 4 
general introduction of suspicion into inter 
personal relations. The negative consequences 
of distrust that other studies reveal (68 
Mellinger, 1956) also suggest caution on this 
point. On the other hand, we would not like to 
see totally disregarded the possible role 0 
suspicion in introducing stability into the 
trainee’s world—suspicion as defined in this 
study and under the special conditions here 
represented. 

A further limitation on generalization from 
this study stems from the fact that we havé 
examined only two rather extreme reactions 
to the absence of monitoring information. 
remains a further problem to determine the 
effects of a more neutral approach in whi 
the trainer makes no response or a noncom 
mittal one. Although the analysis becomes 10° 
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complex to go into here, we would expect this 
schedule to have an effect similar to that of 
the trusting trainer if the trainee obtains 
considerable intrinsic gratification from the 
wrong response and the trainer imposes an 
appropriately heavy penalty for this response 
when he detects it. 

This experiment deals with the behavior of 
the trainee and his reactions and feelings 
consequent on the programed responses of 
his trainer. It is an important question, 
obviously, how real trainers react in situations 
of this sort. One might expect, for example, 
that trainers would adopt a “trusting” 
orientation toward quick learners, but a 
“suspicious” orientation toward dull trainees. 
Tf this were done too soon in the learning 


- process, however, the learning of the bright 


trainees might be delayed and the learning 
Curves of the two groups might then show 


- an unexpected convergence. The difficulty of 


 ___—--  * 


the learning task might have similar effects. 
These are problems for future research. 


SUMMARY 


This study compared two schedules used by 
4 trainer when he was not able to monitor 
the behavior of his trainee. In a suspicious 
Condition, unless the trainee presented him 
With evidence to the contrary the trainer 
Usually acted on the assumption the trainee 
Was in error, In the other, a trusting condition, 
Unless presented with contrary evidence the 
trainer usually assumed the trainee was 
Correct, Each undergraduate subject, serving 
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as “trainee,” made successive judgments as to 
which of two types of symptoms indicated 
the greater degree of mental illness, whether 
an anxiety symptom or a symptom of de- 
pression. He also had the option of bringing 
each choice to the attention of the trainer, a 
“qualified” graduate student, who then 
gave an evaluation of the subject’s choice. 
For half the subjects, the trainer considered 
the anxiety choices as correct and for the 
other half, the depression choices. 

When the depression criterion was used the 
results were clearly in favor of the suspicious 
training schedule, as hypothesized. As com- 
pared to subjects working under the trusting 
trainer, subjects under the suspicious trainer 
were more willing to bring their choices to 
his attention, learned to a greater degree to 
use his criterion (subjects under the trusting 
trainer were not different in this respect from 
a control group), and expressed greater 
feelings of control over the trainer’s actions 
toward them. In the case of subjects to whom 
the trainer applied the anxiety criterion, the 
suspicious-trusting differences were less clear- 
cut although generally in the directions 
predicted. 
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ROLE PLAYING VARIATIONS AND THEIR INFORMATIONAL 
VALUE FOR PERSON PERCEPTION! 
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Duke University 


(1944, 1958) persistent concern with 

phenomenological analysis, much of 
the recent research in social perception has 
addressed itself to the naive psychology of 
the individual perceiver. How do individuals 
use the behavior of others to infer the probable 
existence of more enduring personal charac- 
teristics? What are the bases for social evalu- 
ation that in turn color the impressions one 
forms of another? What information is ignored 
and what information is made central in the 
formation of an impression? A number of 
investigators have sought a partial answer to 
these questions by assuming that a basic 
feature of naive phenomenology is the assign- 
ment of observed behavior to psychological 
causes. It seems logical to propose, for example, 
that behavior whose locus of causation lies 
within the person is more relevant to inferences 
about his particular characteristics than 
behavior that is induced or constrained by 
external events. The present investigation was 
designed to demonstrate this proposition with 
specific reference to the adoption and per- 
formance of social roles. 

The concept of role has had a lively and 
controversial history in the literature of social 
science. It is often treated as a crucial bridging 
concept since it concerns the relations between 
social requirements and normative expecta- 
tions on the one hand, and individual percep- 
tions and behavior on the other. Controversy 
has surrounded the many attempts to define 
role, as Levinson (1959) notes, because these 
attempts have vacillated between viewing 
role as an aspect of social structure and 
viewing it as a description of socially relevant 
individual behavior. In the present paper, 
the concept of role refers to role demands 
rather than actual behavior. Role is herein 
treated as a set of expected behaviors implicit 
in the instructions to a stimulus person. These 
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instructions define the impression the stimulus 
person should attempt to create in presenting 
himself to an interviewer, and variations in 
behavior given this role definition represent 
the major independent variable. 

The present treatment of role is quite 
consistent with any other treatment that 
stresses the shaping of individual responses by 
Social expectations or externally imposed 
norms. The point has often been made that 
general adherence to relevant sets of social 
norms is very important in facilitating social 
interaction. Particularly in organizational 
contexts, but by no means exclusively there, 
many social interactions can be effectively 
described in terms of the interplay of appro- 
priate role behaviors. Jones and Thibaut 
(1958) have emphasized the economic signifi- 
cance of such interactions between roles as 
reducing the need for inferences about idiosyn- 
cratic personal characteristics. The comple- 
ment of this point is that behavior appropriate 
to role expectations has little informational 
value in highlighting these individual charac 
teristics. 

To folow this line of reasoning a little 
further, roles facilitate interaction and the 
Social cognitions that support it. The naive 
person has his own repertory of role constructs 
that help to anchor his perceptions of the 
Social environment and to endow it with the 
necessary stability for planful action. On the 
other hand, the performance of social roles 
tends to mask information about individual 
characteristics because the person reveals 
only that he is responsive to normative 
requirements. If these requirements are Un- 
clear or conflicting, of course, he may 1 
veal something about himself by the way ™ 
which he defines and displays appropriate 
behavior. The stronger and more unequivoc 
the role demands, however, the less inform 
tion is provided by behavior appropriate t0 
the role. Following our introductory comments 
this conclusion may be derived from c0" 
sidering probable differences in the attribution 
of phenomenal causality. When a person? 
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behavior is very much in line with clear and 
potent social expectations, we tend to treat it 
as externally caused and uninformative with 
regard to a wide range of personal charac- 
teristics. When it departs from normative 
expectations, on the other hand, we tend to 
locate the cause for the departure in moti- 
vational forces peculiar to the person. We may 
assume, of course, that he misperceived the 
expectations, but we would then wish to push 
on to determine the motivational sources of 
this perceptual distortion. 

From the perceiver’s point of view, the 
behavior of a stimulus person which departs 
from role expectations takes on special 
significance for appraising the latter's personal 
characteristics. In assessing the motives behind 
such a departure from role, the perceiver must 
view the sample of behavior available against 
the background of role specifications. In 
general, our inferences from behavior to 
Personality must take into account the 
stimulus conditions eliciting the behavior. This 
is no less true when the stimulus conditions 
consist of clearly established role expectations. 

tn attempting to predict the nature and 
direction of inferences, given a sample of 
behavior that departs from role expectations, 
a number of factors must be considered. For 
One thing, there may be tendencies for the 
perceiver either to minimize or maximize the 
ue of the departure. In organizing his 
Bn of the stimulus person, the per- 
E d assimilate the latter's behavior 
the Po the role specifications governing 
inferring ery ani avoiding the problem of 
Rees oe characteristics. Alternatively, 
eae i" e a contrast effect in that the 
fal is we becomes cognitively salient 

ieee ed as departing even more from 
Eu: wm was actually the case. We are 
Fese en a position to choose between 
ttis d possibilities, or to specify the 
trast, The avoring assimilation versus con- 
measure of present study does provide a 

of memory distortion, however. 

RS arnt S that assimilation does not occur, 
teal: es incompletely, the perceiver's 
On his cis m unique characteristics rest 
ture from m o understand why the depar- 
Undoubtedi € expectations took place. 
achieve 4 4 Some departures are intended to 
umorous effect (the exchange 
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of “friendly insults’ between collaborators 
on a task); others are intended to play down 
role characteristics that might be offensive 
(the “soft-selling” salesman); still others 
stem from motives of rebellion and non- 
conformity. In the typical case, however, 
departure from role suggests a pattern of 
motivation and skill that is at variance with 
specific role requirements. The individual 
does not play the role because, somehow, he 
cannot or will not. In such cases, personality 
seems to override role expectations or to 
color role performance in a unique and signifi- 
cant way. The most probable inference from 
role departures of this type is that the person 
reveals something of his “true self" through 
his failure to perform the expected role. 

The present investigation treats this last 
conjecture as a proposition. An experiment 
was designed in which stimulus persons were 
instructed in one of two patterns of role 
performance. The behavior of the stimulus 
person was arranged to be consistent with 
either the first or the second of these patterns, 
thus, creating two experimental treatments 
where the person's behavior was "in role" 
and two treatments where it was clearly 
“out of role.” The general hypotheses prompt- 
ing the study were: 

1. Persons performing in line with role 
expectations reveal little of value for assessing 
their personal characteristics. When asked to 
describe such a person, subjects do so with 
little confidence and tend to avoid extreme 
statements. 

2. Persons whose performance departs from 
role expectations reveal their personal charac- 
teristics through the direction and form of 
this departure. Their behavior is judged as 
internally caused and forms the basis for 
direct inferences about personal characteristics, 
characteristics that may be judged with 
confidence. : 

3. Roles do, however, serve as an organizing 
function in person perception. Because of its 
predictive value, in-role behavior is more 
accurately recalled than behavior that departs 
from role expectations. 

Note that no specific hypothesis was 
formulated conceming the possibilities of 
assimilation and contrast, but data bearing 
on these possibilities are to be presented. 
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METHOD 
Subjects 
One hundred and thirty-four male undergraduates 
participated as subjects in groups ranging in size from 
5 to 20. Since the experimental design consisted of four 
treatment variations, an attempt was made to assign 
approximately equal numbers of subjects to each condi- 
tion. The actual cell frequencies varied from 31 to 37. 
As one attempt to control for individual differences in 
orientation to others, care was also taken to compose 
the treatment groups of approximately equal numbers 
of high, middle, and low scorers on Christie’s Mach IV 
Scale. This scale was originally designed to measure 
differences in the tendency to endorse Machiavellian 
sentiments. A high score reflects a tough minded, cyni- 
cal, and somewhat opportunistic attitude toward 
others; low scorers are more inclined to value affective 
involvement with others and to feel that social rela- 
tions should be governed by strict ethical norms. A 
description of item content and some sampling com- 
parison data may be found in Christie and Merton 
(1958). Studies by Jones and Daugherty (1959) and 
Jones, Gergen, and Davis (in press) provide data on the 
experimental validity of the scale. Though the present 
study was not designed to validate the Mach IV Scale, 
and the scale served mainly as a control variable, the 
Mach IV score was included as a potential source of 
variation in the analysis of data. It was felt that the 
highs and the lows might respond differently to the 
experimental conditions, though no specific hypotheses 
involving Machiavellianism were formulated. 


Procedure Overview 


Each experimental session began with a brief intro- 
duction in which the experimenter described the study 
as a problem solving task involving judgments of 
another person. Subjects were instructed to listen care- 
fully to a tape recorded interview between a psycholo- 
gist and a student, in which the student would be in- 
structed to play a particular role. The tape recording 
began with the psychologist giving explicit instructions 
to the stimulus person (SP) about the interview to 
follow. Although the recordings were actually based on 
carefully constructed scripts, an attempt was made to 
convince the subjects that the SP was given no in- 
formation about the interview and his role that did not 
appear on the tape. Thus, the taped instructions em- 
phasized that the SP was to present himself in the in- 
terview in such a way as to impress the interviewer 
that he was ideally suited for a particular job. In inter- 
views played to different groups of subjects one of two 
jobs requiring radically different personal qualifications 
was described. In this way the content of the tole was 
manipulated. The SP was told in the recording to “be as 
honest as you can unless you think another answer 
would help your chances better of getting the job.” In 
the interview that followed, SP answered some standard 
questions about his background in a neutral fashion and 
then responded to a series of choice items some of which 
were clearly relevant to the job for which he was apply- 

ing. In responding to these items, the SP either gave 
answers appropriate to the job description he had been 
given, or answers which revealed markedly different 
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preferences. Thus, the design involved presentation of 
four different stimulus patterns, two of which were "in- 
role" and two of which were “out-of-role.” 

After listening to the interview, each subject was 
asked to state his general impression of the SP and to 
fill out in succession the following dependent vari- 
able forms: given the same choice items to which the 
SP responded in the interview, subjects were asked to 
reconstruct from memory SP's response to each item; 
after this choice test form was collected the subject was 
handed an identical form and instructed to indicate 
how the SP would have responded if he were being 
completely honest in describing himself; finally, each 
subject was given a 16-item trait-rating scale and in- 
structed to rate the SP, and indicate his subjective con- 
fidence for each rating. The stimulus materials and de- 
pendent variable forms are more fully described below. 
Stimulus Variations 

As implied by the foregoing discussion, the experi- 
mental design called for the construction of four sepa- 
rate tapes to be played as the stimulus pattern for in- 
dependent groups of subjects. These tapes varied along 
two cross-cutting dimensions: on half of the tapes the 
job described was that of a submariner, on the remain- 
ing half the job was that of an ‘“‘astronaut” in training 
for space flights; on half of the tapes the job description 
was followed by a set of responses appropriate for the 
submariner job (other-directed pattern), on the re- 
maining half the responses were more appropriate for 
the astronaut job (inner-directed pattern). The four 
tapes were actually constructed by separately recording 
these four segments, always with the same person 
reading the part of the SP, and splicing them into the 
following combinations: Submariner-Other, Submari- 
ner-Inner, Astronaut-Inner, and Astronaut-Other. 5 

Role descriptions. Presentation of the submariner* 
role was prefaced by a reference to the capacities of 
atomic submarines and the corresponding qualities 
necessary for adjusting to the social conditions of sub- 
marine life. The following excerpts give both the flavor 
and some of the content of the role description: 

People at the submarine school are pretty sure they 

know quite a bit about the kind of person who adapts 

well to submarine life. . . . The main thing they lod 

for is stability and good citizenship . . . constant C 

operation with others is essential... willingness to 

tolerate routine... not supposed to think for him 
self . . , sticks to the rules. . . . Since submariners atè 

in such constant contact with each other, it’s im: 

portant, of course, that the good submariner enjoys 

other people around, that he be relaxed and friendly 

and slow to irritate. i 

Presentation of the astronaut's role capitalized v. 
the timely issue of sending a single man into space. ^" 
role description is suggested again by the following &* 
cerpts: d 

One of the most difficult requirements of space t1? * i 

at least in its early stages, is that it will most m 

involve a man’s being isolated from virtually all po 

man contact for long periods of time . . . looking 

men who don't need to have other people around « v 

inner resources and the ability to maintain concen: 

tration without stimulation from others.-+ alerts 

imaginative, resourceful... . 
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These particular role descriptions were designed, of 
course, without regard to truth value, solely to em- 
phasize plausibly two sets of qualities that might best be 
described as other- versus inner-directed. 

Behavior samples. SPs on each of the four tapes re- 
sponded in an identical fashion until they were asked 
by the interviewer to take a “choice test which has 
recently been devised to indicate how well a person will 
fit into various niches in life. . . ." The test that followed 
consisted of 22 items, each comprising a pair of state- 
ments. The SP was instructed to choose the member of 
each pair which was more characteristic of himself and 
to indicate his certainty on an 11-point scale. Half of 
the items were buffer items not specifically relevant to 
the difference between roles and always answered in the 
same manner by the SPs. The remaining 11 items were 
“critical” in reflecting the intended difference between 
inner- and other-directed response patterns. The fol- 
lowing examples indicate some of the pair members en- 
dorsed by the SPs in the two behavior samples: 


Other-Directed 
3. I always like to sup- 
port the majority. 
9. I would like to be a 
door-to-door sales- 


Inner-Directed 
I like to feel free to do 
what I want to do. 
I would like to be a 
forest ranger. 


man. 
17. When planning some- 


T When planning some- 
thing I always seek thing I like to 
Suggestions from work on my own. 
others. 


21. I like to know how I 
Other people think 
I should behave. 


avoid situations 
where I am ex- 
pected to behave 
in a conventional 


p way. 

22. 1 like to settle argu- I like to attack 
ments and disputes points of view that 
of others, are contrary to my 

own. 


For each of these item pairs, the SP orally endorsed 
8 ene appropriate to the condition and indi- 
S a degree of his certainty. Degree of certainty 
(ils ned so that the same scale positions were 
ds : the same number of times for each behavior 

ple. This was to equalize any tendency to regress 


te * : 
Ward or away from less certainty in the memory task. 


Impression Rating Scale 


ral final task of the subjects was to record their 
eae of the SP in terms of a 16-item rating scale. 

by iach consisted of two polar adjectives separated 
Rhine js 5 points. To the right of each item was a 5- 
EUM e scale for that item. Thus, subjects 
ike and E icate what they thought the SP was “really 
the item. ‘ow confident they were of each rating. Ten of 
whose A ere chosen to reflect five distinct clusters 
tested, E, ns were relevant to the hypotheses being 
gesting ak cluster consisted of two items, each sug- 
direction aspect of cluster meaning but balanced for 
s the © inhibit tendencies toward response set. 
dengue conformity Cluster consisted of the pairs 
Other Mies ponent and “creative-unoriginal.” 
and canı dc ERE affiliation, intelligence, motivation, 
ach other or], € remaining six items were related to 
cold, popul y in their strong evaluative tone: warm- 

» Popular-unpopular, likable-irritating, etc. 
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TABLE 1 
PREDICTIONS Or SP's True RESPONSES 
(Summary of Analysis of Variance* and / test Results) 


Source dj | MS F j|Grup N| X n 
A. Astro-Sub | 1 |1,554.73] 63.56**AO | 33 | 91.121, e 
B. Inner-Other | 1 |1,764.91] 6&.27**|5O — | t | 69.26" 
C. Mach 2| 18.81 
AXB 1| 8.63 AI | 33 | 69.15 
AXC 2| 62.67 Sr | 37 | 43.9958" 
BXC 2| 31.23 
AXBXC 2| 22.84 
Error 122| 27.46 

Total 133 


"Using the approximation technique for unequal cell 
frequencies (Snedecor, 1946). 
** p < 001. 


In analyzing the results, cluster scores were derived 
simply by adding the scale placements for the two com- 
ponent items, naturally reversing the score of one item. 
Confidence scores were similarly derived by simple 
addition of scale values. 

RESULTS 
Predicted Differences in Perception 

In line with the general hypotheses of the 
study, the first specific prediction was that 
SPs performing out-of-role (Astro-Others and 
Sub-Inners) would be perceived as revealing 
their true preferences in the simulated inter- 
view more than SPs performing in-role 
(Astro-Inners and Sub-Others). The most 
direct test of this hypothesis may be found 
in the data provided by the subjects in trying 
to indicate how the SP would have responded 
in the interview if he was being completely 
faithful to himself. These data are summarized 
in Figure 1a and in Table 1. It is clear that 
the prediction is confirmed. The Astro-Other 
and the Sub-Inner SPs are both seen as 
revealing their true preferences, though 
there is a slight and understandable regression 
toward the mean. Interestingly enough, and 
quite in line with predictions, the two in-role 
groups locate the true answers of the SPs 
almost exactly half-way between the other- 
directed and the inner-directed pattern. 


Since behavior appropriate to powerful role 
requirements generally masks the charac- 
teristics of the actor, the perceivers apparently 
feel that their best guess is a completely 
neutral one. 

‘The next issue to be raised is the extent to 


which these results are restricted to the 
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DEGREE OF PERCEIVED AND 
RECALLED OTHER DIRECTEDNESS 


o 


ASTRO SU 
O I O I 


IN FOLE 
A. Predictions 


1a. Predictions of "true answer patterns" (Means). 


IN ROLE 
B. Recall 


1b. Recall of answers actually given (Means). 


Fic. 1. Degree of perceived and recalled other directedness. 


particular pattern of items covered in the 
recorded interview. That is, do the highly 
significant results summarized in Table 1 
merely reflect rational manipulations of the 
specific response scale used by the SP, or do 
they generalize to related measures of per- 
ception and inference? Recall that the subjects 
also filled out a bipolar adjective rating 
scale, in an attempt to express their appraisal 
of the SP's true characteristics. The items 
of this scale were members of small clusters 
of related traits, varying in their relevance to 
the dimension of inner-other directedness. 
Two of the most relevant clusters, by a priori 
judgment, were attempts to measure percep- 
tions of affiliation and conformity. It was 
predicted that the two out-of-role SPs would 
be perceived to differ markedly in both 
affiliation and conformity and that the two 
in-role SPs would not. The results, as sum- 
marized in Table 2, clearly confirm this 
prediction. The Astro-Other is seen as signifi- 
cantly more affiliative and conforming than 
the Sub-Inner, and each is seen as differing 
significantly from its in-role control. Analyses 


of variance indicate that Machiavellianism 
contributes to no significant effects and there 
are no significant interactions between role 
and behavior sample. This result seems to 
indicate that genuine perceptual decisions 
have been made that involve assessing the 
meaning of the behavior sample provided by 
the SP against the background of the role 
playing instructions imposed on him. — 
Since the predictions involving the direction 
and magnitude of perceptual rating differences 
have been borne out, the next relevant que 
concerns the confidence with which perceptua 
judgments were made. The general hypothesis) 
it wil be recalled, predicted that in-role 
ratings would be less confidently made than 
ratings of out-of-role behavior. This prediction 
could be easily tested since the subject rendere 
a judgment of confidence with respect to ET 
trait rated. The most precise test of Y : 
general prediction involves confidence er 
based on the two clusters most relevant s 
the differences between role: affiliation s 
conformity. When confidence ratings on Ww 
clusters are summed for each individual, * 
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TABLE 2 
PERCEPTIONS OF AFFILIATION AND CONFORMITY 
Aoo | Astro: | Sub- | sub. | = oompa 
Other | Inner Other [Inner 
Mens Ss s Direction L 
Affiliation 
E 15.27 | 11.12 [12.00 | 8.64 [AO > SO  4.02**; 
SD 2.92 | 3.8 | 3.53 | 4.73 [AI > SI |2.12* 
Conformity 
2x 15.91 | 13.09 |12.58 | 9.41 |AO > SO |4.02**; 
SD 3.2» | 3.42 | 3.39 | 4.95 |AE > SI |3.65** 


Note.—The higher the mean value, the greater the perceived 
affiliation or conformity. Comparisons between AO and SI are not 
tabled, but the differences between these conditions would of 
course be highly significant. 

*p«.05. 

** p < 001. 


TABLE 3 
CONFIDENCE RATINGS BY TREATMENTS 
(Analysis of Variance? Summary) 


Source df Aibiiatign and 


MS F 

A. Astro-Sub 1 .07 

B. Inner-Other 1 1.62 3.29 
C. Mach 2 .26 

AXB 1 8:58» — 17.42** 
AXC 2 33 

BXC 2 En 

AXBXC 2 AT 

Within 122 49 

Total 133 


Using the approximati ii r unequal cell 
)»proximati te 

1 pen chnique foi l cel 

Results of tests of individual mean comparisons: AO > SO, 


$= 3.30, > < 01 SI > AL t= 
4 „ t = 2.55, 02. 
5 STA 55, p < .02. 


resulting pattern clearly confirms the hypoth- 
bois d 3 shows, the interaction between 
3h ni behavior sample is highly significant. 
Manet the individual mean comparison, in 
a" Case the subjects feel more confident 

ut rating the SP who is behaving out of 


tole than the SP s; E 
Way in role. who is behaving the same 


Differences in Recall 


: i "i Of their tasks, all subjects were 
<< to reproduce the responses made by the 


i ; 3 : g 
= oe Simulated interview. The fidelity of 
ways ae at recall was treated in two 
each ES irst, a recall score was computed for 


bject involving the degree of discrep- 
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ancy in scale points for each item, summed 
without regard to direction. For convenience 
we may call these absolute error scores. 
Subjects were expected to make fewer errors 
in recalling the behavior samples of the in-role 
treatments than those in the out-of-role 
treatments. The assumption was that roles, 
like all categories that summarize relevant 
information, facilitate cognitive organization 
and enhance one's ability to predict behavior 
that is appropriate to the role. In the present 
case, the behavior sample available in the 
in-role treatments tended to confirm the 
expectations established by the role instruc- 
tions. For subjects in the out-of-role treat- 
ments, the role instructions could not be used 
to organize and predict the behavior which 
occurred, except insofar as the subjects were 
led to adopt a clear negative expectation that 
the behavior was the opposite of that called 
for. 

The results on this measure of absolute 
recall confirm the hypothesis. The responses 
of the in-role SPs (Astro-Inner and Sub-Other) 
are recalled with greater accuracy than the 
responses of the out-of-role SPs (Astro-Other 
and Sub-Inner) As Table 4 shows, the pre- 
dicted interaction is significant (p < .05). Most 
of this interaction effect comes from the two 
cells in which the SP is other-directed (Sub- 
Others > Astro-Other, = 2.60, p < .01; 
Astro-Inner > Sub-Inner, t = .64, p < .50). 
There is no obvious reason for this difference 
unless the submariner role was more helpfully 
predictive in organizing information about 
the other-directed behavior sample than was 
the astronaut role in organizing information 
about the inner-directed sample. 

The recall data were also scored to take 
account of the direction of deviation from 
accuracy. These data are relevant for answer- 
ing any questions dealing with the assimilation 
of recalled responses to categories implied by 
the roles. In fact, as Figure 1b shows, there was 
no evidence of either assimilation or contrast 
in directional recall of the SP's interview 
preferences. Those errors which the subjects 
did make (see the foregoing data on absolute 
errors) were quite evenly distributed on either 
side of the true scale position. As far as the 
mean directed error scores are concerned, then, 
the group recall accuracy was extremely high. 
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TABLE 4 TABLE 5 
ABSOLUTE RECALL ERRORS BY TREATMENTS SELECTED TRAIT AND CLUSTER MEANS By TREATMENTS 
(Analysis of Variance* Summary) WITH ÁPPROPRIATE COMPARISONS 
b- | Así Sub- 
SERA af Lg i ue joder Taner |Lamer| Relevant 
Clusters...| n= |n=| n= n= Comparisons 
A. Astro-Sub 1 3.89 33, 3 2r E 
B. Inner-Sub 1 1.36 
(oh Mach 1 aes 2n RDUM 
AXB . . 9.97 |12.10| 9.64 |10.92 
AXC 2 3.61 SD 347 | 3.66| 4.48 | 4.09 | SO > AQP t 
AXBX . 8.00 11.13] 9.39 | 9.84 
Within 122 1.54 SD 3.35 |415| 3.72 | 3.94 | SO > ^00 <a 
Candor 
Total 133 x 12.42 | 9.68 | 10.09 |12.08 | AO > SO, p < A; 
SD 4.09 | 3.61) 3.68 | 3.77 SI > AL » € 45 
^ Using the approximation technique for unequal cell 


frequencies (Snedecor, 1946). 

b Themeans for the four cells represented by this interaction 
were: X(Astro-Other) = 1242; X(Sub-Other) = 9.52; X (Astro- 
Inner) = 10.00; X (Sub-Inner)= 10.56. 

"p< 05. 


Of course, we cannot state on the basis of 
these results that distortions from accurate 
recall of social behavior are always random 
distortions, but in the present case there is no 
evidence for directional errors either toward 
or away from the role category implied by the 
instructions to the SP. 


Perception of Additional Characteristics 


In planning the experiment, it seemed at 
least conceivable that many subjects would 
attribute the SP’s out-of-role performance in 
the Astro-Other and Sub-Inner conditions 
to poor motivation, lack of intelligence, or 
both. If such were the case, then it would 
not necessarily follow that the behavior 
sample provided would be taken as a “true” 
reflection of the SP’s affiliative and conforming 
‘tendencies (or the lack thereof). We have seen 
that most subjects did consider the out-of-role 
performance to be reflections of these ten- 
dencies, but it is still of interest to note their 
ratings of the SP on trait clusters tapping 
perceived motivation and intelligence. As 
Tabel 5 shows, the Sub-Other SP was seen as 
more highly motivated and intelligent than 
the Astro-Other SP (p < .01), but this 
expected trend was slightly reversed in the 
case of the two inner-directed SPs. It would 
appear, then, that the in-role SP is judged to 
have greater motivation and intelligence 
only when the role involves volunteering 
for the submarine service. There is no obvious 


reason for this difference in response to the 
two roles. Perhaps the inner-directed pattem 
seemed more artificial or obvious in the context 
of astronaut instructions, or perhaps the 
meaning of motivation and intelligence 
became less situation-bound when subjects 
were asked to appraise a. truly inner-directed 
man. 
Also of interest are the ratings of perceived 
candor. It might be expected that performing 
out-of-role would be construed as evidence of 
the SP's frankness and sincerity. The results 
in Table 5 do show a clear difference between 
in-role and out-of-role SPs with regard to the 
perception of candor. As expected, the in-role 
SPs are judged by the average subject to fall 
at or near the midpoint of the scale (10.00 
for the cluster); the out-of-role SPs are sec 
as significantly more candid in each variation 
of role instructions. | 
Results on the remaining traits add Jittle 
to the picture already presented. For the 
most part the means fall into a pattern similar f 
to those reflecting perceived affiliation and 
conformity. That is, the two in-role SPs até 
perceived to be relatively neutral on most af 
the evaluative trait dimensions; the Astro 
Others are seen as significantly more likable 
(versus irritating), warm (versus cold), 
popular (versus unpopular), and helpful 
(versus disinterested) than the Sub-Inne* 
This pattern of findings might suggest the 
operation of a strong “halo” or "generosi 
effect favoring those who are perceived % 
other directed. However, it must be recall 
that the Astro-Other SP is seen as less high!’ 
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motivated and less intelligent than the 
Sub-Other SP, personal attributes that are 
usually considered to be quite evaluative. 
Also, Astro-Other and Sub-Inner SPs are 
both seen as more candid than their in-role 
controls. It seems more likely, then, that 
evaluative traits like warm, popular, likable, 
and helpful are linked to other-directedness 
more by direct association than through an 
underlying decision that the Astro-Other 


— SP is good and the Sub-Inner SP is bad. This 


conclusion is supported by two further results: 
to an extent that is nearly significant, the 
two out-of-role SPs are seen as more interesting 
(versus boring) than the two in-role SPs; 
also, the Astro-Inner SP is seen as significantly 
more conceited (versus self-effacing) than 
either the Astro-Other or the Sub-Inner SP. 
While these incidental findings are not all 
easy to rationalize, they do point up the com- 
plexity and subtlety of the cognitive impres- 
sions created by the experimental combinations 
of role and behavior sample. 


Machiavellianism 


While the study was designed without any 
consideration of the possible role of Machiavel- 
lianism as measured by Christie's Mach IV 
Scale, it did seem possible that high scorers on 
the Mach Scale might be generally more 
Sensitive to variations in role instructions and 
More inclined toward negative evaluation of 
the SPs who were unable or unwilling to play 
the prescribed role. 
ae d the major analyses, Mach level was 
Eus routinely as a potential source of 
Eus no case did it contribute to 
Tim nt main or interactive effects. Either 
E. Econ manipulations were simply too 

Owerful for individual differences on this 
E to manifest themselves, or our 
WN : ge of Mach Scale correlates is insuffi- 
gud ke meaningful predictive state- 
E e view of the results of a recently 
is press) wd by Jones, Gergen, and Davis 
v^ un e latter alternative seems quite 
iis ith respect to one meaningful 
E AN however, Machiavellianism does 
rd deg „contribution. The high 
tothe o to attribute greater intelligence 
Eloy role than the out-of-role SPs whereas 

Scorers show the reverse pattern in 
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perceiving intelligence. This difference between 
highs and lows is significant (t = 2.15, p < 
.05). A similar trend was noted with regard 
to perceived motivation, but this fell short of 
significance (! = 1.62, p < .15). Though the 
differences in perceived intelligence are 
consistent with what little knowledge we do 
have about high and low Mach scorers, there 
is little else in the data to suggest that the 
variable is relevant to the perception of in-role 
and out-of-role SPs. 


DISCUSSION 


The results of the present study are un- 
equivocal. Starting from the assumption that 
individual characteristics are obscured when a 
person is exposed to strong and demanding 
stimulus forces, we have reasoned that social 
stimuli embraced by the role concept may 
operate in this same way. Thus a person who 
conforms to salient social expectations reveals 
little about his basic and distinguishing char- 
acteristics. On the other hand, one who rejects 
or ignores pressures to play a defined role is 
considered to reflect his true disposition and 
is perceived with confidence. 

Tt is undoubtedly true that this reasoning 
is ubiquitous in the psychologist’s approach 
to personality assessment. In many programs 
of assessment, the patient or subject is exposed 
to a variety of situational pressures and task 
demands. Of his responses to these situations, 
his nonmodal reactions are clearly more 
informative and carry the most interpretive 
weight. To take another example, psychological 
screening for desirable jobs must cope with 
the problems raised by this study in order to 
be effective. Since the role constraints for the 
applicant are often obvious, the interviewer or 
employer must penetrate to more subtle 
cues or fall back on projective devices which, 
though unreliable at best, at least produce the 
response variety essential for individualized 
judgment. 1 

It is probably true that short-term social 
interactions can perfectly well proceed in line 
with established expectations defining re- 
ciprocal roles. If the interaction is self-limiting 
(as, say, between a hotel guest and his bellhop) 
there is little need for personalized information 
to sustain the interaction. When a relationship 
is more permanent, however, and involves 
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the prospect of interactions in varied situ- 
ations, such information rapidly increases in 
value. It is to the perceiver’s advantage, in 
such cases, to be especially attuned to out-of- 
role behaviors and to create situations in 
which out-of-role behaviors can be most 
clearly observed. The results of the present 
study show that such behaviors, rightly or 
wrongly, are perceived to be peculiarly 
diagnostic of individual characteristics. Judg- 
ments about these personal qualities are 
presumably most important in governing 
the perceiver’s behavior as he ventures into 
new situations with the SP. 


SUMMARY 


An experiment was designed to test the 
general proposition that behavior which is 
appropriate to a clearly specified role is 
relatively uninformative about personal char- 
acteristics. Subjects were asked to listen to a 
recorded interview in which the interviewee 
was heard being instructed, in two treatment 
conditions, to respond “as if" he very much 
desired to be accepted in the submarine 
service, and in two treatments as if he wished 
to qualify as a space astronaut. The qualifica- 
tions of these two positions were described 
in such a way that dramatically different 
personal qualifications were required. As the 
interview proceeded, the interviewee either 
responded in line with the qualifications 
described for the astronaut position (inner- 
directedness) or with those for the submariner 
position | (other-directedness). Thus, in a 
four-cell design there were two cells of in-role 
behavior to be judged (Astronaut-Inner and 
Submariner-Other) and two cells of out-of-role 
behavior (Astronaut-Other and Submariner- 
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Inner). Reasoning from the importance of 
distinguishing between self-caused and exter. 
nally induced behavior in person perception, 
the following predictions were made: 

1. Out-of-role SPs are perceived to be 
revealing their true characteristics more than | 
in-role SPs. 

2. Out-of-role SPs are rated with greater | 
confidence on the dimensions relevant to 
role performance. | 

3. The performance of in-role SPs is more 
accurately recalled than that of out-of-role 
SPs. 

Each of these predictions was strongly 
confirmed by the results. 


REFERENCES 


Curistre, R., & Merton, R. K. Procedures for the 
sociological study of the values climate of medical 
schools, J. med. Educ., 1958, 83, 125-153. 

HkrDER, F. Social perception and phenomenal causality. 
Psychol. Rev., 1944, 51, 358-374. 

Hemer, F. The psychology of interpersonal relations, 
New York: Wiley, 1958. 

Jones, E. E., & Daucuerry, B. N. Political orienta- 
tion and the perceptual effects of an anticipated 
interaction. J. abnorm. soc. Psychol, 1959, 69, 
340-349. 

Jones, E. E., Gercen, K. J., & Davis, K. E. Some de- 
terminants of reactions to being approved or dis- 
approved as a person. Psychol. Monogr., in press. 

Jones, E. E., & Tnrsavr, J. W. Interaction goals as 
bases of inference in interpersonal perception. In 
R. Tagiuri & L. Petrullo (Eds.), Person perception 
and interpersonal behavior. Stanford: Stanford 
Univer. Press, 1958, Pp. 151-179. 

Levinson, D. J. Role, personality, and social structure 
in the organizational setting. J. abnorm, 50^ 
Psychol., 1959, 68, 170-180. 

SwEDECOR, G. W. Statistical methods. (4th ed.) Ames 
Towa: Collegiate Press, 1946. 


(Received September 23, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 63, No. 2, 811-318 


IDENTIFICATION AS A PROCESS OF INCIDENTAL LEARNING! 


ALBERT BANDURA AND 


ALETHA C. HUSTON 


Stanford University 


much of a child’s behavior repertoire 
is believed to be acquired through identifica- 
tion with the important adults in his life. This 
process, variously described in behavior 
theory as “vicarious” learning (Logan, Olm- 
sted, Rosner, Schwartz, & Stevens, 1955), 
observational learning (Maccoby & Wilson, 
1957; Warden, Fjeld, & Koch, 1940), and 
tole taking (Maccoby, 1959; Sears, Maccoby, 
& Levin, 1957) appears to be more a result 
of active imitation by the child of attitudes 
and patterns of behavior that the parents 
have never directly attempted to teach than 
of direct reward and punishment of instru- 
mental responses. 

While elaborate developmental theories 
have been proposed to explain this phenome- 
Bod; the process subsumed under the term 

identification” may be accounted for in 
terms of incidental learning, that is, learning 
that apparently takes place in the absence of 
an induced set or intent to learn the specific 
behaviors or activities in question (McGeoch 
& Irion, 1952). 
oe the parents’ social training of a child, 
Uu range of cues employed by a child is likely 
to include both those that the parents consider 
Immediately relevant and other cues of 
Parental behavior which the child has had 
eve opportunities to observe and to learn 
D 2s ough he has not been instructed to do 
i duod for example, when a parent punishes 
E. ds ysically for having aggressed toward 
E i € intended outcome of the training is 
TR : child should refrain from hitting 
m ho oncurrent with the intentional learn- 
eren à certain amount of incidental 
soe be expected to occur through 
ant > ae the child is provided, in the 
“ample of h parents behavior, with an ex- 

OW to aggress toward others, and 


LTHOUGH part of a child's socialization 
takes place through direct training, 


1TH. ; Ru. 
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this incidental learning may guide the child's 
behavior in later social interactions. 

The use of incidental cues by both human 
and animal subjects while performing nonimi- 
tative learning tasks is well documented by re- 
search (Easterbrook, 1959). In addition, 
studies of imitation and learning of incidental 
cues by Church (1957) and Wilson (1958) have 
demonstrated that subjects learn certain inci- 
dental environmental cues while imitating 
the discrimination behavior of a model and 
that the incidental learning guides the sub- 
jects’ discrimination responses in the absence 
of the model. The purpose of the experiment 
reported in this paper is to demonstrate that 
subjects imitate not only discrimination re- 
sponses but also other behaviors performed by 
the model. 

The incidental learning paradigm was em- 
ployed in the present study with an important 
change in procedure in order to create à situa- 
tion similar to that encountered in learning 
through identification. Subjects performed an 
orienting task but, unlike most incidental 
learning studies, the experimenter performed 
the diverting task as well and the extent to 
which the subjects patterned their behavior 
after that of the experimenter-model was 
measured. 

The main hypothesis tested is that nursery 
school children, while learning a two-choice 
discrimination problem, also learn to imitate 
certain of the experimenter's behaviors which 
are totally irrelevant to the successful per- 
formance of the orienting task. 

One may expect, on the basis of theories of 
identification (Bronfenbrenner, 1960), that 
the presence of affection and nurturance in the 
adult-child interaction promotes incidental 
imitative learning, a view to which empirical 
studies of the correlates of strong and weak 
identification lend some indirect support. 
Boys whose fathers are highly rewarding and 
affectionate have been found to adopt the 
father-role in doll play activities (Sears, 1953), 
to show father-son similarity in response to 
items on a personality questionnaire (Payne & 
Mussen, 1956), and to display masculine be- 
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haviors (Mussen & Distler, 1959, 1960) to a 
greater extent than boys whose fathers are 
relatively cold and unrewarding. 

One interpretation of the relationship be- 
tween nurturance and identification is that 
affectional rewards increase the secondary re- 
inforcing properties of the model and, thus, 
predispose the imitator to reproduce the be- 
havior of the model for the satisfaction these 
cues provide (Mowrer, 1950). Once the paren- 
tal characteristics have acquired such reward 
value for the child, conditional withdrawal of 
positive reinforcers is believed to create addi- 
tional instigation for the child to perform be- 
haviors resembling that of the parent model, 
i.e., if the child can reproduce the parent's 
rewarding behavior he can, thus, reward him- 
self (Sears, 1957; Whiting & Child, 1953). In 
line with this theory of identification in terms 
of secondary reward, it is predicted that chil- 
dren who experience a warm, rewarding inter- 
action with the experimenter-model should re- 
produce significantly more of the behaviors 
performed by the model than do children who 
experience a relatively distant and cold rela- 
tionship. 


MetHop 
Subjects 
"The subjects were 24 boys and 24 girls enrolled in 
the Stanford University Nursery School. They ranged 
in age from 45 to 61 months, with a mean age of 53 
months. The junior author played the role of the model 


for all 48 children, and two other female experimenters 
Shared in the task of conducting the study.* 


General Procedure 


Forty subjects were matched individually on the 
basis of sex and ratings of dependency behavior, and 
subdivided randomly in terms of a nurturant-non- 
nurturant condition yielding two. experimental groups 
of 20 subjects each. A small control group comprising 
8 subjects was also studied. 

In the first phase of the experiment half the experi- 
mental and control subjects experienced two nurturant 
rewarding play sessions with the model while the re- 
maining subjects experienced a cold nonnurturant re- 
lationship. For the second phase of the experiment sub- 
jests performed a diverting two-choice discrimination 
problem with the model who exhibited fairly explicit, 
although functionless, behavior during the discrimina- 


? The authors wish to express their appreciation to 

ice Beach and Mary Lou Funkhouser for their assist- 
ance in collecting the data, and to Ruth Barclay and 
Claire Korn for their help with the behavior observa- 
tions. ' 
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tion trials, and the extent to which the subjects re. 
produced the model's behavior was measured, The 
experimental and control procedures differed only in the 
patterns of behavior displayed by the model, 


Matching Variable 


Dependency was selected as a matching variable 
since, on the basis of the theories of identification, de- 
pendency would be expected to facilitate imitative 
learning. There is some evidence, for example, that de- 
pendent subjects are strongly oriented toward gaining 
social rewards in the form of attention and approval 
(Cairns, 1959; Endsley & Hartup, 1960), and one 
means of obtaining these rewards is to imitate the 
behavior of others (Sears, Maccoby, & Levin, 1957). 
Moreover, such children do not have the habit of 
responding independently; consequently they are apt 
to be more dependent on, and therefore more attentive 
to, the cues produced by the behavior of others 
(Jackubczak & Walters, 1959; Kagan & Mussen, 1956). 

Measures of subjects’ dependency behavior were ob- 
tained through observations of their social interactions 
in the nursery school. The observers recorded subjects’ 
behavior using a combined time-sampling and behavior f 
unit observation method. Each child was observed for | 
12 10-minute observation sessions distributed over | 
period of approximately 10 weeks; each observation 
session was divided into 30-second intervals, thus, 
yielding a total of 240 behavior units. 

The children were observed in a predetermined order 
that was varied randomly to insure that each child 
would be seen under approximately comparable condi- 
tions. In order to provide an estimate of reliability of 
the ratings, 234 observation sessions (4,680. behavior 
units) were recorded simultaneously but independently 
by both observers. x 

The subjects’ emotional dependency was assessed in 
terms of the frequency of behaviors that were aimed at 
securing a nurturant response from others, The fol 
lowing four specific categories of dependency behavior 
were scored: seeking help and assistance, seeking praise 
and approval, seeking physical contact, and seeking 
proximity and company of others. , 

The dependency scores were obtained by sum 
the observations made of these five different types 0 
behaviors and, on the basis of these scores, the subjects 
were paired and assigned at random to the two experi 
mental conditions. 


Experimental Conditions 

In the nonnurturant condition, the model brought He 
subject to the experimental room and after instructing 
the child to play with the toys that were spread ce " 
floor, busied herself with paper work at a desk in di 
far corner of the room. During this period the mo 
avoided any interaction with the child. dd 

In contrast, during the nurturant sessions the a 
sat on the floor close to the subject. She repon in 
readily to the child's bids for help and attention, and? 
other ways fostered a consistently warm, and rewar! 
interaction. h 

These experimental social interactions, which. P 
ceded the imitation learning, consisted of two 15-mint 
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sessions separated by an interval of approximately 5 
days. 


Diverting Task 

A two-choice discrimination problem, similar to the 

one employed by Miller and Dollard (1941) in experi- 
ments of matching behavior, was used as the diverting 
task which occupied the subjects’ attention while at the 
same time permitting opportunities for the subjects to 
observe behavior performed by the model in the ab- 
sence of any instructions to observe or to reproduce the 
Tesponses resembling that of the model. 
i The apparatus consisted of two small boxes, identical 
in color (red sides, yellow lid) and size (6" X 8" X 10"). 
The hinged lid of each box was lined with rubber strip- 
ping so as to eliminate any auditory cues during the 
placement of the rewards which consisted of small 
multicolor pictures of animals and flowers. The boxes 
were placed on small chairs approximately 5 feet apart 
and 8 feet from the starting point. 

At the end of the second social interaction session 
the experimenter entered the room with the test ap- 
paratus and instructed the model and the subject that 
they were going to play a game in which the experi- 
d peut cies a picture sticker in one of the boxes 

at the object of the game was to i 
Contained the sticker, i FP ee 
p model and the subject then left the room and 
A i pene placed two stickers in the desig- 
ih x, they were recalled to the starting point in 
be experimental room and the model was asked to take 
iden RU During the model’s trial, the subject re- 
SERA uy point where he could observe 
Enough initially it was planned to follow the pro- 
of hod ied by Miller and Dollard (1941) in which one 
ae Oxes was loaded with two rewards and the 
trial, dens pus immediately following the leader's 
Cier i ocedure had to be modified when it became 

Curing pretesting that approximately 40% of 
the subjects invariably chose th ite borfrit 
miodel even thws d chose the opposite box from the 
sistently esda e nonimitative response was con- 
study of imitat $ McDavid (1959), ina recent 
E ative behavior in preschool children, en- 
su Similar difficulties in that 44% of his sub- 
subjects aem iine ths jade even though the 
“or Was not eed as to whether the leader was 
n order to overcome this stereot: imitati 
E Pi yped nonimitation, 
| Bone placed two rewards in a single box, but 
det the ix model’s trial the model and the subject 
“(the eden and were recalled almost immediately 

Nun interval was approximately 5 seconds), 
loaded ray the Impression that the boxes were re- 
and the E se completed his trial, the model 
corded the i ed he room. The experimenter re- 
for the Ed. "id M and reloaded the boxes 

throughout za e noncorrection method was 

the subject n à procedure „was continued until 
itative ee earning criterion of four successive 
tion trials haa ee ation responses, or until 30 acquisi- 
it proc us n completed. The slight modification 
Proved to be effective as evidenced by the 
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fact that only 9 of the 48 children failed to meet the 
criterion. 

In order to eliminate any position habit, the right- 
left placements of the reward were varied from trial 
to trial in a fixed irregular order. This sequence was 
randomly determined except for the limitation that no 
more than two successive rewards could occur in the 
same position. 

The number of trials to criterion was the measure of 
eed imitation behavior on the discrimination 
task. 

Although the establishment of imitative choice re- 
sponses was, in itself, of some theoretical interest, the 
discrimination problem was intended primarily as an 
orienting or distraction task. Thus, on each discrimina- 
tion trial, the model exhibited certain verbal, motor, and 
aggressive behaviors which were totally irrelevant to 
the performance of the task to which the subject's at- 
tention was directed. At the starting point, for example, 
the model remarked, *Here I go," and then marched 
slowly toward the box containing the stickers repeating, 
“March, march, march." On the lid of each box was a 
small rubber doll which the model knocked off aggres- 
sively when she reached the designated box. She then 
paused briefly, remarked, “Open the box,” removed one 
sticker and pasted it on a pastoral scene that hung on 
the wall immediately behind the boxes. The model 
terminated the trial by replacing the doll on the lid of 
the container. The model and the subject then left the 
room briefly. After being recalled to the experimental 
room the subject took his turn, and the number of the 
model's behaviors reproduced by the subject was re- 
corded. 


Control Group j 

In addition to the two experimental groups, a con- 
trol group, consisting of eight subjects, comparable to 
the experimental groups in terms of sex distribution, 
dependency ratings, and nurturant-nonnurturant ex- 
periences was studied. Since the model performed highly 
novel patterns of responses unlikely to occur inde- 
pendently of the observation of the behavior of the 
model, it was decided to assign most of the available 
subjects to the experimental groups and only a small 
number of subjects to the control group. 

The reasons for the inclusion of a control group were 
twofold. On the one hand, it provided a check on 
whether the subjects’ behavior reflected genuine imita- 
tive learning or merely the chance occurrence of be- 
haviors high in the subjects’ response hierarchies, 
Second, it was of interest to determine whether the 
subjects would adopt certain aspects of the model’s 
behavior that involved considerable delay in reward. 
With the controls, therefore, the model walked to the 
box, choosing a highly circuitous route along the sides 
of the experimental room; instead of aggressing toward 
the doll, the model lifted it gently off the container and 
she left the doll on the floor at the completion of a trial. 
While walking to the boxes the model repeated, “Walk, 


walk, walk.” 
Imitation Scores 


On each trial the subjects’ performances were scored 
in terms of the following imitation response categories: 
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selects box chosen by the model; marches; repeats the 
phrases, “Here I go,” “March, march,” “Open box,” or 
“Walk, walk”; aggresses toward the doll; replaces doll on 
box; imitates the circuitous route to the box. 

Some subjects made a verbal response in the appro- 
priate context (for example, at the starting point, on the 
way to the box, before raising the lid of the container) 
but did not repeat the model’s exact words. These 
verbal responses were also scored and interpreted as 
partially imitative behavior. 

In order to provide an estimate of the reliability of 
the experimenter’s scoring the performances of 19 
subjects were scored independently by two judges who 
alternated in observing the experimental sessions 
through a one-way mirror from an adjoining observa- 
tion room. 


RESULTS 


Reliability of Observations of Dependency Be- 
havior 


The reliability of the observers’ behavior 
ratings was estimated by means of an index of 
agreement based on the ratio of twice the 
number of agreements over the combined 
ratings of the two observers multiplied by 100. 
Since small time HOR m due to in- 
evitable slight asynchronism of the observers’ 
timing devices, were expected, a time dis- 
crepancy in rating a given behavior category 
greater than two 30-second intervals was in- 
terpreted as a disagreement. 

The interobserver reliabilities for the de- 
pendency categories considered separately 
were as, follows: Positive attention seeking, 
84%; help seeking, 72%; seeking physical 
contact, 8476; and seeking proximity, 75%. 


Reliability of Imitation Scores 


The percentage of agreement in scoring 
imitative behavior in the experimental sessions 
is presented in Table 1. Except for other imita- 
tive responses, the subjects’ behavior was scored 
with high reliability and, even in the letter re- 


TABLE 1 
Scorer RELIABILITY or IurrATIVE RESPONSES 
Response Category AEN 

Aggression 

Marching 7 
Imitative verbal behavior 80 
Partially imitative verbal behavior 83 
Other imitative responses 50 
Replaces doll 99 
Circuitous route 96 
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TABLE 2 

AMOUNT OF IMITATIVE BEHAVIOR DISPLAYED qy 
SUBJECTS IN THE EXPERIMENTAL AND 
CONTROL Groups 


= 
Experimental 
ubjects 
N —40 
Response Category Pax 
centage AED 
tating Trial 
Behaviors of experi- 
mental model 
Marching 45 | .23 
Verbal responses 28 | .10 
ion 90 +64 
Other imitative re-| 18 | .03 
sponses 
Partially imitative 43 | .11 
R an a 90 60 
epla lo] d 
Beli elotel Lol control 
mod 
Circuitous route 0 0 
Verbal responses 0 0 


Note.—The mean number of trials for subjects in the experi 
mental group (13.52) and in the control group (15.25) did not 
differ significantly. 


arose primarily from the experimenter’s lack 
of opportunity to observe some of the b 
haviors in question rather than from differents 
of interpretation, for example when the subject 
made appropriate mouth movements bil 
emitted no sound while marching toward the 
containers, this partial imitation of the models 
verbalizations could not be readily observ 
by the experimenter (who was at the startini 
point) but was clearly evident to the rate 
the observation room. 


Incidental Imtiation of Model’s Behavior 


Since the data disclosed no significant s% 
differences, the imitation scores for the P di 
and female subgroups were combined in t 
statistical analyses. | 

Ninety percent of the subjects in a a 
perimental groups adopted the model's 4 
gressive behavior, 45% imitated the marchi 
and 28% reproduced the model’s verbal f 
tions. In contrast, none of the control subje j 
behaved aggressively marched or verbalize 
while 75% of the controls and none of M B 
perimental subjects imitated the circu 


3 One subject in the control group hit the doll o! 
box on one trial only. 
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F route to the containers. Except for replacing 
the doll on the box, which was performed by 
most of the experimental and control subjects, 
there was no overlap in the imitative behavior 
displayed by the two groups (see Table 2). 
While the control subjects replaced the doll 
on the box slightly more often than the subjects 
in the experimental group, this difference 
tested by means of the median test was not 
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TABLE 3 
SIGNIFICANCE OF DIFFERENCES IN IMITATIVE BEHAVIOR 
EXHIBITED BY SUBJECTS IN THE NURTURANT AND 
NONNURTURANT EXPERIMENTAL CONDITIONS 


statistically significant (x? = 1.49; df = 1). 
Evidently the response of replacing things, 
undoubtedly overtrained by parents, is so well 
established that it'occurs independently of the 

- behavior of the model. Since this was clearly a 
nonimitative response, it was not included in 
the subsequent analyses. 

To the extent that behavior of the sort 
evoked in this study may be considered an 
elementary prototype of identification, the 
results presented in Table 2 add support to 
the interpretation of identification as a process 
of incidental imitative learning. 


Effects of Nurturance on Imitation 


In order to make comparable the imitation 
Scores for the subjects who varied somewhat in 
the number of trials to criterion, the total 
imitative responses in a given response cate- 
gory were divided by the number of trials, 
Since only a’small number of subjects in the 
nonnurturant condition displayed imitative 
honaggressive behavior and the distributions of 
“Scores were markedly skewed, the sign test 
Was used to estimate the significance of differ- 
ences between the two experimental groups. 
P predicted facilitating effect of social 
s ps dm Imitation was essentially confirmed 
E e 3). Subjects who experienced the 
E dis interaction with the model marched 
qe n alized imitatively, and reproduced 
is Scam resembling that of the model 
Er extent than did the subjects who 
ré the relatively cold and distant re- 
readily m Aggression, interestingly, was 
quali » ated by subjects regardless of the 

Y of the model-child relationship. 


Imitati E 

ordi of Discrimination Responses 

1 FENE analysis of variance (McNemar, 

Shag e XVII) of the trials scores failed to 
ny Significant effects of nurturance or 


Sex of s 
mutator on the imitation of discrimina- 


Number of Sub- 
jects Imitating 
Response Category Non- ? 
N 
Yee 
9) | 20) 
Nonaggressive behaviors 15 J; 04 
Marching 13 5 +05 
Verbal behavior 9 2 +05 
Other imitative responses 6 1 +06 . 
Aggressive behavior 20 16 ns 
Partially imitative verbal re- | 12 5 +04 
sponses 


Note.—The two groups of subjects did not differ in the mean 
number of trials to criterion. The means for subjects in the nurtu- 
rant and nonnurturant conditions were 13.75 and’ 13,30, respec- 
tively. 


TABLE 4 


ANALYSIS OF VARIANCE OF SuBJECTS TRIALS SCORES 
ON THE DISCRIMINATION LEARNING TASK 


F 
Tf Vari- 
Source of Variance | df | fsi | F D 
mate 
Sex 1|294 | 4.20 | .102 p> .05 
Nurturance 1 4 «1 ns 
Sex X 1| 158 | 1.45 ns 
Nurturance 
Matched pairs | 14] 70 
Remainder 14 | 109 


Note.—One subject who refused to continue the task before he 
reached the learning criterion and three subjects who could be run 
for only 20 trials had to be excluded from this analysis. The results, 
therefore, are based on 32 matched pairs. 


tion responses (see Table 4), nor did the two 
groups or experimental subjects differ signifi- 
cantly in the number of trials in which they 
imitated the model’s choice or in the number of 
trials to the first imitative discrimination re- 
sponse. 

While nurturance did not seem to influence 
the actual choices the subjects made, it never- 
theless affected their predecision behavior. A 
number of the children displayed considerable 
conflictful vacillation, often running back and 
forth between the boxes, prior to making their 
choice. In the analysis of these data, the vacil- 
lation scores were divided by the total number 
of trials, and the significance of the differences 
was estimated by means of the sign test since 
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the distribution of scores was markedly 
skewed. The results of this test revealed that 
the subjects in the nurturant condition ex- 
hibited more conflictful behavior than subjects 
in the nonnurturant group (p = .03). This 
finding is particularly noteworthy considering 
that one has to counteract a strong nonimita- 
tion bias in getting preschool children to follow 
a leader in a two-choice discrimination problem 
as evidenced by McDavid's (1959) findings as 
well as those of the present study (i.e., 75% of 
the subjects made nonimitative choices on the 
first trial). 


Dependency and Imitation 


Correlations between the ratings of depend- 
ency behavior and the measures of imitation 
were calculated separately for the nur- 
turant and nonnurturant experimental sub- 
groups, and where the correlation coefficients 
did not differ significantly the data were com- 
bined. The expected positive relationship be- 
tween dependency and imitation was only 
partially supported. High dependent subjects 
expressed more partially imitative verbal be- 
havior (re = .60; p < .05) and exhibited more 
predecision conflict on the discrimination task 
(r = .26; p = .05) than did subjects who were 
rated low on dependency. 

Dependency and total imitation of nonag- 
gressive responses was positively related for 
boys (r, = .31) but negatively correlated for 
girls (r, = — .46). These correlations, however, 
are not statistically significant. Nor was there 
any significant relationship between depend- 
ency and imitation of aggression (r = .20) or 
discrimination responses (r = — .03). 


Discussion 


The results of this study generally substanti- 
ate the hypotheses that children display a good 
deal of social learning of an incidental imitative 
sort, and that nurturance is one condition 
facilitating such imitative learning. " 

The extent to which the model's behayior 
had come to influence and control the behavior 
of subjects is well illustrated by their march- 
ing, and by their choice of the circuitous route 
to the containers. Evidence from the pretest- 
ing and from the subjects' behavior during the 
early discrimination trials revealed that dash- 

ing toward the boxes was the dominant re- 
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sponse, and that the delay produced by march- 
ing or by taking an indirect route that more 
than doubled the distance to the boxes was 
clearly incompatible with the subjects’ eager. 
ness to get to the containers. Nevertheless, 
many subjects dutifully followed the example 
set by the model. 

Even more striking was the subjects' imita- 
tion of responses performed unwittingly by the 
model. On one trial with a control subject, for 
example, the model began to replace the doll 
on the box at the completion of the trial when 
suddenly, startled by the realization of the 
mistake, she quickly replaced the doll on the 
floor. Sure enough, on the next trial, the sub- 
ject took the circuitous route, removed the 
doll gently off the box and, after disposing of 
the sticker, raised the doll, and then quickly 
replaced it on the floor reproducing the model's 
startled reaction as well! | 

The results for the influence of nurturance on 
imitation of verbal behavior are in accord with 
Mower’s (1950) autism theory of word leam- 
ing. Moreover, the obtained significant effect of 
nurturance on the production of partially 
imitative verbal responses indicates that nur 
turance not only facilitates imitation of the 
specific behaviors displayed by a model but 
also increases the probability of responses ofa 
whole response class (for example, verbal be 
havior). These data are essentially in agree 
ment with those of Milner (1951), who found 
that mothers of children receiving high reading 
readiness scores were more verbal and afiec 
tionately demonstrative in the interactions 
with their children than were the mothers 0 
subjects in the low reading ability group. 

That the incidental cues of the model’s € 
havior may have taken on positive valence 87 
were consequently reproduced by subjects d 
the mere satisfaction of performing them, P 
suggested by the fact that children in the nut 
turant condition not only marched to the coti 
tainers but also marched in and out of the 
experimental room and marched about m the 
anteroom repeating, “March, march, mare ie 
etc., while waiting for the next trial. whit 
certain personality patterns may be, thus, i 
cidentally acquired, the stability and Pe " 
sistence of these behaviors in the absence 4 
direct rewards by external agents remains 
be studied. 


onse cannot be readily imitated unless 
mponents are within the subjects be- 
repertoire. The fact that gross motor 
ses are usually more highly developed 
erbal skills in young children may ex- 
why subjects reproduced the model’s 
ng (p = .05) and aggression (p < .001) 
gnificantly greater extent than they did 
bal behavior. Indeed several subjects 
the motor component of speech by 
ning the appropriate mouth movements 
emitted no sound. The greater saliency of 
nodel’s motor responses might also be a 
ble explanation of the obtained differ- 


htification with the aggressor” (Freud, 
or “defensive identification” (Mowrer, 
990), whereby a child presumably transforms 
from object to agent of aggression by 
ting the attributes of an aggressive, puni- 
model so as to allay anxiety, is widely 
pted as an explanation of the imitative 
of aggression. The results of the 
ent study, and those of a second experi- 
it now in progress, suggest that the mere 
tion of aggressive models, regardless of 
quality of the model-child relationship, is a 
nt condition for producing imitative 
p in children. A comparative study of 
S imitation of aggressive models who 
[ea ed, who are liked and esteemed, or who 
€ or less neutral figures would throw 
ight on whether or not a more parsi- 
S theory than the one involved in 
tion with the aggressor” can ex- 
e modeling process. 
ugh the results from the present study 
evidence that nurturance promotes 
tal imitative learning, the combination 
nce followed by its withdrawal would 
Cted, according to the secondary rein- 
P theory of imitation, to furnish 
Incentive than nurturance alone for 
to Teproduce a model's behavior. It 
ossible that dependency may be essen- 
3 lated to imitation under conditions 
E nurturance, but may emerge as a 
facilitating imitation under conditions 
Scial reinforcers are temporarily with- 


Experiment reported in this 
MÀ TE paper 
9n immediate imitation in the presence 
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of the model. A more crucial test of the trans- 
mission of behavior through the process of 
social imitation involves the generalization of 
imitative responses to new situations in which 
the model is absent. A study of this type, in- 
volving the delayed imitation of both male and 
female aggressive models, is currently under 
way. 


SUMMARY 


The present study was primarily designed 
to test the hypotheses that children would 
learn to imitate behavior exhibited by an ex- 
perimenter-model, and that a nurturant in- 
teraction between the model and the child 
would enhance the secondary reward proper- 
ties of the model and thus facilitate such 
imitative learning. 

Forty-eight preschool children performed a 
diverting two-choice discrimination problem 
with a model who displayed fairly explicit, al- 
though functionless, behaviors during the 
trials. With the experimental subjects the 
model marched, emitted specific verbal re- 
sponses, and aggressed toward dolls located on 
the discrimination boxes; with the controls 
the model walked to the boxes choosing a 
highly circuitous route and behaved in a non- 
aggressive fashion. Half the subjects in the 
experimental and control groups experienced 
a rewarding interaction with the model prior 
to the imitative learning while the remaining 
subjects experienced a cold and nonnurturant 
relationship. 

The following results were obtained: 

1. The experimental and the control sub- 
jects not only reproduced behaviors resembling 
that of their model but also, except for one 
response category, did not overlap in the types 
of imitative responses they displayed. i 

2. The predicted facilitating effect of social 
rewards on imitation was also confirmed, the 
only exception being for aggression, which was 
readily imitated by the subjects regardless of 
the quality of the model-child relationship. i 

3. Although nurturance was not found to in- 
fluence the rate of imitative discrimination 
learning, subjects in the nurturant condition 
exhibited significantly more predecision con- 
flict behavior than did subjects in the non- 
nurturant group. 
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DEFENSIVENESS AND SELF-ACCE 


PTANCE IN THE MANAGEMENT 


OF HOSTILITY* 


University 


NCE aroused, the hostile impulse may 
O: expressed in diverse ways. It may 
lead to overt expression in verbal or 
physical attack, may be expressed only in 
fantasy, or completely denied expression. 
Aggression may be displaced toward targets 
apparently unconnected with the arousal of 
- the impulse. In the search for the determinants 
of the reactions to the hostile impulse, in- 
vestigators have turned to the socialization 
Process and to personality theories. Child 
‘Tearing practices which are predicted to 
produce specific kinds of hostile responses 
have been investigated (Bornston & Coleman, 
1956; Child, 1954; Sears, Maccoby, & Levin, 
1957). Hypotheses relating ego control (Block 
& Martin, 1955; Livson & Mussen, 1957), 
self-ideal (SI) discrepancy (Rothaus & Wor- 
chel, 1960; Worchel, 1958), and anxiety 
(Doris & Sarason, 1955) to aggression have 
, been tested, In general, these investigations 
have yielded significant but low correlations 
and have been limited by the circumscribed 
| and ambiguous nature of the response meas- 
_ utes employed. 
Hostility has been used to refer to drive, 
“notion, attitude, and overt response, and 
. Stüdies often fail to specify what component 
of hostility is being measured. In addition, 
ostility arousal and reduction have usually 
' Pico immediately after the inter- 
pon of Some experimental variable; the 
; T of time itself has been neglected. 
oe observations suggest that time may 
“feelin NU ent consideration affecting hostile 
‘blo oe and attitudes differentially. People 
Pe act an thn anger est 
Bus es of dislike and rejection may 
n unchanged (Newcomb, 1947). It is 
E of the present study to subject 
E. servations to experimental verification 
extend knowledge of the role of per- 
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sonality in the management of hostile feelings 
and attitudes. Four types of personalities 
based upon self-concept theory (Rogers, 
1951) are described. Hypotheses are then 
derived relating defensiveness and self- 
acceptance to the management of hostility. 

Self-theory contends that the structure of 
the self is formed as a result of direct experi- 
ence with the environment as well as evalu- 
ational interactions with others, Since hostility 
is a major product of the inevitable frustra- 
tions of living, the attitudes and controls 
imposed by socializing agents upon the ex- 
pression of hostility play an influential role in 
the development of the self-structure. For 
some individuals, all forms of hostile behavior 
have been met with severe counteraggression, 
To maintain an accepting self-picture, these 
persons, as Rogers (1951) describes, are 
compelled to deny any hostile experience. 
Their self- and ideal-concepts remain con- 
gruent, and a façade of self-acceptance is 
presented but at the expense of the continual 
repression of hostility. Block and Thomas 
(1955) and Hatfield (1958) have shown, for 
example, that persons with low SI discrepan- 
cies tend to use repressive defenses. Also, 
Altrocchi, Parsons, and Dickoff (1960) have 
demonstrated that repressors manifest smaller 
SI discrepancies than sensitizers. 

The healthy or adjusted personality “exists 
when the concept of the self is such that all the 
sensory and visceral experiences of the or- 
ganism are, or may be, assimilated on a 
symbolic level into a consistent relationship 
with the concept of self" (Rogers, 1951, p. 513). 
Hostile feelings are accepted and the resultant 
behavior would “probably be at times social 
and at other times aggressive" (p. 502). 
Here, also, the SI picture is one of congruence 
but without the operation of defensive pat- 
terns. 

There are persons, however, whose early 
development has led to rejecting self-pictures. 
Their attempts to attain internalized ideals 
have met with parental criticism and punish- 
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ment. Their self- and ideal-concepts are 
highly incongruent, and they present a picture 
of anxiety, tension, and depreciation. Author- 
ity figures are perceived as threatening and 
they are unable to retaliate for fear of punish- 
ment. Hostility can be displaced to "scape- 
goats” who are unable to retaliate. As a result 
of stimulus-generalization, aggression-anxiety 
is experienced (Miller, 1948), leading to 
further depreciation and tension. These 
subjects may be described as nondefensive 
with high SI discrepancy. Worchel (1958), 
for example, has shown that subjects with 
high SI discrepancy express significantly 
fewer direct aggressive responses towards an 
authority figure than those with low SI 
discrepancy. In a later study, Rothaus and 
Worchel (1960), using a questionnaire describ- 
ing arbitrary and nonarbitrary frustrations, 
found that such subjects gave more hostile 
actions toward a hypothetical frustrating 
agent than those with low SI discrepancy. In 
this case, however, there was no danger of 
retaliation. Worchel has just completed a 
study in which, following the experimental 
induction of frustration by a professor, sub- 
jects with high SI discrepancy expressed 
stronger hostility towards “innocent” targets 
than those with low discrepancy. 

Persons with high SI discrepancy can 
defend themselves against anxiety and tension 
by rationalizing and distorting their failure to 
attain the idealized goals. They can set up a 
facade of "doing as well as others." They 
perceive their behavior as above reproach 
as far as the social group is concerned. Aggres- 
sion displaced towards innocent scapegoats 
does not arouse aggression-anxiety in this 
case since justification is found for such 
attack. They distort the situation thus: "it is 
not my father (or other authority figure) who 
makes me angry, but it is who is 
irritating." 

To summarize, four types of personalities 
are derived from Self-Concept theory: low 
defensive, low SI discrepancy (adjustive); 
high defensive, low SI discrepancy (repressive); 
low defensive, high SI discrepancy (anxious); 
and high defensive, high SI discrepancy 
(distorters). 

If self-acceptance and defensiveness interact 
to influence the management of hostility as 
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described above, then the following predictions 
are implied: 

1. Subjects with low defensive, low SI 
discrepancy (adjustive) will express the 
strongest feelings of anger and hostile atti- 
tudes, while high defensive, low SI discrep- 
ancy subjects (repressive) will express the 
least anger and hostile attitudes of all the 
groups. 

2. Subjects with low defensive, high SI 
discrepancy (anxious) will indicate the most 
aggression-anxiety, while low defensive, low 
SI discrepancy subjects (adjustive) will 
indicate the least aggression anxiety of all 
the groups. 

3. Subjects with high SI discrepancy will 
displace more hostility than subjects with low 
SI discrepancy. | 

4. Feelings of anger will tend to decrease 
during a delay period following hostility 
arousal, while hostile attitudes will tend to | 
persist undiminished. 

METHOD 
Subjects 


Eighty undergraduate males were drawn from an 
introductory psychology course on the basis of the 
Worchel (1958) Self-Activity Inventory (SAI) and the 
K scale items from the MMPI (Meehl & Hathaway, 
1956). SAT served as a measure of SI discrepancy, an 
the K scale as an index of defensiveness. The subjects 
were dichotomized with respect to each of these tests, 
using cutting points of 45.5 on the SAI and the 145 
on the K scale. Four groups of 20 subjects were thus 
obtained: low SI discrepancy and low defensiveness 
(LSLLK), high SI discrepancy and low defensiveness 
(HSI-LK), low SI discrepancy and high defensiveness 
(LSI-HK), and high SI discrepancy and high defensive: 
ness (HSI-HK). 

Ten subjects in each of these four groups were 
assigned at random to one of two experimental condi d 
tions. In the nondelay (ND) condition the hostility 
measures were administered immediately after hostility 
arousal. In the delay (D) condition hostility measure 
ment was delayed 20 minutes. Three replications 0 
design were completed. The subjects were distribut B 
as evenly among the sessions as their schedules Pe 
mitted; no session included less than 10 or more t 7 
16 subjects. Subjects in each session were drawn i 
different psychology sections to eliminate 500 
structuring present in the original classes. 


Procedure 


The initial phase of the experiment was the ee 
for all sessions, The examiner told the subjects aits 
name, that he was a professor, that the test res 
would go on their permanent records, and tha 


scores would be used for vocational guidance, scholar- 
- ship recommendations, and admission to graduate 
school, 

The University of Texas Intelligence Test was then 
- administered. Each of its 10 subtests was given with 

time limits of 30-90 seconds, The examiner used a 
stop watch to time the tests and made frequent dis- 
` tracting remarks to the subjects about their slowness, 
the simplicity of the questions, and the penalty for 
guessing wrongly. They were told that most people 
- should finish within the allotted time. The time limits 
were, in fact, too short for any subject to complete the 
subtests. Thus failure, distraction, and insult con- 
“stituted the frustrating situation, Spontaneous 
comments of the subject during and after the experi- 
ment indicated considerable frustration and annoyance 
as a result of this treatment. 

The examiner left the room “to score the tests” 
- as soon as the "intelligence test" was completed. At no 
time was the examiner aware of the experimental 
"treatment (delay) for any particular group he tested. 
The examiner’s assistant distributed the next set of 
answer sheets as quickly as possible to prevent any 
verbal interaction among subjects. 

In the second phase of each session, the ND groups 
took the posttreatment tests immediately. The D 
-Broups completed a 20-minute word association task 
_ before taking the final battery of tests. The interpolated 
task consisted of writing free association responses to 
the Kent-Rosanoff list of 100 words presented orally 
by the assistant. Since this list contains words chosen 
for their lack of emotional content, it was assumed that 
i this task would be relatively neutral in its cathartic, 

E arousing, or hostility provoking aspects. At the 

the of each session the subjects were told the nature of 

d Ese and were asked not to discuss it until 
d ions had been completed. 


Dependent Variables 


The last set of instruments given i i 
1 given in all sessions 
formed a booklet of four tests? from which six dependent 
ariables were derived. 
RR This instrument served as an index of 
s R subject was willing or able to characterize 
A Emotional state at the time of testing. A list of 
E Jectives, each followed by a five-point rating 
j im presented with the following instructions: 
tse eee testing produces various feelings in 
Bu: Rs tested. This questionnaire does not have 
a oe or Wrong answers; you are asked only to 
Place your own feelings as accurately as possible. 
i a check mark after each adjective so as to 


describe how you feel at the present time. 


* The fi i 
deposited with ei. used in the present study have been 
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The scores for the adjectives angry, irked, and 
annoyed were summed for the Direct Anger score. 

Projected Anger. Since the self-report measure of 
anger might be influenced by inhibitory mechanisms, 
an identical version of the adjective list was included, 
but with the following instructions: 

"The ability to get along with people is related to 

the ability to understand how most people react to a 

situation, even if one's own reactions are somewhat 

different. Here you are asked to fill out the test the 
way you think most people would feel now after an 
intelligence test similar to the one you just took. 

A comparable Projected Anger score was derived 
from the responses to the same three adjectives used for 
Direct Anger. Internal consistency estimates of the 
reliability of the score for Direct Anger and Projected 
Anger, based on the average item-intercorrelations, 
were .78 and .89, respectively. 

Hostile Attitudes. A test of Insight and Social 
Sensitivity included 10 items asking for an appraisal of 
the examiner. Each statement was followed by a six- 
point rating scale, and the subject was asked to indicate 
the extent of his agreement or disagreement with each 
statement. Two components of hostile attitudes were 
assessed separately: rational and irrational aggression. 
Rational aggression (Examiner Blame) was defined as 
“blaming the examiner" for the subject’s poor per- 
formance since, in reality, he did distract, interrupt, 
and upset the subject. Irrational aggression (Examiner 
Devaluation) consisted of statements condemning the 
examiner on characteristics which had little or nothing 
to do with the frustrating situation. Examiner Blame 
score was the sum of the agreement scores for five 
statements which blamed the examiner for errors made 
by the subject on the “intelligence test" (e.g, “The 
examiner was to blame for some of the errors I made"). 
The Examiner Devaluation score was similarly derived 
from five statements concerning various aspects of the 
examiner's personality (e.g., “The examiner appears to 
be a dependable person”). Internal consistency esti- 
mates of the reliability of these scores were .80 and 
.87, respectively. i 

Displaced Hostility and Aggression-Anxiely. Ten 
stems from a sentence completion test developed by 
Zimmer (1959) were selected to measure these aspects 
of the hostile reaction. The responses were scored 
accordingly to Zimmer’s manual, A more detailed 
description of the method and instruments used is 
available elsewhere (Veldman, 1960). 


RESULTS AND DISCUSSION 


Since the six measures were obtained from 
the same subject, interactions may occur to 
confound the results, It is also possible that, 
regardless of the apparent face validity of the 
measures the tests are all really tapping the 
same variable. Intercorrelations between vari- 
ables, however, are all less than .37 with the 
exception of that between Direct and Pro- 
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TABLE 1 


MEANS or Aut SUBGROUPS ON MEASURES OF Hostire FEELINGS AND ATTITUDES FOR THE DELAY 
(D) anp NoxpELAY (ND) CONDITIONS 


LSI-LK HSLLK LSLHK HSI-HK 
Subgroup 

ND D ND D ND D ND D 
Direct Anger 9.4 7:9 153. 7.5 6.8 5.3 8.0 713 
Projected Anger 9.0 7.7 7.9 1.0) 8.6 6.4 9.6 6.6 
Examiner Blame 25.8 19.6 21.9 21.5 19.8 20.3 22.9 19.1 
Examiner Devaluation 156 14.2 12.5 14.9 13.9 12.4 13.3 14.7 
Displaced Hostility 27 2.9 3.8 2.9 2.4 D: 3.2 3.3 
Aggression-Anxiety 2.4 2.4 3.7 2.8 2.6 3.9 3.4 3.1 

TABLE 2 


jected Anger, which as would be expected, is 
somewhat higher (.59).3 

The means of each of the eight experimental 
groups are presented in Table 1. Analyses of 
variance (SI X K X D) were carried out for 
each of the six dependent variables and the 
results are presented in Tables 2 and 3. 

Hostile Feelings. As predicted, the inter- 
action of K X SI for Direct Anger (see Table 
2) is significant at the .05 level (F = 4.93). 
Whereas the mean scores (regardless of delay) 
for the two high SI groups (low and high 
defensiveness) are practically identical (7.4 
and 7.7, respectively), the means of the low 
SI groups are radically different. The greatest 
expression of anger is produced by the adjusted 
(8.7) and the least anger is expressed by the 
repressive subjects (6.4). Defensiveness, as 
measured by the K scale, is an important 
determinant of the direct expression of anger, 
particularly with subjects of low SI discrep- 
ancy. Taken alone, defensiveness is almost 
significant (F = 3.35) at the .05 level (F of 
3.98 required for .05 level). 

Delay does not seem to be a significant 
variable in the direct expression of anger, In 
the projective expression of anger, however, 
delay is significant (see Table 2). Subjects in 
the D groups projected less anger than those 
in the ND groups. 

Defensive subjects would be expected to 
reveal more anger in the projective than in the 


* A 1-page table giving the intercorrelations of all 
six variables has been deposited with the American 
Documentation Institute. Order Document No. 6865 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress; Washington 
25, D. C., remitting in advance $1.25 for microfilm or 
$1.25 for photocopies. Make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 


ANALYSIS OF VARIANCE OF HOSTILE FEELINGS 
(Anger) AND Atrirupes (Examiner Blame) 
FOR ALL GROUPS 


Direct Projected | Examiner 
Source of A Anger Anger Blame 
variation aoe 
us| F|MS| F | MS| P 
Delay (D) 1 |15.31 | 1.86 |52.81 | 6.63* 122.51] 3.16 
Defensive (K) 1 27.61 | 3.35 | 2.11 | .27 | 56.11] 1.72 
Self-ideal (SI) 1 | .61| .07| .11] OL) 0 — 
DXK 1 | 1.01} .12 |19.01 | 2.39 | 13.61) 4 
D XSI 1 | 7.81] .95| .31| 04| 281. 49 
KXSI 1 |40.61 | 4.93*| 5.51 | -69 | 19.01 S 
DXKXSI 1 |no| .12 | 5.51 | .69 127.51) 390 
Within 72 | 8.25 7.91 32.57 


* Significant at the .05 level. 


direct test. That this is partly true is shown by 
comparing the differences between the tio 
techniques for the high and low defensive 
groups regardless of SI discrepancy. The mean 
difference of 1.7 for the high defensive groups 
under the ND condition is significant at the 
.02 level (¢ = 2.58), In the projective tech 
nique, these subjects expressed greatet anget 
than in the direct version. For the low defen 
sive subjects, under both D and ND condi 
tions, there is no significant difference between 
the two versions of the test. 

Hostile Attitudes. It was predicted that th 
adjusted group would express strongest hos 
tility while the repressive group would express 
least hostility toward the examine a 
rational aggression (Examiner Blame), ; 
means in Table 1 were in the predicted diret 
tion (25.8 and 19.8, respectively) in the 


condition. The F ratios, however, show E. j 
these results are influenced by p cat 


main effect of delay is almost significan 
.05 level and the triple interaction, D 
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TABLE 3 
ysis OF VARIANCE OF DISPLACED HOSTILITY 
AND AGGRESSION ANXIETY FOR 


Att GROUPS 
Displaced ion- 
r | uno 


ofvariation | df 
MS F MS F 


‘Delay (D) 1| .01| .OL| .Oi| .Ol 
ive (K) 1| :31| 22 | 3.61 | 3.37 
(ST) 1| 6.61 | 4.74*| 3.61 | 3.37 
1|2.11 | 1.51 | 4:51 | 4.21* 
1|2.81 | 2.01 | 7.81 | 7.30** 
1| .01| .01 | 3.61 | 3.37 
X SI 1| :601 | :44| :61| .57 

72 | 1.40 1.07 


gnificant at the .05 level. 
mificant at the .01 level. 


SI, barely reaches this level of significance. 
E ner Blame is greater immediately after 
frustrating test than 20 minutes later. 
justed group which has the highest 
er Blame score in the ND condition 
shows the greatest drop in hostility 
t delay (6.2). The lowest Examiner Blame 
score was given by the repressive group 
M ), and with delay, the mean Examiner 
me score actually increased slightly (20.3). 
en where the frustrating agent is clearly 
ble for interfering with performance, 
essive subjects show far less rational 
lon than the adjusted subjects. With 
it is interesting to note the decreased 


Fe Examiner Devaluation (see Table 3), 
—.-1 represents the irrational component of 
F On, there are no significant F ratios 
5 T E the main variables or interactions. 
NE E. the adjusted subjects are highest 
‘host aggression, they express no greater 
Mty on “unreasonable” items. 
Splaced Hostility. As predicted, there is a 
nt F ratio (7.74) for the SI variable 
able 3). The high discrepancy subjects 
S B ently more hostility towards 
ie E than the low SI subjects. None 
05 Je d F ratios reached significance at 
a Vel. On the basis of these results, it 
TM expected that high SI subjects would 
à © prejudiced than low SI subjects. 
ahi AN 1S not only a function of the 
] of direct aggression but is also a 
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function of personality. For example, the 
repressors in the present experiment show no 
more displacement than the adjusted group. 

Aggression-Anxiely. Though the F ratio 
(3.37) for the interaction of K X SI (see 
Table 3) is not significant at the .05 level 
(3.98), a one-tailed test of significance is 
justified since it was predicted that the low 
defensive, high SI group (anxious) would 
show most and the low defensive, low SI 
group (adjusted) would show the least ag- 
gression-anxiety. The hypothesis is confirmed 
if delay is disregarded. The adjusted group 
expressed significantly lower aggression- 
anxiety (2.4) than any of the other three 
groups. The anxious group expressed the 
greatest aggression-anxiety in the ND con- 
dition (3.7) but in the D condition, the mean 
was 2.8. That delay is a significant factor in 
the expression of aggression-anxiety is shown 
by the ratio of 3.37 and the highly significant 
F ratio for the interaction of D X SI (7.30). 
It is interesting to note, on the other hand, 
that the repressors were the only group that 
increased in aggression-anxiety after delay 
(1.3). It may be that in time these subjects 
became somewhat aware of their hostile 
impulses. 

Delay. 'The hypothesis concerning the effect. 
of delay on feelings of anger did not come out 
as predicted with the direct technique, but 
with the projective instructions significant 
differences were obtained. The defensive 
subjects (HK) inhibit the direct expression of 
hostile feelings which tend to hide any de- 
crease over a period of time. The projective 
test gives a more accurate estimate of the 
feelings of anger since inhibitory mechanisms 
are diminished. 

As far as hostile attitudes are concerned, 
delay did seem to produce a decrease in 
Examiner Blame contrary to the prediction 
but, as the triple interaction shows, the 
decrease seems to occur primarily for the well- 
adjusted subjects (6.2) and for the high 
defensive, high SI subjects (3.8). Hostile 
attitudes do tend to persist for certain types 
of personalities, namely, the repressors and 
anxioüs subjects. 

SUMMARY 

On the basis of Self-Concept theory, four 

types of personalities were described based on 
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the nature of the defenses employed in main- 
taining the self-structure. Hypotheses con- 
cerning the management of hostility were 
derived from the interaction of SI discrepancy 
and defensiveness: Subjects with low defensive, 
low SI discrepancy (adjustive) should express 
the strongest feelings of anger and hostile 
attitudes, while high defensive, low SI subjects 
(repressive) should show the least feelings of 
anger and hostile attitudes; high SI subjects 
should displace more hostility to other ob- 
jects than low SI subjects; the low defensive, 
low SI group (adjustive) should show the 
least aggressions anxiety while the low defen- 
sive, high SI group, (anxious) should ex- 
press the most aggression-anxiety. 

Tt was also predicted that delay following 
the arousal of hostility would produce a 
decrease in feelings of anger but that negative 
attitudes would tend to persist. 

Eighty undergraduate males from an 
introductory course in psychology were 
selected on the basis of high and low scores 
onan SI inventory and the K scale (defensive- 
ness) of the MMPI. These four groups of 20 
subjects were randomly divided into an ND 
and a D group (2 X 2 X 2 design). In the 
ND condition, hostility measures of direct 
anger, projective anger, examiner blame, 
examiner devaluation, displaced hostility, 
and aggression-anxiety were administered 
immediately after a frustrating intelligence 
test, In the D condition, the same measures 
followed 20 minutes of an interpolated neutral 
task. The results showed that: 

1. On hostile feelings, the interaction of 
SI discrepancy and defensiveness was sig- 
nificant at the .05 level. The highest feelings 
of anger were expressed by the adjusted 
subjects and the least expressed by the re- 
pressors. 

2. On the expression of rational aggression 
(Examiner Blame) towards the examiner, the 
triple interaction of D X SI X K was sig- 
nificant at about the .05 level. The adjusted 
group had the highest score immediately after 
frustration but then dropped more than any 
group after 20 minutes delay. The repressors 
showed the smallest amount of rational 
aggression in the ND condition and actually 
increased slightly after delay. 

3. On irrational aggression (Examiner De- 
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valuation), there were no significant main or 
interaction effects. 

4. As predicted, the high SI group dis- 
placed more hostility than the low SI group. 

5. The anxious group showed most, while 
the adjusted group showed the least, aggres 
sion-anxiety in the ND condition. The inter- 
action of D X SI was significant, indicating 
that delay is an important factor in the ex- 
pression of aggression-anxiety particularly 
with low SI subjects. 

6. With delay, there was a significant 
decrease in feelings of anger on the projective 
test but no significant change occurred on 
the direct technique. With delay, there was a 
decrease in rational aggression primarily for 
the low defensive, low SI (adjusted) and the 
high defensive, high SI (distorters) subjects, 
For the other two groups, repressors and 
anxious subjects, examiner blame persisted 
with little change. 
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RESISTANCE TO PERSUASION CONFERRED BY ACTIVE AND 
PASSIVE PRIOR REFUTATION OF THE SAME AND 
ALTERNATIVE COUNTERARGUMENTS' 


WILLIAM J. McGUIRE 
Department of Social Psychology, Columbia University 


HE present study is one of a series 

comparing the relative effectiveness of 

various  pretreatments in making 
initially unquestioned beliefs resistant to 
change when subsequently the person is 
forced to expose himself to strong counter- 
arguments against the belief. The theoretical 
point of departure of the present and the 
earlier studies is the postulate of "selective 
exposure." Initially strong appearing beliefs, 
it is assumed, are actually quite vulnerable 
under forced exposure to strong counter- 
arguments, because such beliefs tend previ- 
ously to have been overprotected. By selective 
avoidance of counterarguments in the past, 
the person has kept his beliefs extreme, but 
has also left himself unpracticed in their 
defense and unable to deal with strong counter- 
arguments when exposure to them is forced. 

The postulate of “selective exposure" is no 
novelty in communication research. Since 
Klapper (1949) called it “the most basic 
process thus far established by research on 
the effects of mass media," it has stimulated 
much research, designed either to account for 
it in a theoretical system (Festinger, 1957) or 
to use it predictively (Janis, 1957). Its general 
relevance to the present problem is to suggest 
that beliefs can be “inoculated” against 
persuasion in subsequent situations involving 
forced exposure to strong counterarguments 
by pre-exposing the person to the counter- 
arguments in a weakened form that 
stimulates—without overcoming—his defenses 
(Lumsdaine & Janis, 1953). 

The hypotheses tested in the study reported 
here were suggested by interpretations of the 
outcomes of two previous studies. One of them 
(Papageorgis & McGuire, 1961) demonstrated 
that prior exposure to refuted counterargu- 
ments tends to make a belief more resistant 
to subsequently presented strong forms, not 

1 This study was supported, in part, by a grant from 
the Office of Social Science, National Science Founda- 


tion. A partial version of this paper was presented at 
the 1960 APA meetings in Chicago (McGuire, 1960). 
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only of the very counterarguments refuted, 
but also of novel counterarguments against 
the belief. Such generalized immunization 
could derive from either of two mechanisms, 
Pre-exposure might shock the person into 
realizing that the “truisms” he has always 
accepted are indeed vulnerable, thus provoking 
him to develop a defense of his belief, with 
the result that he is more resistant to the 
strong counterarguments when they come. 
Alternatively the refutations involved in the 
pre-exposure might make all subsequently 
presented counterarguments against the belief 
appear less impressive. The other relevant 
study (McGuire & Papageorgis, 1961) showed 
that when the subsequent attack is by the 
same counterarguments as were previously 
refuted, more resistance is conferred when the 
subject passively attends to the refutation 
than when he himself actively participates 1n 
it. That study corroborated the prediction 
from “selective exposure” that the person is 50 
little able to defend these previously un 
questioned truisms that the prior defensive 
session is not used effectively when the subject 
is called on to help refute the counterargt 
ments, Hence, the particular counterarguments 
presented are more thoroughly refuted an 
subsequently better resisted in the passive 
defense situation than in the active. 

The three main hypotheses tested in the 
present study all deal with interaction effects 
between type of prior refutation defense 4? 
whether the subsequent attack involve 
strong forms of the same counterarguments 
as were refuted or of novel counterarguments 

The first hypothesis would limit the previous 
finding (McGuire & Papageorgis, 19017 
that passive exposure to the refutations P 
more effective than active in produc 
resistance to persuasion—to situations n 
which the subsequently presented p. 
counterarguments are the very ones previous 
refuted. When, on the contrary, the subseque" 
exposure is to novel counterarguments agni 
the belief, the present hypothesis holds th?! 


p 


fense is relatively more effective 
even be absolutely more effective) 
lucing resistance. Of the two hypoth- 
mechanisms for the immunizing 
acy of the refutation defense—shocking 
‘subject into developing support for his 
by acquainting him with its vulner- 
and making subsequent counter- 
nents seem less impressive by refuting 
e earlier ones—the first is better provided 
active defense (wherein his inadequacy 
him feel more vulnerable) and the 


guments get more thoroughly re- 
ince the first mechanism is likely to 
important in providing resistance to 
forms of novel counterarguments and 
e second mechanism, to strong forms of the 
ne counterarguments, the interaction pre- 


defenses: lengthening the prior ex- 
te Session to allow both active and passive 
mse should produce greater resistance 
increment over the single prior defense when 
e subsequent strong exposure is to the same 
mntérarguments as had been refuted. Less 
lee in resistance between the single 
ouble defense is predicted when the 
went strong exposure is to novel counter- 
nts, The basis of this hypothesis is 
to that of the first one. The resistance 
cifically refuted counterarguments 
with the thoroughness of the prior 
j the resistance to novel counter- 
nents depends more on the shock value 
e Dre-exposure, which is more likely to 
lan increase with the thoroughness of 
1efütation of the counterarguments. 
third hypothesis deals with an order 
within the double defense condition: 
AVe active sequence of prior refutation 
be more effective than active-passive 
arbsequent. attack involves strong 


em the specifically refuted counter- 


d 
x 


3 while the active-passive order 
the more effective against novel 
mo Similar reasoning again 
cg r hypothesis. The active defense 

© be more effectively carried out if 
E by the passive since in this case 
~~ Goes not have to depend on his 
*r repertory of defenses when he is 
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called upon to refute actively the counter- 
arguments, but has at his disposal the ref- 
utations to which he has just passively 
attended. Hence, the counterargument to 
which he is pre-exposed should be more 
effectively refuted with the passive-active 
order, and he should show more resistance to 
these specific counterarguments. 

The active-passive order, on the other 
hand, is predicted to be a more effective 
preparation for subsequent exposure to strong 
forms of novel counterarguments. When the 
active defense comes first, the subject has 
already had the dismaying experience of 
being confronted with novel counterarguments 
which he is hard pressed to refute, but he 
has then seen in the passive defense how 
effectively these formidable-seeming counter- 
arguments can be refuted by an expert. After 
this reassuring experience, the subject is 
likely to be less dismayed by the further 
novel counterarguments used in the attack. 
He should tend to recall that the earlier 
counterarguments also appeared formidable 
when he was first called upon to refute them 
actively, and yet were shown to be quite 
refutable in the subsequent passive defense. 


METHOD 
General Procedure 


All subject’s took part in two one-hour experimental 
sessions. The first was devoted to the active and passive 
belief defenses; the second, to presenting the strong 
counterarguments against the belief. The persuasive 
aspects of the experiment were disguised by repre- 
senting the study as an attempt to obtain norms for 
analytic thinking ability in high level personnel, a 
representation given credence by some of the tasks the 
subjects were called upon to perform. 

First session. All the defenses to which this session 
was devoted consisted of a mention of two counter- 
arguments against one of the four experimental beliefs, 
together with a refutation of these counterarguments. 
The refutations took one of two forms: either the 
subject was asked to read a mimeographed message 
(about 600 words long) that mentioned two counter- 
arguments and refuted them in detail (passive defense); 
or the subject was given a sheet listing the two counter- 
arguments and told (without any specific guidance) to 
show how each of these counterarguments could be 
refuted (active defense). The subject received passive 
only, or active only, or active-then-passive or passive- 
then-active defenses on one or another of the four 
experimental issues. In the passive defense condition, _ 
the subject was given 5 minutes to read. the message 
and to select and underline the key clause in each 
paragraph (this task being introduced to bolster the 
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“analytic ability” pretext for the experiment and to 
insure attention to the material). In the active defense 
condition the subject was given 10 minutes to refute 
the two counterarguments and was told that what he 
wrote need not reflect his own opinion and could be 
based on any relevant material he found in earlier 
parts of the test. 

Second session. Two days later, the subject took 
part in the second session which was devoted to 
messages containing strong counterarguments against 
his belief. These messages were similar in appearance 
to the refutational messages used in the passive defenses 
conditions, each being about 600 words long and 
mentioning two counterarguments against the belief. 
However, while the defensive messages had been 
devoted mainly to refuting the counterarguments 
mentioned, these second session messages were devoted 
primarily to showing how the counterarguments were 
valid. Here again the subject was given 5 minutes to 
read and underline the crucial phrases of the message 
on each issue. In half the cases, the two counter- 
arguments in these strong-attack messages were the 
same two as the subject had previously seen refuted; 
in the other half of the cases, the two counterarguments 
were novel. 

After completing this “analytic reading and under- 
lining" test, the subject was given the opinion question- 
naire and asked to indicate his personal beliefs on the 
issues regardless of what materials may have been 
presented to him in the test. This questionnaire was 
justified to the subjects as needed to determine if the 
testee's attitude towards the content of the passages 
he had read affected the verbal skills in which we were 
supposedly interested, 

Afterwards the subject filled out a questionnaire 
designed to test the effectiveness of the manipulations 
and experimental conditions and was given a detailed 
explanation of the nature of the experiments and the 
deceits used. Particular effort was made to assure the 
subject that all material presented in the messages was 
designed solely to persuade him in one direction or the 
other and that no special credence should be given to 
any of the arguments simply because it appeared in 
these messages, 

Materials 

Issues, Four health issues were selected on the basis 
of earlier studies which indicated that beliefs on items 
were homogeneously extreme in the college population. 
Three of the issues were used in a previous study 
(McGuire & Papageorgis, 1961): “Everyone should get 
a chest X ray each year to detect any possible TB 
symptoms at an early stage"; “The effects of penicillin 
have been, almost without exception, of great benefit 
to mankind”; “Everyone should brush his teeth after 
every meal if at all possible.” A fourth, used for the 
first time in the present study, was: “Everyone should 
see his doctor at least once a year for a routine medical 
check-up.” 

Opinion questionnaires. Opinions on these issues were 
measured by having the subject indicate on a 15-point 
graphic scale the extent of his agreement with state- 
ments bearing on one of the issues. There were 17 


. attacking the beliefs, were similar in length and format 
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statements in all, 4 on each issue, the seventeenth | 
being a repetition of an earlier item (to serve as a 
reliability check). The opinion scores to be reported are 
the mean number of points the subject obtained on the 
four items dealing with the issue in question, 

Defensive messages? Each of the 600-word passages 
used in the passive defense consisted of three para- 
graphs. The first stated the belief and mentioned two 
counterarguments against it, together with the remark 
that each was refutable. The second and third para- 
graphs were devoted to a detailed factual refutation of 
each counterargument. There were two such messages 
on each issue (hence, eight in all) in order to counter- 
balance material and implement the design of having 
half the subjects receive, in the second session, strong 
forms of the previously refuted counterarguments, and 
the other half, novel counterarguments. These messages 
were given appropriate titles (“Some Misguided 
Attacks on Penicillin,” etc.) but not specific source 
attribution, They were mimeographed, single spaced, 
on a letter-sized sheet. 

In the active defense condition the refutationd 
material consisted of a sheet of paper with the same 
title and a brief statement of two counterarguments 
against the belief, together with instructions to write 
refutations of these counterarguments (not necessarily 
expressing one's own view and using any material one 
wished from previous parts of the test). Here agam 
there were two different forms for each issue, 
presenting an alternative pair of counterarguments to 
be refuted. 

In the double defense conditions, the subject always 
received the same pair of counterarguments for refuta- 
tion in the active as in the passive defense. 

Strong counterargument messages. These messages 


to the essays in the passive defense condition but 
differed radically in content. The first paragraph 
suggested that informed opinion has recently begun to 
question the “truism” and mentioned two countet- 
arguments against it. The next two paragraphs attacked j 
the belief by describing detailed factual evidence 
bolstering the counterarguments. Each message 
suitably titled (“Some Drawbacks Involved in the Ust 
of Penicillin,” etc.). There were two messages attacking 
the belief on each issue, the two developing alternate 
pairs of counterarguments. 


Design and Subjects 


A total of 168 subjects took part in both sessions, 5 
order to obtain data for all the comparisons involve A | 
the hypotheses and also comparable data 0n br 
needed controls, a complex design was employed. Th 


: : t 
were four types of defense (active, passive, activ 


( 
All 24 of the defensive and attacking mest) 
used in this study have been deposited with the E d 
can Documentation Institute. Order Document m 
6866 from ADI Auxiliary Publications Pro 
Photoduplication Service, Library of Congress; "- 
ington 25, D. C., remitting in advance $2.00 for ta i 
film or $3.75 for photocopies. Make checks pay? di 
Chief, Photoduplication Service, Library of Const 


passive, and passive-active), which could then be 
followed by no attack, attack with the same counter- 
arguments, or attack with novel counterarguments, 
making for 12 conditions in all. In addition, there were 
two control conditions: an attack only condition, with- 
" out prior defense, to ascertain the effect of the strong 
 counterarguments when the beliefs had not been 
—immunized; and a condition involving neither attack 
- mor defense, to ascertain the "initial" belief levels. 
- Each subject served in 4 of these 14 conditions, a 
— different issue being used in each of his 4 conditions. 
The combinations of conditions given to any one 
- subject were limited only by the requirements that all 
"subjects be given tasks taking the same amount of 
time (one hour) in each session. The number of subjects 
serving in each condition is shown in Table f; 

Since the design called for four issues and two sets 
of material on each issue, counterbalancing the material 
required eight subgroups in the 12 attack-and-defense 
conditions, and in the thirteenth (attack only) condi- 
tion; while in the fourteenth condition (neither-attack- 
nor-defense) four different subgroups were required, 
one on each issue. 

The significance levels reported in the Results 
Section are based on F tests in which the “error” 
variance is the residual variance in the conditions being 
Compared with the treatment and (where appropriate) 
individual differences variance removed. 

, The subjects were 168 students enrolled in the 
eae psychology courses at the University of 

‘ois. Almost all were either freshmen or sophomores 
and about half were males and half, females. 


Se TM C 


RESULTS 
General Effects 


_ As can be seen in the top row of means in 

1, the four defensive conditions did not 

3 i appreciably among themselves in regard 

ae direct strengthening effects on the 
. Eoo to exposure to counterarguments, 
5 he largest between-means differences, 
P p aia the active-passive and 
only at the 2 E tance is significant 
Emo the mean belief level after 
si Hr nse only treatments does not differ 
Attack asa those in the neither-defense-nor- 
91299 ontrol conditions. The overall mean 
m am fee four defense only conditions is 
mean E d superior (F « 1) to the 12.78 
Es ets * control condition. This lack of 
Eis Pens effect from the prior 
the hidd Particularly interesting in view of 
subseque E Teserves of resistance against 
as will be ee rong counterarguments which 
the belief, Iscussed below) were conferred on 
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TABLE 1 
BELIEF STRENGTH (on a 15-Point Scale) IN THE VAR- 
IOUS CONDITIONS BEFORE AND AFTER ATTACKS BY 
THE SAME OR DIFFERENT COUNTERARGUMENTS 
FROM THOSE To Wmicg THE SuByect Hap 
BEEN PRE-EXPOSED DURING THE DEFENSE 


(Number in parentheses is the number of 


cases for the cell) 

‘Type of defense condi- 
tion prior to attack Nei- | At- 
ther | tac 
Time when beliefs FREUE ume 
ara AM tive | sive | nor | prior 
tive Tive then | then | de- | dez 
pas- | ac- |fense | fense 

sive | tive 

After defense prior to | 12.94| 12.75| 12.57| 13.37 
attack (48) | (48) | (24) | 04) 

After defense, plus 10.66] 11,47] 12.15] 12.18] 12.78] 8.60 
attack by same (48) | (48) | (48) | (48) | (96) | (49) 
counterarguments 

After defense, plus 11.42| 10.62| 10.92| 10.71 
attack by novel (48) | (48) | (48) | (48) 
counterarguments 


Eee 


The beliefs were considerably weakened 
(to a mean of 8.60) by the strong counter- 
arguments when these had not been preceded 
by any defense. This drop of 4.18 points from 
the control level is significant at the .001 
level. The impact of these strong counter- 
arguments was considerably lessened when 
they were preceded by a defensive treatment, 
The overall mean belief in the four defensive 
conditions after the defenses and strong 
attacks was 11.27. This does represent a 1.51 
point drop from the control level (p < .001), 
but it leaves the beliefs well above the 8.60 
point level to which they were reduced when 
the strong attack had not been preceded by a 


defense. 


Comparative Immunizing Effectiveness of the 
Defenses 

Single defense conditions. The first hypothesis 
dealt with an interaction effect in the single 
defense condition: that while passive pre- 
exposure to refuted counterarguments tends 
to be superior to active in conferring immunity 
to subsequent strong forms of the same 
counterarguments, the active tends to be 
superior (or at least less inferior) to the 
passive in conferring resistance against novel 
counterarguments. As can be seen by com- 


paring the four cells in the lower-left quadrant 
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of Table 1, this prediction was substantially 
confirmed. The interaction effect is in the 
hypothesized ^ direction—passive ^ defense 
superior against the same counterarguments, 
and active, against novel—and significant on 
the .02 level. 

The nature of this interaction effect can be 
seen more analytically if we examine the 
comparative effectiveness of active and 
passive defense against the same and novel 
counterarguments, separately. With respect 
to immunization against strong forms of the 
same counterarguments, we find the final 
belief levels in the passive and active con- 
ditions to be 11.47 and 10.66, respectively. 
The difference between these subgroups is 
significant only at the .08 level, but cor- 
roborates the more significant result of the 
McGuire and Papageorgis (1961) study. 
Against novel counterarguments, we find the 
final belief levels in the passive and active 
defense condition to be 10.62 and 11.42, 
respectively. Since this difference is significant 
only at the .13 level, it remains to be shown 
definitively that the active defense is actually 
superior to the passive with respect to con- 
ferred resistance against novel counter- 
arguments, The significant interaction does, 
however, demonstrate that any superiority of 
passive over active defense is less pronounced 
against novel than against the same counter- 
arguments, x 

In this study, as in that by Papageorgis and 
McGuire (1961), we find that the pre-exposure 
(either actively or passively) to refuted 
counterarguments produces considerable 
generalized immunity. In neither active nor 
passive condition is there a significant differ- 
ence between the resistance to same and novel 
counterarguments, and in the two conditions 
combined the mean beliefs after the same and 
novel strong counterarguments (11.06 and 
11.02, respectively) is only trivially different, 
This finding is encouraging in regard to the 
possibility of making given beliefs immune to 
persuasion even when we cannot foretell which 
counterarguments will eventually be offered 
against them. 

Single vs. double defenses. The second 
hypothesis dealt with an interaction between 
single vs. double defense and subsequent 
exposure to strong forms of the same vs. 
novel counterarguments. An inspection of the 
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four means shows that both marginals, x 
well as the predicted interaction, attained 
appreciable levels of significance. The same 
vs. novel counterarguments marginal wa 
significant at the .01 level: over the four 
defensive conditions, subsequent exposure to 
strong forms of the same counterarguments 
reduced the beliefs only to 11.61 points while 
novel counterarguments reduced them to 
10.92 points. 

The single vs. double-defense marginal 
shows the least differential (significant only] 
at the .10 level): after the single defense the 
strong counterarguments reduced the beliefs 
to 11.04 points and after the double defense, 
to 11.49 points, 

The interaction was in the predicted 
direction: the single defense was almost & 
effective against strong forms of novel counter- 
arguments as against strong forms of tht] 
same counterarguments (the postattack means 
being 11.06 and 11.02, respectively), while the 
double defense was much more effective against 
the same than against novel counterarguments 
(the postattack means being 12.17 and 10.82, | 
respectively). This interaction is significant 
at the .01 level. To state this interaction effet 
in another way, the slight marginal superiority 
of the double over the single defense just 
mentioned (p = .10) was due entirely to the 
subconditions in which the subjects Wet 
subsequently exposed to the strong forms 
the same counterarguments as had been 
refuted in the defensive session. In this 
“same” condition the superiority of double 
over single defense was significant at the «0t 
level. The implication is clear that it is wor 
belaboring the counterarguments to the extent 
of a double (active and passive) refutation ! 
and only if the subsequent attack is going i 
involve the very counterarguments refutes 

That the single defense is as good as- 
fact, trivially (p = .60) better thanh 
double in conferring resistance. to nove 
counterarguments fits in well with the undet 
lying postulate that the efficacy of the pi 
refutational defense in producing resistant 
to novel counterarguments derives mà 3 
from provocative impact of pre-exposing , 
belief to counterarguments, which bes 
home to the subject that the truism is ps 
attackable and stimulates him to bolster 
belief. Elaborate double refutation of 
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counterarguments adds little since the attack 
jnvolves novel counterarguments (and may 
subtract from the defense stimulating impact 
by refuting too thoroughly the counter- 
arguments to which the subject is pre-exposed 
during the defense). Thus this interaction 
effect between the single vs. double defense 
and the same vs. novel counterargument 
variables lends itself to the same theoretical 
interpretation as did the significant interaction 
between the passive vs. active single defense 
and the same vs. novel counterargument 
variables. 
The double defense conditions. The third 
hypothesis dealt with an interaction effect 
predicted in the double defense condition: that 
the active-then-passive sequence of defenses 
would prove more resistance producing 
against strong novel counterarguments, while 
the passive-then-active sequence of defenses 
would be more effective against strong counter- 
arguments similar to those refuted. As can be 
seen by inspecting the four relevant means in 
Table 1 (lower center portion), the interaction 
effect between the order of defenses and the 
Same vs. novel counterarguments variables 
is in the predicted direction but trivial in 
Magnitude. Neither is there any differential 
effect in the sequence margin per se: the 
strong counterarguments reduced the beliefs 
to 11.54 points after the active-then-passive 
ae and to 11.45 points after the passive- 
E Sequence of defenses. (The F of 
" erence is less than 1.) The only sig- 
E difference in the double defense 
after "ma a hg same vs. novel marginal: 
et s 3 oat the beliefs were 
counterar, y to 12.17 points by strong 
ae pia that were the same as had 
endum , but to 10.82 points by novel 
dlücant ape This difference is sig- 
E on the .001 level. As mentioned in the 
ier i se this sizable same-vs.-novel 
trasts with à E Ted iR conditions con- 
defense hich ack of such effect in the single 
anaE contrast tends to confirm the 
postulate of the study. 
strates S der preceding discussion demon- 
fered n eight defensive conditions 
E. siderably among themselves in 
noted S EAM effectiveness, it Should be 
Significant at all conferred on the belief a 
Tesistance to the subsequent strong 
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attack. Even the least effectively immunizing 
defense (single passive defense, with subse- 
quent attack by novel counterarguments) 
left the beliefs still appreciably stronger after 
the attack (at a mean of 10.62 points) than 
they were left in the attack only condition, 
where the resulting mean was only 8.60, the 
difference being significant at the .001 level. 

All the beliefs being defended and attacked 
in this study were of a highly homogeneous 
type: all were cultural truisms and all touched 
upon a highly involving and anxiety arousing 
topic, physical well-being. Perhaps this 
deliberate homogeneity is responsible for the 
obtained generality of effects across issues, in 
the sense that none of the interaction effects 
between issues and type of immunization 
reached conventional levels of significance. 
(However, issues as a main effect did con- 
tribute significantly (p « .05) to the variance; 
that is, the overall belief levels after both 
combined defenses and attacks did vary 
significantly among the four issues.) General- 
ization of these results to other types of issues 
would be unwarranted without further re- 
search. 

There are, in fact, theoretical reasons for 
hesitating to generalize the present findings 
to other types of beliefs, The “selective 
exposure" postulate used to derive the present 
prediction would yield quite different pre- 
dictions with other types of beliefs, Had the 
beliefs been controversial rather than truisms, 
the subjects would have been more practiced 
in defending them and, hence, would have 
participated more effectively in the active- 
defense condition and would have been less in 
need of a threatening, defense stimulating 
pre-exposure. Had the issues been less in- 
volving than these health ones, there would 
have been more to gain, in regard to moti- 
vating the subject to pay adequate attention 
to the material, from requiring his active 
participation in the defense. Hence, with 
different types of issue we would expect 
resulting differences not only in the size of 
the obtained effects, but even in their direc- 
tions. 


SUMMARY 


The postulate that cultural truisms 
to be overprotected as a result of the ten 
toward “selective exposure,” together 
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the results of earlier studies, led to three 
predictions regarding the amount of resistance 
to subsequent strong counterarguments that 
would be conferred by pre-exposure to refuted 
counterarguments under conditions of active, 
passive, and combined refutation. The hypoth- 
eses dealt with interaction effects between 
amount of activity during the pre-exposure 
and whether the strong counterarguments 
used in the subsequent attack were the same 
as or different from those to which the subject 
had been pre-exposed. 

Each of the 168 college student subjects 
served in four conditions, each on a different 
issue. The complex design furnished data on 
the relative effectiveness of the four defensive 
conditions (active, passive, passive-active, 
and active-passive refutations of counter- 
arguments) in regard to direct strengthening 
effect, and immunization against the same, 
and against novel counterarguments presented 
in strong form 2 days later. 

Besides corroborating various results of 
previous studies, the present results confirmed 
two of the three interaction hypotheses. 

1, In the single defense condition, more 
immunity was conferred against the same 
counterarguments by passive than by active 
defense, while against novel counterarguments 
the active defense was the more effective 
(p = .02). 

2.In the double defense condition the 
predicted interaction effect involving the 
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sequence of the active and passive defenses 
was not found. 

3. As predicted, the immunizing superiority 
of the double over the single defense was 
found only when the subsequent attack 
involved the same counterarguments; against 
novel counterarguments, the single defense 
was as effective as the double (interaction 
significant at the .01 level). 
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larger research project concerned with 

the relationships between childhood 
maladjustments, personality problems, and 
associated background factors—described at 
the time of their actual occurrence rather than 
retrospectively—and adjustment in young 
adulthood. Information contained in child 
guidance clinic case histories is being com- 
pared with personnel and psychiatric informa- 
tion in the Selective Service and military 
service records of the same persons at a later 
period. For the total group of male clinic 

Cases from two child guidance clinics in 

Minnesota, information has been obtained 

from the Selective Service System, the national 

Army, Navy, and Air Force Record Centers, 

and the Veterans Administration. Both 

clinics began during the 1920s; the sample 

under consideration here includes men who 

aes in service during World War II and 
er, 

An introductory report (Roff, 1956) in- 
cluded a preliminary predictive study, indi- 
cating that experienced clinical child psy- 
chologists could make significantly accurate 
Predictions of service adjustment, good or 
dd on the basis of the case histories. With 
E ostadon that meaningful predictions 
enr made on a global „basis, it seemed 
E aa identify the particular dimensions 
would h g to prediction. | Relevant findings 
waits ave both theoretical and practical 
cud aS could provide useful leads for 
quia procedures outside the case history 
TIN Specified dimensions could also be 
patt ed in potentially more effective 

ms for predictive purposes. 

Don subgroups exhibiting various 
A poor adjustment at the adult level 
This investigati i 

U gation was supported (in part) b 

ine ins M2218, from e pone 

A. -49.007-MD-2015 Health; by Contract Number 
and Develo -20 Š vith the Army Medical Research 
18 (600) s4 oont Command; and by Contract AF 
Medicine. with the Air Force School of Aviation 


T: present paper describes one part of a 
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include those diagnosed as psychoneurotic, 
psychotic, character and personality disorders 
with and without bad conduct, “psycho- 
somatic” disorders, sexual deviations, etc. 
One would expect that some case history 
information may be specifically predictive of 
only a single type of outcome but that some 
variables from the childhood period may be 
more broadly effective in predicting outcomes 
of more than one kind. The first of these out- 
come groups to be studied intensively was a 
sample of persons diagnosed as psychoneurotic 
while in service. Reports of predictions for this 
group in contrast to a control group judged to 
have made satisfactory service adjustments 
have been presented elsewhere (Roff, 1957, 
1960). 

The present paper is concerned with the 
application of the method developed for the 
prediction of adult psychoneurosis from child- 
hood histories to a group who exhibited severe 
bad conduct while in service. 


METHOD 


The subjects were 164 former child guidance clinic 
cases who had entered and been discharged from one 
of the military services. Half of these subjects had a 
record of severe bad conduct while in service, serious 
enough to lead to other than an honorable discharge or 
to a number of days AWOL or days of confinement 
(“bad time") totaling at least 60. Although most of 
this behavior occurred in a military setting, a few 
subjects committed offenses in the civilian community 
severe enough to result in their discharge from service. 

‘The first major class of offense was repeated periods 
of AWOL, with or without other violations. A second 
major type of offense was theft or robbery under a 
wide variety of circumstances, either in a service 
situation or while on leave or AWOL. A third category, 
frequently including repeated offenses or in combination 
with other types of offense, included disobedience, 
refusal to obey an order, and violations of this nature, 
including assault on a superior. Other groups of offenses 
included repeated drunkenness, forgery, escape from 
detention, and breaches of regulations governing the 

ion or use of firearms. In general, this group of 
men committed violations for which they could have 
no reasonable hope of escaping detection and punish- 
ment. If a man went AWOL for a month, he could 
expect that this would be detected and that he would 
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be caught and returned (unless he had been placed in 
civilian confinement in the meantime). 

Individuals who had had frequent enough or serious 
enough offenses in the civilian community to lead to 
their rejection for military service were not included in 
the present sample, so some of the most serious offenders 
were excluded, thus restricting somewhat the range of 
the group studied. Although the policies regarding 
rejection varied somewhat from one time to another, a 
substantial number of serious offenders in all periods 
were excluded from service altogether. 

Many bad conduct subjects were diagnosed by 
service psychiatrists as having a personality disorder, 
but not one appropriate for a medical discharge. 
During World War II, offenders who were sent to 
disciplinary barracks were carefully studied psy- 
chiatrically, partly with a view to determining whether 
or not they should be returned to duty. More recently, 
it has been the practice for an offender to be referred 
to a psychiatrist to assist in determining whether he 
should receive a medical discharge on psychiatric 
grounds or an administrative discharge. Since the 
reports of these interviews give a somewhat more 
definite picture of the individuals involved than does a 
simple statement of offenses, abstracts from some of 
these are presented for illustrative purposes. 

‘This soldier has been in the Army for approxi- 
mately eight months, and has been under psy- 
chiatric evaluation and treatment practically ever 
since he came into the Army. He is subject to periods 
of animosity and has developed psychosomatic 
symptomatology referred to various parts of the 
body, which has handicapped him in his duties. He 
has been convicted by court-martial five times during 
the past six months. 

He is a paranoid personality, chronic, severe, 
manifested by an habitual inappropriate attitude of 
being imposed upon or persecuted, nonpsychotic in 
degree. He has had six periods of AWOL ranging 
from ten to fifty-nine days, 

He is erratic, undependable, and argumentative, 
with temper tantrums. He has had repeated dis- 
ciplinary measures taken against him and has been 
constantly ostracized by the other men because of 
his laziness, belligerence, and unwilingness to 
cooperate. He is an anxious, restless, and egocentric 
individual who has always had phobias and con- 
flicts with others, with a long record of maladjust- 
ment in the Navy. 

"These subjects have been studied without specific 
reference to combat performance because most of them 
never reached combat. Beebe and Appel (1958) ex- 
amined the relationships between various items of 
information obtained from precombat service records 
and psychiatric breakdown during combat, finding 
that men with disciplinary records broke down about 
twice as often as individuals without such history. The 
bad conduct of their groups was milder than that of 
the present subjects. 

Control subjects consisted of individuals who had 
reached and kept a grade of sergeant (or equivalent) 
or higher without any indications of disciplinary or 
mental health trouble at any point following entrance 
into service. They were matched in childhood IQ with 
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the bad conduct cases because intelligence level was 
sometimes directly related to service career possibilities, 
Control cases were drawn by taking the nearest appro- 
priate case, in terms of clinic case number, which had 
not previously been used. While this method provided 
comparable information for both control and experi- 
mental subjects, it meant that the controls differed 
from a random sample of the general population in 
having been dealt with by a clinic during childhood. 
Some subjects had received treatment ranging from 
minimal to substantial in amount, while others had 
only been studied. 


Procedures for Analyzing Clinic Case Histories 


Information in the case histories usually covered the 
following kinds of items: behavior difficulties; mother; 
father; grandmother, aunts, etc. (of particular impor- 
tance if they functioned in place of the parents); 
siblings; home and family situations; health, including 
nervous mannerisms and speech; social adjustment 
outside the family; psychological test data; and psy- 
chiatric evaluation. 

Readers with no knowledge of the service records 
of the subjects abstracted all potentially significant 
information for certain of the categories listed above. 
These were then evaluated by other readers who had 
never seen the complete case history. For example, all 
the information about the mother was abstracted and 
evaluated by itself. A similar procedure was followed 
for siblings and sibling relations. Two practices were 
found desirable in order to lessen a possible loss 0 
information in going from the full case history to these 
abstracts. ; 

1. Different opinions of the child were sometimes 
expressed by informants who had had an opportunity 
to observe him in different situations. In preparing 
abstracts, it was found important to identify all 
abstracted material by the character of the informant 
who furnished it (teacher, case worker, mother, psy” 
chiatrist, etc.). 

2. There was sometimes a difference between earlier 
and later descriptions of a youngster as a result o 
either treatment or merely the passage of time. It e 
considered desirable not to lose this chronologica 
information in the course of the abstracting, 80 the 
dates were carefully recorded for all items in the 
abstract. m. 

While this is a highly multivariate situation, it 
seemed desirable to attempt to develop different sing | 
variables on an analytic basis before attempting a 
put them back together. It would be expected thal 
some simplification of this multivariate situation woul 
result if one or two leading dimensions could be founi a 
Various lines of work, including psychiatric consider" 
tions of disturbances of interpersonal relationships ! 
various behavior disorders, the success of nominal 
techniques and buddy ratings in predicting E 
performance (Trites & Sells, 1957; Williams & Ted 
1947), and the current use of group procedures b i 
aid to diagnosis in some child guidance clinics sug 
that the boy's social adjustment in relation to of a 
youngsters his own age might be an important area 
investigate, 


I 
" 


i 


4 
. Jt was thus decided to see what could be accom- 
- plished with the general variable of peer group social 
Telationships and related items of information. Ab- 
stracts were prepared, including all appropriate material 
“in this category contained in the history, with explicit 
1 information about informant and date for each item of 
‘information. One class of information, excluded as 
‘unhelpful if not actually misleading, covered all state- 
‘ments by the mother indicating good peer group 
‘adjustment outside the home situation. Two factors 
" contributed to making this information unsatisfactory. 

"The mother may have had no accurate basis for com- 
paring her child with other children on this score, and 

“Sometimes a mother, tiring of contacts with the clinic, 
Teported that everything was fine when information 

_ obtained from other sources at the same time indicated 
that this was not so. These observations are in line with 
lie finding of Harris (1959) that for a group of children 
without serious problems, school personnel and the 

trist agreed more closely in the evaluation of 
children than did the mother with either. 

T For the peer group adjustment dimension, experience 
led to the construction of a priority list for the weighting 
of informants (other dimensions would not necessarily 
_ have the same list of priorities). This was related to the 
— Opportunities persons in different categories had for 

actual observation of the youngster in interactions with 
associates. Instructions to the readers for the use of 
his priority list were to examine first the data from 
4 informants with the highest priority and to keep 
od down the list as long as, and only as long as, 
ES k pay to reach a decision. The priority list is 
Li. Persons, primarily teachers, who had an 
n extended opportunity to observe him in a peer group 

K, Situation when quoted directly. 

E he 2. Visiting teachers or case workers when sum- 
C Marizing information from persons in Category 1 
B „Specific quotations; clear and definite 
_ formal diagnostic statements by psychiatrists which 
Ki relate to social adjustment. 

D OM except for favorable state- 
B. Mons about social adjustment by 
i nt in interview, and statements by psychiatrist 

& D as noted above; comments about personality 
LA Psychologists based on impressions obtained 
C ing the mental test situation. 

l eet the observations of the chronology of 
ys for s and the Priority list for informants, a 
m Bun ‘information as positive, neutral, 
“developed E appraising peer group adjustment was 
i, 1957) s has been presented in detail elsewhere 

E d. „Among the positive items were such 

3 freedom E signs of liking by the general peer group, 
“acces fu Sau in class or on the playground, 

: itications teat th older children (nonsiblings), and 
lescence. N, girls liked the subject, especially at 

‘of friends em Duy items are exemplified by a shortage 
Playing bh ut specific indications of being disliked, 

- ated as 5 Younger children, etc. Items to be eval- 
by Er included all signs of active dislike 
1 Peer group, inability to keep friends, and 
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being regarded as “odd,” “peculiar,” or “queer” 
other children. (os Mbse quif, RARE SM 
Working with the abstracts according to the in- 
formant priority list and the list of information to be 
evaluated as positive, neutral, or negative, two graduate 
students in psychology at approximately the MA 
level made “blind” evaluations as to good or poor peer 
group adjustment during the period covered by the 
case history. Each student evaluated about half the 
abstracts. Judgments of “good” or “poor” were made 
only if the reader felt reasonably certain of a judgment. 
If the information seemed incomplete, a response of 
“undecided” was made. This required some judgment 
on the part of the reader, but the area in which judg- 
ment was required was narrowed markedly in com- 
parison with global predictions based on the entire 
case history. This procedure is intermediate between 
completely global predictions based on the entire case 
and more atomistic predictions based on a further 
fragmentation of the case history material. 


RESULTS AND DISCUSSION 


This procedure for making peer group 
evaluations was originally developed with 
subjects diagnosed as psychoneurotic during 
service. Applied unchanged to a new sample of 
subjects who showed severe bad conduct while 
in service and control subjects different from 
those with which the procedure was developed, 
it yielded the results shown in Table 1. Those 
subjects whose earlier peer group adjustment 
was evaluated as poor showed significantly 
more bad conduct in service than did subjects 
whose earlier peer group adjustment was 
appraised as good. 

For purposes of comparison, results ob- 
tained earlier with the psychoneurotic sub- 
jects are shown in Table 2 (Roff, 1957, 1960). 
Tt can be seen that a similar discrimination was 
obtained for the psychoneurotic group. 

The problem of the early detection of 
individuals who are likely to exhibit later 
maladjustment of various kinds is generally 
recognized as an important one. It is commonly 
assumed in dealing with both physical and 
psychological difficulties that early treatment 
would have possibilities of benefit that might 
be lost if the difficulty were not detected until 
it had reached a later stage. It is difficult to 
find detailed results which support this 
assumption for the problems under considera- 
tion in this paper. It remains, however, 
important working hypothesis. 

The combined results reported here indicat 
that the level of earlier social adjustmen! 
contributes significantly to the discriminatio 
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TABLE 1 
EVALUATIONS OF PRESERVICE STATUS IN 
RELATION TO OUTCOME IN SERVICE: 
Bap Conpucr Group AND CONTROLS 
Outcome Good AREE Poor 
Good 47 14 21 
Bad conduct 20 16 46 
X = 10.17; $ < 01. 
TABLE 2 
EVALUATIONS OF PRESERVICE STATUS IN 
RELATION TO OUTCOME IN SERVICE: 
PSYCHONEUROTICS AND CONTROLS 
Outcome Good Ap Peet Poor 
Good 54 32 18 
Psychoneurotic 13 29 62 


x? = 24.71; > < .001, 


between groups showing adjustment difficul- 
ties while in service and “good” groups. This 
is in line with an impressive amount of evidence 
that in a situation where there is sufficient 
mutual exposure to permit thorough acquaint- 
ance, appraisal by peers is very effective in 
predicting subsequent military adjustive re- 
actions of various kinds (Rigby, Sayers, 
Ossorio, & Wilkins, 1957; Trites & Sells, 
1957; Wherry & Fryer, 1949; Wilkins, 1954; 
Williams & Leavitt, 1947). The school situ- 
ation is also an appropriate place for effective 
peer group evaluations, and it has been found 
that teachers can report accurately on the 
peer group reactions of children, at least in 
cases of marked behavior disturbances, 

This can be made more concrete by giving 
illustrations of teachers’ comments for the bad 
conduct subjects. The most frequent descrip- 
tions relating to the peer group situation 
indicate in one set of words or another that 
the boy is “mean” and is disliked by his 
associates, Sometimes this is accompanied by 
attempts to dominate other children. In other 
cases it is described simply as behavior which 
is antagonizing to others: “He always wants 
to be the leader, and if the other children do 
not do as he says, he is apt to be abusive and 
ugly to them”; “His conduct is irreproachable 
in the schoolroom, but he can be very mean 
when out of sight of authority. The boys in 
his room do not like him. During the past year, 
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hardly a week went by without someone com- 
plaining about him”; “The teacher had known 
of his hurting children on various occasions, 
The children disliked him and grew into the 
habit of blaming him for everything that went 
wrong”; “He had violent temper tantrums 
when crossed. He did not get along with any 
of his classmates and was always in fights 
with some of them”; and “He feels picked on, 
wants to dominate, and is often the aggressor 
in fist fights, and the other boys are afraid of 
him.” This pattern of overt aggression toward 
other children was more pronounced than it 
was in the case of the psychoneurotic group. 

In a definitely smaller number of cases, the 
boy was described as a good-natured non- 
conformist; “Very much of a problem, just 
smiled and wanted to do what he pleased”; 
“There was nothing mean about him, and he 
was good-natured and generous. He is well- 
liked by the other children, but all alike 
consider him as undependable, untruthful, 
and irresponsible.” 

Groups who know one another as well as do 
members of the same grade school class are 
fairly well aware of disturbed behavior in one 
of their members, although they may have no 
clear-cut diagnostic term for it. The peer group 
reactions employed in the present paper 
collected and recorded by clinic personnel, 
represent a promising if neglected type of data. 
Further systematizing of such information ; 
should lead to an improvement in its predictive 
value. It seems very likely that a systematic 
study of antagonizing youngsters, extending 
over most of the grade school period, would 
identify, in at least the worst 1 or 2% of the 
cases, a group which would be of major interest 
from a long-time mental health point of view. 
Obtaining appraisals for at least 2 or 3 year 
would allow both for such real shifts of be 
havior as do occur and for unreliability in the 
observations of individual teachers. It seems 
possible to use this approach to develop ? 
method that may prove more practica y 
effective than any now available for the early 
location of individuals with mental heal 
problems. 

In attempting a solution of the early detec: 
tion problem, it is not essential that a precis? 
prediction of the specific adult difficulty be 
made. The finding of the present paper pr 
two major adult adjustment areas, psycho" 


md bad conduct, can be predicted 
e procedure gives a point of attack 
work on the early identification of 


and bad conduct groups are not 
and preliminary work indicates that 
be differentiated to some extent on 
3 of early peer group relations, the 
tion between these two groups is 
tical importance than the finding 
is enough generality in the method 
it the early prediction of later dis- 
in both behavioral areas. It is also 
that this same procedure will be 
2 in locating youngsters who later 
psychotic or who as adults exhibit 
"personality disturbances other than 
cussed here. 


SUMMARY 


up study was made of male child 
Clinic cases from two Minnesota 
terms of their military service records. 
lictive procedure based primarily on 
oup adjustment as described in the case 
had been found earlier to discriminate 
persons diagnosed as psychoneurotic 
n service and controls who were former 
cases who had made relatively good 
nents in service. This procedure was 
to the prediction of severe bad conduct 
Service by comparing the peer group 
Írom the clinic case histories of 82 
Who exhibited severe bad conduct 
ervice with those of controls matched 
tistory IQ. The procedure based on 
peer group reactions discriminated 
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between the bad conduct and control groups 
about as well as it had done with the psycho- 
neurotics. Both these adult problem groups 
tended as youngsters to have antagonized 
their associates to an unusual degree. Some 
implications were discussed. 
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SOME ASPECTS OF SELF-CONCEPTIONS AND ROLE DEMANDS IN 
A THERAPEUTIC COMMUNITY 
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Austen Riggs Center, Stockbridge, Massachusetts 


mIs study is an attempt to measure 
| objectively the self-conceptions of pa- 
tients and personnel, and the role de- 
mands upon them, in a psychiatric hospital 
where daily observation of hospital life over 
the past 10 years has indicated that patients 
have come to view themselves more as responsi- 
ble adults than did patients in previous years. 
The term role demand is used as suggested by 
Levinson (1959): “The role-demands are 
external to the individual whose role is being 
examined, They are the situational pressures 
that confront him as the occupant of a given 
structural position” (p. 173). From comparison 
of self-conception with role demand, an esti- 
mate of the suitability of a person to fill one 
or another position may be inferred, 

The Austen Riggs Center, an open private 
psychiatric hospital specializing in the treat- 
ment of borderline cases where all patients are 
in intensive psychoanalytic psychotherapy has, 
over the past 10 years, evolved into a com- 
munity in which the patients and staff share 
administrative responsibility and authority, 
Patients, individually and as a group, have 
actively collaborated with the staff to set and 
enforce standards of behavior as well as to 
create social inventions to enable the hospital 
society to run smoothly. 

Each patient is a citizen in the hospital 
community, having the privileges and responsi- 
bilities that accrue to being in the position of a 
citizen. He votes for representatives to various 
agencies, votes on policies and procedures 
regarding community life, is expected to 
exercise his rights as a citizen, and to abide by 
the agreed-upon customs, rules, and laws. 

Patients are the principal administrators of 
a work program in which all patients are 
required to participate for a minimum of one 
hour daily. The jobs are those which would 
otherwise have to be done by hospital per- 
sonnel, such as some aspects of housekeeping, 
maintenance of grounds and buildings, con- 
struction and repair of equipment and furni- 
ture, and secretarial work. The money saved 
as a result of the work accomplished is con- 


tributed to a Patient Aid Fund. Since all the 
work is carried out during a common time, 
during which each patient is aware that every 
other patient is working at some assigned job, 
the program is called the Common Work 
Program. Failure to meet a work assignment 
is considered a community problem, and 
patients who absent themselves from their 
jobs are confronted by designated agencies of 
patients and staff. 

An Activities Program of instruction in arts, 
crafts, drama, and academic courses taught by 
a staff of professional artists and teachers is 
available. Participation in activities is elective, 
and it is the design of the department to have 
people approach the activity areas as students 
rather than as patients being treated by 
traditional occupational therapy. The teachers 
tend to view and respond to their students as 
they react to others they instruct who happen 
not to be hospitalized psychiatric patients; 
the tendency of the instructors is to be more 
teachers than adjunctive therapists. The 
position of student in the Activities Program 
is, thus, like that of student in an adult educa- 
tion program. 

All patients live in a building that resembles 
2 country inn. Each cares for his own room 
and determines for himself how he will spend 
his leisure time. There is no segregation by 
age, sex, or diagnosis, and any behavior that 
is socially appropriate, such as parties (with 

inking in moderation), tournaments, dating, 
etc., is an accepted part of life in the hospital 
Many expectations and responsibilities similar 
to those which stem from role demands of life 
outside the hospital impinge on patients 
when they fill the various positions available 
within the hospital. If a patient creates 4 
problem for the community, formally €s- 
tablished agencies consisting of both patient 
and staff members confront him, seeking ? 
social solution to the problem while leaving 
the psychodynamics of his behavior for €* 
ploration in his psychotherapy. EN 

It is probably evident from this descripto 
that the considered policy of the members ? 
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hospital society, both patients and per- 
nel, has been to create a community where 
patients can have an image of themselves as 
worthwhile, responsible, and productive people 
and where the personnel can view them in a 
like manner. Since there is no master plan to 
reach the intended goal, each new aspect of the 
hospital organization must emerge from joint 
atient-staff agreement on ways of main- 
taining the more adult and mature functioning 
"patients while they are hospitalized. Pa- 
tients today seem to view themselves as more 
responsible adults than did patients in previous 
years, although there has been no change in 
admission policy. 
the self-conception of the hospitalized 
| psychiatric patient is a function of the role 
. demands upon him as well as of his ego or- 
ganization. A psychiatric hospital ideally 
should make available to the patient a range 
“of positions within which he can find not only 
some that are congruent with his ego organiza- 
tion and self-conception at the outset of 
treatment, but also others that are fitting for 
him as he changes and help him prepare for 
return to life outside. When, reintegrated, the 
patient can cope with role demands beyond 
the ones that exist in the hospital, he may 
seek social positions that are fitting with his 
new level of ego organization. 
nm ro social positions were selected 
m uo and comparison in regard to 
the role demands associated with them, and in 
regard to the self-conception of patients and 
p mnel, We were particularly interested in 
. finding ding a measure of the way patients and 
hospital 
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personnel differentiate the role de- 
bow of these positions, and in determining 
ir vs M demands are or are not congruent 
|! o eir self-conceptions. 
m Obtain a measure of role demand, patients 
3 Staff were asked to rate various social 
Positions on a number of adjectival scales in 
eee cosctibed by Osgood (1952) as the 
[s ic differential technique. In addition, 
unus Obtained for the word “me” could be 
Compared with those obtained for social posi- 
to determine with which category of 


Fe MM details of the development of the hospital 

re and of the research program of which this 
| White Bibs see Kubie (1960), Polansky, Miller, 
37) 55), and Polansky, White, and Miller 
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positions the subject’s self-conception was most 
similar. 

The semantic differential technique makes 
it difficult for a subject to base his ratings on 
preconceived assumptions concerning the 
intended meaning of any set of ratings, and, 
thus, deliberately to control them. If for 
example a subject is to rate the concept 
“adolescent,” “mental patient,” or “worker 
in the Common Work Program” on a seven- 
point scale from hard to soft, or slow to fast, 
or beautiful to ugly, it is difficult for him to 
know what the “correct” rating is supposed 
to be. Especially as many concepts are rated 
along many scales while the subject is con- 
strained from comparing his various ratings, 
it seems reasonably certain that results cannot 
be deliberately manipulated. 


METHOD 


The following social positions were selected for study, 
because they have received most attention from both 
patients and staff at this hospital: 

1. One category of positions can be considered 
unique for this hospital (though, of course, analogous 
ones may exist in other hospitals). These Hospital- 
Created social positions include Worker in the Common 
Work Program, Citizen in the Riggs Community, and 
Student in the Activities Program. 

2. Another group of positions comprises some of 
those available to adults outside of a hospital, though 
they have counterparts in the hospital community 
(we refer to these as usual adult positions). They 
include Worker, Citizen, and Student. 

3. A third class (Mental Patient positions) consists 
of those that refer more directly to the psychiatric 
disorder—Patient at Riggs and Mental Patient. 

4, A fourth category—Adult, Adolescent, Child, 
and Baby—was included to allow comparisons from 
which inferences could be drawn with regard to the age 
levels that are associated with the various positions. 


Subjects 

Thirty-eight adult patients and the 33 members of 
the professional staff of the hospital (10 members of 
the clinical and research staff, 9 residents in psychiatry 
and clinical psychology, 6 activities instructors, 8 
nurses) participated in the study. The patients (20 
women and 18 men) ranged in age from 18 to 51 years 
with a median age of 25 years. Seventeen were diagnosed 
as borderline psychotic or psychotic and 21 were 
diagnosed as neurotic or neurotic character disorder. 
The median length of stay for all patients was 8.75 
months with a range from 3 days to 5 years. 


Measures of Role Demand and Self-Conception 


Rating scales based on the semantic differential 
technique were administered to all subjects. Ratings 
were made along seven-point scales of 15 adjectival 
opposites for each of the 12 social positions and for 
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the word “me.” The adjectival scales in the order pre- 
sented were: cruel-kind, masculine-feminine, sick-well, 
active-passive, necessary-unnecessary, calm-excitable, 
therapeutic-untherapeutic, weak-strong, irresponsible- 
responsible, slow-fast, beautiful-ugly, real-unreal, good- 
bad, hard-soft, important-unimportant. The semantic 
differential was administered to patients in a group 
meeting called for the purpose, and to personnel at 
a regular staff meeting. 


Instructions 


All subjects were given the following instructions 
which are essentially identical to those used by Jenkins, 
Russel, and Suci (1958). 

The purpose of this experiment is to discover the 
meaning of certain words by getting your rating of 
the words on a set of descriptive scales. Attached 
are 13 sheets each with a set of scales and each with 
a different word printed at the top to be rated on 
each of the scales. I wish you to rate the words on 
the basis of what they mean to you. Place a check 
mark on each of the scales wherever you feel the 
word should be rated. Work as fast as you can; 
don’t take too long to make any rating; and rate 
according to your first impression of the words. 
Don’t hesitate to use the extreme ends of the scales, 
wherever these seem appropriate. 

Here are some examples of the way you should do 
this task: If you were rating the word EXPRESS 
TRAIN and came to the scale *slow-fast," you would 
probably consider an express train quite fast, and 
so you would place a check mark on the “fast” end 
of the "slow-fast" scale—perhaps here: 


EXPRESS TRAIN 


"Then you would go on to the next scale. Be sure 
that your mark is between the dots. 

If next you were rating the word STREETCAR, and 
came again to the "slow-fast" scale, you might feel 
that a streetcar was only fairly fast and would 
check the scale here: 


STREETCAR 
: ns 


slow : : : : : : 
—fast 

Then you go on to rate STREETCAR on the rest of 
the scales. If, however, you were rating the word 
OXCART on the “slow-fast” scale, you would probably 
consider it quite slow, and rate it here: 


OXCART 
slow—— : M : : : 3 E 
—fast 
or here: 
SLOW ME uii $$ MI 
—fast 


Of course, you would make only one mark, 

Most of the ratings you are to make will not be 
so literal as these examples. For instance, rating the 
word OXCART, you might come to the scale “hot- 
cold.” 


OXCART 

hot—— 3 —— > —— : —:—: 
—cold 

There is no obvious “correct” answer here—s0 
rate it as you see it; does OXCART seem to you to be 
hot or cold or in between? Don't expect the ratings 
to be literal. We want your impression of the words. 
In some cases you wonder how a certain scale can 


slow— : — : — : — : — : —: apply to the word you are rating, but we have foun 
AAE that you will be able to make the cease me 
or perhaps in the next space: San y y sped instructions, rating quickly on 
EXPRESS TRAIN Misi 
E A gS, Mte RD ON AB The mean of the ratings of all subjects on 
——fast each adjectival scale for each social position 
TABLE 1 


D Score Matrix 
(All patients and personnel combined) 


Stu- |Woker| — 
iene, | othe | Cie? | patient | gantal 
Clusters Citizen | Worker| Adult | Activ- | mon | Riggs | at | Patient 

ities | Work | Com- | Riggs 

Pro- | Pro- | munity 

gram | gram 
Student -95 | 2.45 | 1.43 | 1.85 | 1.84 | 2.57 | 4.45 | 5-28 
oe 2.76 | 1.08 | 1:79 | 1:60 | 2.34 | 4.33 | 5:79 
IAE 1.76 | 3.62 | 3.41 | 4.60 | 5.68 Kk) 

to. Ten 2.32 | 2.14 | 2.94 | 4.90 | 6: 

Student in the Activities Program 74 | 1.35 | 3.16 4,54 
Worker in the Common Work Program y 1.22 | 3.21 | 4.64 
Citizen in the Riggs Community “1 9:19 | 3.18 
Patient at Riggs 2.08 
Mental Patient 
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Type I Worker — Adult — Citizen = Student? 

Type II Student in the Worker in the 
Activities Program “ Common Work Program 
Type III Patient at Riggs — Mental Patient 


Citizen in the 


©] Riggs Community 


Fic. 1. Clusters obtained for social positions available to adults. 


Staff: 


Typel mez Student <— Citizen — Adult «— Worker 


Type I _ Citizen in the Student in the 
Riggs Community 
Type III Patient at Riggs = Mental Patient 


Neurotic Patients: 
Typel Worker — Adult — Citizen 
Type II m Student in the 


Type III Citizen in the 
Riggs Community 


= Student 
Worker in the 


Psychotic Patients: 
TypeI Worker — Adult 
Typell Student in the 


= Citizen — Student 
Worker in the 


Activities Program " 


Worker in the 
Common Work Program 


Activities Program * Common Work Program 
= Patient at Riggs — Mental Patient 


Activities Program “ Common Work Program 


Type III „„„—. Citizen in the 


mé= Riggs Community 


« Patient at Riggs — Mental Patient 


Fic, 2. Clusters obtained for staff, neurotic patients, and psychotic patients. 


was calculated. From these mean scores a D 
score matrix, as described by Osgood and Suci 
(1952), was obtained for the group as a whole 
(Table 1). The D score is a measure of dissimi- 
larity between two variables. A D score of 
zero would indicate identical ratings on two 
positions; the larger the D score the less similar 
are the ratings between positions. 
Elementary linkage analysis, a rapid and 
objective method of clustering variables, 
developed by McQuitty (1957), was used to 
pent the clusters in the matrix of Table 1. 
peony linkage analysis categorizes vari- 
es into types in such a way that any 
Dam in one category is in some way more 
it i AR other variable in that category than 
With R any variable not in that category. 
iam refe technique the clusters or 
Cau Tuctures of Figure 1 were obtained for 
RR positions available to adults (in this 
analysis “baby,” “child,” “adolescent. ” and 
me” were omitted). : i 
r pee analysis of the D scores of three 
ou EE neurotic patients, and psychotic 
Le the clusters of Figure 2. 
fhe at the staff rates the word Me most 
way they rate usual adult positions. 


2A 
most ee arrow indicates that each variable is 
indicates ee with the other. A single arrow 
Most close] t the variable at the tail of the arrow is 
the one at dm ed with the one at the head, but 
one at the al is not most closely associated with 


Neurotic patients rate Me most like the 
Hospital-Created positions. Psychotic pa- 
tients rate Me in the cluster with Mental 
Patient. 

Patients who are ready to leave the hospital 
to return to society at large should have a 
self-conception congruent with the demands 
associated with usual adult positions. Those 
whose ego organization is so fragmented that 
they are unable to adapt to an open hospital 
environment may be expected to have a 
self-conception congruent with positions con- 
sidered rather regressed, such as that of 
mental patient. Three patients who were 
discharged within 2 months after testing and 
returned successfully to college, and three 
patients who were unable to adapt to the 
demands of the hospital (within 2 months of 
testing two were transferred to a closed 
psychiatric hospital and one committed 
suicide) were selected for separate analysis of 
their ratings. The typal analysis of the two 
D score matrices derived from the mean 
scores on all scales for the two groups—the 
Successful Group and the Unsuccessful 
Group—reveals the data in Figure 3. 

For the Successful Group the word Me is 
placed in the cluster with the usual adult 
positions, and is rated most like Student in the 
Activities Program with Student rated most 
like Me. In addition, there is a separate cluster 
made up of positions in the hospital these 
patients are about to leave. The Unsuccessful 
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Successful Group: 1 
Typel Student — me = josey dom + Citizen — Adult — Worker 


pg etat Ries, — Workerin the _. Citizen in the 
Mental patient 7 Common Work Program ~ Riggs Community 


Unsuccessful Group: 


TypeI Worker — Adult = Citizen — Student 
Typell  Studentinthe _. Worker in the 
Activities Program ~~ Common Work Program 


"Type III me — Mental Patient — er s M < Patient at Riggs 


Fic. 3. Clusters obtained for the successful and the unsuccessful groups. 


"Type A (necessary-unnecessary) — (important-unimportant) += (real-unreal) 

Type B (kind-cruel) — (therapeutic-untherapeutic) = (good-bad) 
(excitable-calm) — (beautiful-ugly) (active-passive) 

Type C (responsible-irresponsible) — (strong-weak) — (hard-soft) 


(masculine-feminine) = (fast-slow) «— (well-sick) 
Fis. 4. Linkage analysis for all subjects. 


TABLE 2 TABLE 3 
COORDINATES or ADJECTIVAL SCALES IN MEANS or RESPONSIBILITY DIMENSION 
THE THREE Tyres FOR SOCIAL Posrrrons* 
m————————————————— Áo 
Scales Coordinate values Pa- | Patients and 
—_ Social position Staffs tients | tiir 
Type A 
Important-Unimportant 6.89 
ecessary-Unnecessary 6.45 Usual Adult positions 
Real-Unreal 6.37 Student 
eB Worker 2.81 | 3.03 | 2.92 
d-Bad 4.32 Citizen 
Active-Passive 3.39 Adult 2.71 | 3.10 2.90 
Therapeutic-Untherapeutic 2.85 Hospital-Created positions 
ind-Cruel 1.46 Worker in the Common 
Beautiful-Ugly 1.81 Work Program 
Excitable-Calm .28 Student in the 3.45 | 3.86 | 3.65 
Type c Activities Program ^ $ d 
esponsible-Irresponsible 4.22 Citizen in the Riggs 
Strong-Weak 2.86 Community 
Hard-Soft 2.56 Adolescent 3.61 | 3.95 3.78 
Masculine-Feminine 1.32 Child 4.03 | 3.97 4.00 
Fast-Slow 1.15 Mental Patient positions 
Well-Sick 1.63 Patient at Riggs 4.70 | 4.52 | 4.61 
spe ERR Mental Patient 
Baby 4.98 | 5.25 5 


Group rates Me most as they do Mental 
Patient. 

In order to determine what dimensions were 
present in the adjectival Scales, a D score 
matrix based on the data from all Subjects 
was calculated which compared each adjectival 
scale with every other one. 

The elementary linkage analysis of this 
matrix revealed three types (Figure 4), 

In addition D? factor analyses (Osgood, 
Suci, & Tannenbaum, 1957, pp. 332-335) 
were performed on the means obtained on each 
adjectival scale in each type in order to get 


* Score of 1 refers to maximum responsibility; a score of 7 t0 
minimum responsibility. sual 

P An analysis of variance of responsibility means for U z 
Adult positions, Hospital-Created positions, and Mental Patient 
Positions is significant beyond the .001 level. 


the coordinates. The variables in each tyPé 
formed separate matrices and D? factor 
analyses were done with each matrix, Co 
ordinates are analogous to factor loadings 
the higher the coordinate value of a variable 
in each typal structure, the more closely 
related is that variable with the type. AS 
shown in Table 2, Type A has equally hig 
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coordinate values for the three scales im- 
portant-unimportant, real-unreal, and neces- 
- sary-unnecessary. Type B is most highly 
. ]oaded by the scales good-bad, active-passive, 

and therapeutic-untherapeutic. The scales 

responsible-irresponsible, strong-weak, and 
hard-soft have the highest coordinate values 
in Type C, which we tentatively call a re- 
sponsibility dimension. 
It is reasonable to assume that the degree of 
responsibility that is expected of a person is in 
- some measure directly related to his age level. 

Does the responsibility dimension found in 

the factor analysis show increasing responsi- 

bility associated with more mature ages? 

Table 3 indicates that the scores on the re- 

sponsibility dimension for adult, adolescent, 

child, and baby are in the predicted direction. 

As Table 3 also shows, the similarity in 

ratings of usual adult positions with the rating 

of the word adult, the similarity in the ratings 
of Hospital-Created positions with those of 
the words adolescent and child, and the 
similarity between ratings of mental patient 

Positions and baby are striking. The difference 

in the responses to the three categories of 

positions is statistically significant by F test 
beyond the .001 level. 


DISCUSSION 


‘The three usual adult positions (Worker, 
Ren Student) elicit very similar responses 
rom both patients and personnel. The re- 
vem of patients and staff for the Hospital- 

teated positions (Worker in the Common 

sg Program, Student in the Activities 

"iie Citizen in the Riggs Community) 
ied SO very similar to each other. A similar 
NU emerges from. ratings relating to the 
Men gest positions (Patient at Riggs, 
ite a atient). From these findings we may 
ins » there are similar kinds of expecta- 
Eur M d aiid valuations associ- 
CS positions in each cluster. Those 
fe n with usual adult positions differ less 
pisi: US edet with Hospital-Created 
Tied li an they differ from those con- 
B being a mental patient. From the 
oo On these adjectival scales we may 
iba et Measure of the role demands made 
studied CU filing the various positions 
patients (al ded b ims TATP UPO oe 
ong the dimensions measured) seem 
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to fall somewhere between those made upon 
adults outside of a hospital and those made 
upon persons considered to be mental patients. 
Are the role demands associated with the 
Hospital-Created positions congruent with the 
conception our patients have of their own 
capacity to meet these demands? Neurotic 
patients rate the word Me much as they rate 
Worker in the Common Work Program and 
Student in the Activities Program—positions 
which are near-adult. Psychotic patients place 
Me in the cluster with Mental Patient posi- 
tions, rating it most like Citizen in the Riggs 
Community. We may infer that the self-con- 
ception of neurotic patients is more nearly 
adult than is that of psychotic patients. Both 
patient groups place Citizen in the Riggs 
Community in the cluster with Mental Patient 
positions. The minimal requirements of a 
citizen in the Riggs Community, besides the 
work expectation, are little more than attend- 
ance at regular group meetings, occasionally 
taking one’s turn at preparing snacks for the 
patient group, and abiding by the accepted 
standards of behavior of the hospital. In other 
words, a fairly passive adjustment to the 
community is sufficient to be an acceptable 
citizen of the hospital. The position of citizen 
can, however, be filled in a very active, cre- 
ative, and adaptive manner—the role has a 
wide range of behaviors that would satis- 
factorily fit it. The psychotic patients, who 
respond to Me as they respond to Citizen in 
the Riggs Community, seem to view them- 
selves as capable of meeting some measure of 
social demand, but not fully capable of meet- 
ing the demands that they associate with the 
other Hospital-Created positions. 
Concerning the reciprocal relation between 
patient and hospital community, it might be 
helpful to note the process of mutual selection 
that results from the admission policy and 
from the voluntary nature of the hospital. 
Only those patients who are judged capable of 
adapting to an open hospital are admitted. 
Also, since patients enter and remain in the 
hospital voluntarily, they, in turn, select it as 
the place that fits them. The correspondence 
between the social system of a voluntary 


hospital and the degree of ego intactness of 
its patients is due, in large ure, to 
process of mutual selection. 

There are occasions when org 
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tion of a patient proves to have become so 
well integrated or so fragmented that patient 
and hospital no longer fit each other. In fact, 
when treatment is successful, integration of 
ego functions is expected to develop to a 
degree that the role opportunities within the 
hospital are too limited to dovetail satis- 
factorily with newly developed adult self- 
conceptions. Thus, the person must leave the 
hospital to be able to go to school, work, 
marry, etc., where the social structure of the 
community allows a further range of role 
opportunities to correspond with the new 
self-conception. 

From the analysis of the D score matrix of 
three patients who demonstrated a definite 
reintegration of ego functions by their readiness 
to return to college (among other indications), 
it is clear that their self-conception was most 
congruent with their conception of a student 
in or out of the hospital. Since these patients 
were tested shortly before they left the hos- 
pital, the results may reflect a point in a 
changing self-conception from being a student 
in a hospital to being a student in college, 
Such a result also supports the consideration 
that realistic congruence of one’s self-concep- 
tion with usual adult roles cannot occur while 
a person is hospitalized. 

When a patient becomes manifestly psy- 
chotic or for other reasons becomes untreat- 
able, and begins to be unable to meet even 
minimal demands and expectations, the social 
demands are softened and eased temporarily, 
but never given up entirely. If, despite this 
easing, he is still unable to meet the com- 
munity's expectations, administrative mecha- 
nisms are set in motion to transfer him out of 
the hospital. This is done partly because his 
deviance threatens the hospital social struc- 
ture, and partly because he himself cannot 
benefit from the treatment offered if he does 
not also have a minimally adequate role in the 
social setting. Three patients who were un- 
treatable in the open hospital where the study 
was conducted had self-conceptions most 
congruent with that of being a mental patient, 
Their inability to adapt to the hospital com- 
munity coincided with a conception of them- 
selves (not necessarily entirely in awareness) 
as rather regressed and mentally ill. 

In this study we focused on only one sig- 

nificant dimension of role demand—that 
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related to responsibility. A decreasing degree 
of responsibility is expected from persons 
filling each group of positions from adult 
positions to Hospital-Created positions down 
to Mental Patient positions. If a person is. 
thought of as a mental patient, he seems to be 
considered no more responsible than a baby, 
If he is an active participant in the various. 
hospital programs, he is considered moderately | 
responsible—about as responsible as an adoles- 
cent. Only when occupying certain active 
nonpatient positions is a person considered to 
be as responsible as an adult. We can infer 
from the data also that filling some kinds of | 
positions excludes a person from other posi- f 
tions, For example, to be an irresponsible 
mental patient excludes one from the possi- | 
bility of occupying such adult positions as 
worker or student; on the other hand, if oneis | 
occupying such adult positions, it is unlikely 
that he would be considered mentally ill, 
Therefore, if a person is considered to be 4 
mental patient, and the role demands upon 
him are to be a mental patient exclusively—be 
irresponsible, be "crazy," etc.—he has m 
opportunity to explore or learn to cope with | 
more adult demands, The Hospital-Created | 
positions appear to provide near-adult de 
mands, expectations of responsibility, and 
opportunities to function as a valued member | 
of the community. 2a 
If a patient works in some of the hospital | 
programs, especially the Common Work | 
Program, he seems to be considered as ap: 
proaching the status of adult; that is, he is not 
merely a mental patient, but someone capable 
of assuming responsibility. Failure to partici- 
pate in the Common Work Program does not 
remove the role demand that he work as long 
as he is a member of the community. If there 
are sufficient intact ego functions correspond: 
ing with the demand, they will be supporte | 
by it, and the patient will be able to perform 
in that work situation, Without the soc! 
demand, sufficient internal forces would be 
less likely to be mobilized in the hospitali 
patient to enable him to participate in ? 
consistent and sustained way. , 
The results of this study confirm the 1- 
pression gained from clinical observations 
that the developments, thus far, in 
hospital community have provided 
opportunities for patients that allow an 


role 
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- support behavior that is socially responsible, 
- competent, and valued. The kind of hospital 
- organization described above seems to ap- 
“proach the goal of providing a subsociety 
“within which a temporary functional balance 
j may exist between a patient's ego organization 
"and the social structure of the hospital. As 
“reorganizations occur in the ego capacities of 
“the person, movement from the hospital 
society to society at large is possible without 
"vast readjustment in the patient's self-con- 
i ception or utter unpreparedness for the social 
demands that he will encounter on leaving 
the hospital. 


SUMMARY 


A study of role demand and self-conception 
using the semantic differential technique was 

- conducted in a psychiatric hospital organized 
as a therapeutic community. Both patients 
and staff clearly differentiated three cate- 
 Bories of social positions: usual adult social 
positions, positions unique for the hospital 
studied, and those related to being a mental 
- patient, Staff members’ rating of the word Me 
| was most similar to their ratings of usual adult 
positions; neurotic patients' rating of Me was 
Most similar to their ratings of Hospital- 

| Created positions; rating of Me by psychotic 
patients was most similar to their ratings of à 
rad of positions that included Mental 
ae From these results we infer that the 
ee tion of staff members is most 
P Eon with the role demands of usual 
ens, the self-conception of neurotic 
cong most congruent with the role demands 
flat d pns unique for the hospital, and 
iem ae otic patients most congruent with 

e demands associated with being a men- 
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tal patient. On a responsibility dimension 
there were striking similarities between the 
ratings of usual adult positions and adult, 
between the ratings of Hospital-Created 
positions and adolescent and child, and 
between the ratings of Mental Patient posi- 
tions and baby. 

The results were discussed in relation to the 
reciprocation between the degree of the pa- 
tients’ ego intactness and the social structure 
of the hospital studied. 
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PRIMACY EFFECTS IN PERSONALITY IMPRESSION FORMATION! 


NORMAN H. ANDERSON AND ALFRED A. BARRIOS 
University of California, Los Angeles 


TUDIES in communication research have 
reached no consensus as to the effect of 
order of presentation. The pro-con vs. 

con-pro paradigm leads sometimes to primacy 
effects (greater effect of the first communica- 
tion) and sometimes to recency effects. Al- 
though some progress has been made in the 
study of the relevant variables (Anderson, 
1959; Hovland et al, 1957a; Luchins, 1958; 
Miller & Campbell, 1959), the situation is far 
from clear. ZR 

One limitation of most work is the small 
number, typically two, of communications 
involved. Two experiments (Anderson, 1959; 
Weld & Roff, 1938) have used reasonably long 
sequences of communications on a single topic. 
However, no work has been done to assess 
practice effects over a sequence of communica- 
tions on separate topics. Both lines of attack 
are desirable, not only because they simulate 
everyday situations more closely, but also for 
theoretical reasons. Thus, Hovland (1957b) 
has emphasized some of the ways in which 
prior familiarity with the topic might influ- 
ence reception and acceptance of the com- 
munications. Analogous considerations would 
apply when a sequence of different topics is 
used, and various practice effects might also 
be important. The two cited paradigms would 
thus be expected to be useful in bringing the 
relevant psychological processes under closer 
experimental scrutiny. 

The present experiment was designed to 
study order effects over a sequence of com- 
munications on separate issues. The classic 
paper of Asch (1946) suggested the use of 
personality adjectives in order to get a large 
body of relatively homogeneous material, 
Asch's results, as well as those of Luchins 
(1957, 1958), indicated that under the specific 
experimental conditions employed here, pri- 
macy effects would be obtained for the initial 
trials. However, it was thought that with 
continued practice the primacy effect would 
decrease and perhaps become a recency effect, 


1 This research was supported by Grant M-3502(A) 
from the National Institute of Mental Health, 
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METHOD 


Subjects. Volunteers who were fulfilling a dig 
requirement in introductory psychology were assigned 
randomly to the various conditions within each exper 
ment. Sex was balanced in Experiment 1 but not in 
Experiment 2. The Ns for the two experiments wer 
64 and 24, respectively. Subjects were examined it 
dividually. 

Stimuli. A set of 328 adjectives descriptive of 
personality characteristics were picked on the basis d 
familiarity by the experimenter and were then given’ 
rough scaling based on the responses of 10 judges, The 
judges rated each adjective on a six-point Favorable 
Unfavorable scale which had an additional “X” ratin 
for unfamiliar words. The 25 adjectives that received 
an X were not used, The mean ratings of the remaining 
adjectives ranged from 0.1 to 4.8, with a rating ati 
corresponding to Very Favorable. The 48 adjectives it 
the range 4.0-4.3 will be called H; the 48 adjectivesia 
the range 1.2-2.5 will be called L. Table 2 shows sont 
of the adjectives that were used. 

Procedure. Subjects were told that they would bt 
read a number of sets of adjectives, each set describing 
a different person, and that they should try to form a 
impression of the kind of person the adjectives de 
scribed. The instructions indicated that they were ld 
think of the six adjectives as having been given by 
different people who knew that person well. After tht 
experimenter read each set, the subject indicated bis 
response numerically in terms+of a rating scale typ 
on a card in front of him. This eight-step scale range 
from +4 to —4, the neutral response being disallowed: i 
An identifying label was placed by each number using ^ 
the various combinations of Favorable and Uem 
able, with Highly, Considerably, Moderately, 3 
Slightly as modifiers. d 

Experiment 1. Each subject judged 61 or 62 sets 
six adjectives each. One of these sets was the same 9 
that used by Asch (1946, Experiment VI). It was a 
as the first set for half the subjects, and as the n 
for the other half. Within each of these halves, the £ 
was given in forward order to half the subjects an 
reverse order to the other half. Those subjects Wo 
received this set as their first set also received it as the! 
last set. "i 

The remaining sets were of five types. Type dl 
consisted of three adjectives in the H range Ur 
by three adjectives in the L range of scale values: d 
LH had three L adjectives followed by three H wi 
tives. Type GD was a sequence of six s 1 
gradually descending from H to L in scale value. m 
GA gradually ascended from L to H. Type R cons! bit 
of six adjectives chosen similarly to the other ues 
arranged in random order within a set. Twelve 5e 
each type were constructed randomly subject to B 
restriction that no adjective appear twice in nd 
and that each adjective be used about equally ofte 


x ordered blocks of 10 sets each were constructed 
randomly subject to the restriction that two sets of 
each type appear in each block. Four sequences of 60 
were obtained using four different permutations of 
six blocks. From these four sequences, four addi- 
equences were formed by reversing the order of 
ectives in the 60 sets. The GD, GA, and R sets 
completely reversed in order. However, the H and 
ibsets in the HL and LH sets were simply inter- 
ed without disturbing the order of the three 
ives within either subset. 
he adjectives of each set were read by the experi- 
enter at an approximate rate of one adjective each 3 
nds, Three trials were given each minute until all 
sets had been judged. 
eriment 2. Each subject judged 90 sets of two 
ves each, 30 HL, 30 LH, 15 LL, and 15 HH. 
ges were constructed as in Experiment 1 and 
need for type in ordered blocks of 30 sets each. 
subjects received the sets in one sequence; the 
lier half received the same sequence but with the 
T of the two adjectives in the LH and in the HL 


| The main independent variable was the time between 
à reading of the first and second adjectives of a set. 

h subject judged one block of 30 sets at each of 
time intervals: 0, 2, and 4 seconds. All six possible 
ices of time intervals were used in a latin square 
Successive blocks were separated by 1 minute 
ring which the subject was told that a new tempo 
Would be used for the following block. 


Female: GD-GA 
Female: HL-LH 


X Male: GD-GA 
X Male: HL-LH 


1 2 3 

i vu 20 TRIAL BLOCKS 

type of set ean primacy scores as a function of sex, 
E » and trial blocks, (Experiment 1.) 
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TABLE 1 


ANALYSIS OF VARIANCE OF ORDER EFFECT 
SCORES SUMMED OVER ALL 


Source df F 
Mean 1 48.38* 
Sex 1 2.23" 
Error (b) . 62 (183.81)* 
Type s 1 .02 
Sex X Type 1 8.03* 
Error (w) 62 (83.77) 


Note.—Error mean squares in parentheses. 
"p< .05. 


RESULTS 


Experiment 1. If an HL set produces a . 
higher response than the corresponding LH 
set, then the first subset of three adjectives 
had a stronger effect than the second subset, 
within at least one of the two sets. Positive 
HL-LH differences, and positive GD-GA 
differences thus represent primacy effects. - 
'These two difference scores were computed 
for each subject using the several adjectives of 
each type in a given block of trials. 

The results are shown in Figure 1 which 
plots mean difference scores for the four main 
experimental conditions as a function of 
trial blocks. It is seen that there is a strong 
primacy effect which equals 0.69 averaged 
over all conditions. Although the effect de- 
creases somewhat over trials, it does not 
appear to be approaching zero. Females show 
more primacy than males but this sex difference 
resides largely in the HL-LH sets. Indeed, the 
sexes are not too far apart on the GD-GA 
curves and, compared to these, the males are 
lower and the females are higher on the HL- 
LH curves. 

The analysis of these data was performed 
on the difference scores summed over all 60 
trials and is given in Table 1. Since a differ- 
ence score was used, the F for Mean shows a 
significant primacy effect. Although the main 
effect of Sex is not significant, the significant 
Sex X Type interaction verifies the comments 
on sex differences made in the preceding 
paragraph. A trend test showed that the 
decline of the primacy effect with trials was 
significant (F = 12.27, df = 1/60). 

The absolute responses, as distinguished 
from the difference scores, are also of interest. 
The mean response over all descending sets 
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TABLE 2 
Mean RESPONSE TO Srx Sets SHOWING STRONGEST PRIMACY EFFECTS 
Type Words Descending Ascending iat | 
GD smart, artistic, sentimental, cool, awkward, faultfinding 1.38 —0.72 2.10 
HL determined, tolerant, gentle, stubborn, forgetful, tricky 1.62 —0.16 1.78 
GD orderly, entertaining, humble, cool, calculating, moody 1.84 0.44 1.40 
HL efficient, scholarly, smart, crafty, faultfinding, unruly 0.66 —0.66 1.32 | 
GD warm, independent, optimistic, reserved, tough, faultfinding 1.69 0.41 1.28 
HL patriotic, perfective, generous, softhearted, uninhibited, humorless 1.84 0.59 1.25 


(HL and GD) was 0.94, and the mean re- 
sponse over all ascending sets (LH and GA) 
was 0.25. This latter value is quite close to 
the mean response of 0.32 for the R sets. 
Since the adjectives for the R sets were from 
the same pool as for the other sets, the asym- 
metry of these results suggests that the pri- 
macy effect has its source in the initial ad- 
jectives of the descending sets. However, this 
suggestion must be viewed with caution 
since, although the HL-LH and GD-GA order 
effects were almost identical, the mean response 
to GA and LH sets was 0.48 and 0.02, respec- 
tively. 

Results from the set used by Asch (1946) 
were consistent with the above. When this set 
was given first, the means were 0.87 and 
—0.94 for the GD and GA orders, respec- 
tively. The primacy effect of 1.81 was sig- 
nificant (F — 6.87, df — 1/30). The primacy 
effect was just half as large when the set was 
given last and this was nonsignificant, even 
when all 64 subjects were used. 

Of the 48 critical sets used, all but 5 gave a 
primacy effect, The descending versions of the 
6 sets which showed the greatest overall mean 
primacy effect are listed in Table 2. 

Experiment 2. The overall mean Scores for 
the HH, HL, LH, and LL sets were 2.68, 0.80, 
0.76, and —1.27, respectively. The analysis of 
variance was performed on the HL-LH 
Scores computed for the three successive 
blocks of 30 trials. Neither time interval nor 
trial blocks approached significance. The 
957% confidence interval for the overall mean 
order effect was 0.04 + 0.12. It thus seems 
reasonable to conclude that the true order 

effect is rather small when only two adjectives 
are used. Supporting evidence is found by 
comparing Error(b) to Error(w) which tests 
for individual differences. The resulting 


F (df = 17/36) was 1.19. Because of the great 
power of the F test with this many df, it 
seems reasonable to conclude that true in 
dividual differences in the order effect are alu; 
quite small when only two adjectives are used. | 


Discussion 


The primacy effect that was found is strik: 
ing. Over the first block of 10 trials, the mean 
difference in response obtained from simply 
reversing the order of the six adjectives wa 
1.12. This is a respectable part of the eight 
point scale, especially since few subjects usel 
the full scale range. Moreover, the result isi 
not due to some peculiarity of the particular 
sets used since they were constructed rai 
domly and since a primacy effect was observel | 
in 43 of the 48 sets. However, it is not cleat | 
how far the results may be generalized beyond | 
the experimental situation used here. The | 
need for some caution in this respect is i| 
dicated by the work of Luchins (1957, 1958) 
who finds that the order effect changes wit 
variations in the procedure. 

The decline in primacy over the latet 
trials has a number of possible causes. It may | 
be that, despite the use of the random stis. 
the pattern of good and bad words in tht 
remaining sets brings about an increas! 
tendency to take account of all the words i8 
each set. This possibility could perhaps i 
tested by reducing the relative frequency 
the critical sets. A progressive loss of inter® 
in the task might also influence the pin 
effect, a hypothesis which would be test?» f 
by inserting a rest and motivating instit 
tions partway through the session. A M 
possibility is that adaptation to the exp% 
mental situation and practice in integr? E 
the material are the governing factors. EA 
the decrement in primacy found here indic? 
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that caution should be used in generalizing 
4 results of studies employing only one or two 
- communications. 
The results of an earlier experiment (Ander- 
| son, 1959) were interpreted as suggesting the 
existence of two opinion components. The 
basal component, once formed, was quite 
resistant to change whereas the surface 
component was readily influenced by the 
' successive communications. The present data 
- would be consistent with the two-component 
hypothesis if it were assumed that a strong 
basal component was formed over the first 
three but not the last three adjectives within 
a given set. Thus for an HL set, the surface 
- component induced by the three H adjectives 
would be approximately canceled by the 
surface component induced by the three L 
adjectives. However, the basal component 
induced by the three H adjectives would 
persist to produce a response toward the 
Favorable end of the scale. For the correspond- 
- ing LH set, the basal component would tend 
- to produce a response toward the Unfavorable 
| end of the scale, and the difference between 
the two would constitute the primacy effect. 
i y The two-component hypothesis is thus not 
‘inconsistent with the present data but other 
explanations cannot be ruled out. In particu- 
lar, the primacy effect may result from a 
Progressive decrease in attention over the 
adjectives of a given set. Attention decrement 
could Presumably be reduced by more strin- 
Bent control or at least tested by asking sub- 
ects to recall the adjectives (McGuire, 1957). 
nh should also be noted that in the present 
ituation the basal component would be 
) Bom in effect to and perhaps explainable 
Tn of Luchins' (1957) Einstellung, or 
a ah 946) concept of direction. However, 
Um E eae interpretation suggests a 
Foro ch the other two formulations do not. 
oF Consider the order effect paradigm defined 
| a the two sets, HLH’L’ vs. HLL’H’, in 
| E m six adjectives are the same in 
qu onn the last six differ only in order 
| Bini ation. According to hypothesis, only 
list six a Component is affected over the 
3 ve Es and hence (Anderson, 1959) 
Al though um should yield a recency effect. 
rather S prediction must be considered 
Speculative, the use of this and similar 
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paradigms should prove valuable in the study 
of opinion and impression formation. 

Experiment 2 was designed on the assump- 
tion that a considerable crystallization of 
impression had occurred in the first half of the 
sets in Experiment 1, It was thought that the 
primacy effect would increase as increased. 
time was allowed for the first adjective to sink 
in. Even though the reduction from six to two 
adjectives was expected to decrease the overall 
effect, the finding of negligible order effects in 
Experiment 2 came as a surprise in view of the 
strong primacy obtained in Experiment 1. 
The results of Experiment 2 thus give no 
evident support to Asch's (1946) hypothesis 
that the first adjective sets up a directed 
impression in terms of which the later adjec- 
tives are interpreted. However, the combined 
results of the two experiments give some 
basis for speculating that the critical events 
leading to primacy in Experiment 1 occurred 
at the second and third adjectives. 


SUMMARY 

Sets of Favorable and Unfavorable adjec- 
tives descriptive of general personality char- 
acteristics were constructed so as to test for 
effect of order of presentation of the adjectives. 
The sets were read to subjects who were asked 
how favorable an impression they had of the 
person described by the set of adjectives. 

In Experiment 1, 64 subjects each judged 
some 60 sets of six adjectives each. Strong 
primacy effects were found although there 
was some decrement over trials. Females 
showed greater primacy than males for sets in 
which the change from Favorable to Un- 
favorable (and vice versa) was abrupt. How- 
ever, there was little sex difference when the 
change was gradual. ; 

In Experiment 2, 24 subjects each judged 
90 sets of two adjectives each, with intervals 
of 0, 2, and 4 seconds between the adjectives. 
Time interval had no observable effect and 
order of presentation was also nonsignificant. 
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graph (Edwards, 1957b) the results of 
esearch bearing upon the problem of 
ial desirability variable in personality 
ent. I interpreted the results as sup- 
the hypothesis that scores on many 


ences in the tendency of subjects to give 
ly desirable responses to items in the 
3. Part of the evidence bearing upon this 
pothesis consisted of the correlations ob- 
ined between Minnesota Multiphasic Per- 
mality Inventory (MMPI) scales and scores 
| other scale which I called the Social 
esirability (SD) scale. 
twill be clear to anyone who reads the 
ograph that I in no way restricted the 
desirability hypothesis to the MMPI 
calés—nor was the evidence confined only 
to the correlations between the MMPI scales 
And the SD scale. It is, however, my interpre- 
tion of scores on the SD scale as measuring 
tendency to give socially desirable re- 
es and of the correlations of the SD scale 
ith the MMPT scales as supporting the social 
S BS hypothesis that has been ques- 
and it is for this reason that my paper 
18 primarily concerned with the MMPI dd 
es. 
‘has been suggested, for example, that, 
Mary to my interpretation, the SD scale 
t a measure of the tendency to give 
ally desirable responses, but is instead a 
CRUS of the tendency to acquiesce, that 
E. tendency to give True responses to 
ems in the scale. It has also been sug- 
that many of the MMPI scales reflect 
BE to acquiesce and that the cor- 
E etween the SD scale and the MMPI 
> can, therefore, be accounted for by 
ual differences in the tendency of sub- 


to give acqui i 
er quiescent responses to the items 
the scales, E 


d 
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DESIRABILITY OR ACQUIESCENCE IN THE MMPI? A CASE 
STUDY WITH THE SD SCALE: 


ALLEN L. EDWARDS 
University of Washington 


The question before us, then, is a controver- 
sial one. I propose to review, briefly, the his- 
tory of the controversy by first stating the 
argument for the social desirability hypothesis 
and then the argument for the acquiescence 
hypothesis. I shall then present some new 
evidence which bears upon these two alterna- 
tive hypotheses. 


SOCIAL DESIRABILITY HYPOTHESIS 


Relationship between Probability of Item En- 
dorsement and Social Desirability Scale Value 


My first research (Edwards, 1953b) on the 
subject of social desirability was concerned 
with the relationship between the probability 
of endorsement of a personality statement and 
the social desirability scale value of the state- 
ment. This study is particularly relevant to 
the points I shall develop later with respect 
to both the acquiescence and social desirability 
hypotheses and I shall, therefore, describe the 
procedures I followed as well as the results. 

As is well known, psychological scaling 
methods have been used for many years to 
obtain scale values for statements of opinion. 
For example, if we have a set of statements of 
opinion relating to, say, capital punishment, 
we can have judges rate how favorable or 
unfavorable they believe each statement to 
be with respect to capital punishment. Psy- 
chological scaling methods can then be applied 
to the distributions of ratings to obtain a scale 
value for each statement. It is then possible 
to order the statements on a continuum ranging 
from highly unfavorable, through neutral, 
to highly favorable opinions. 

It was my belief that the same methods 
could be applied to obtain social desirability 
scale values for personality statements. Con- 
sequently, I asked judges to rate the degree 
of social undesirability or social desirability 
of the behavior, trait, or characteristic rep- 
resented by each personality statement in a 
set of 140. I then used the method of succes- 
sive intervals to obtain a social desirability 
scale value for each statement. The scale 
values order the statements on a continuum 
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ranging from highly socially undesirable, 
through neutral, to highly socially desirable 
characteristics. I call this continuum the 
social desirability continuum. 

The statements were then printed in the 
form of a personality inventory and adminis- 
tered to a new group of subjects under stand- 
ard instructions to describe themselves. For 
this group the proportion endorsing or answer- 
ing True to each statement was found. This 
proportion I refer to as the probability of item 
endorsement. If we now plot the probability 
of item endorsement against the social desir- 
ability scale value, we can see whether there 
is any relationship between the two variables. 

The relationship that I found is shown in 
Figure 1. The X or horizontal axis is the social 
desirability continuum, with the left end re- 
presenting statements with socially undesir- 
able scale values and the right end statements 
with socially desirable scale values. The Y 
or vertical axis is the probability of item en- 
dorsement. It is evident that the probability 
of item endorsement, or of a True response, 
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is a linear increasing function of the social de. 
sirability scale value. The more socially desir. 
able a statement is, the greater is the proba, 
bility of item endorsement. The product 
moment correlation between the two variables 
is .87. 

I first presented the results shown in Figure 
1 at a meeting of the Western Psychological 
Association in 1952. Since that time the re 
search has been replicated many times (Cowen 
& Tongas, 1959; Edwards, 1955, 1957a, 1959; 
Hanley, 1956; Hillmer, 1958; Kenny, 1956; 
Taylor, 1959; Wiggins & Rumrill, 1959; 
Wright, 1957). These replications have in 
volved variations in the scaling method used 
to obtain the social desirability scale values, 
variations in the technique used to obtain the 
descriptions of subjects, variations in the set 
of personality items, and a range of different 
groups of subjects. In each instance the results 
have been consistent with those I originally 
reported. Thus, I believe it is possible to state 
that the relationship between probability o 
item endorsement and social desirability scale 
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Fic. 1. The relationship between 
duced from Edwards, 1953b) 


probability of item endorsement and social desirability scale value. (ReP® 


"yalue is a general phenomenon rather than an 
‘isolated result obtained under rather restricted 
- conditions. 


Social Desirability Scale 


Consider now one way in which we might 
Key a given set of personality statements. 
Suppose, for example, that if a statement has 
“a socially desirable scale value we key the 
‘True response, whereas if the statement has 
a socially undesirable scale value we key the 
False response. The resulting scale would be 
ofa kind that I have called a social desirability 
“or SD scale. An SD scale, in other words, is 
“one in which all of the items are keyed for 
Socially desirable responses. A socially desirable 
Tesponse is defined as a True response to an 
item with a socially desirable scale value or a 
False response to an item with a socially 
Undesirable scale value. The score on an SD 
Scale is the number of socially desirable re- 
Sponses that have been given to the set of 
items comprising the scale. 

The first SD scale I constructed (Edwards, 
1953a) consisted of 79 items from the MMPI. 
These 79 items were subsequently analyzed 
E and teduced to the 39 items which best differ- 
| Entated between a high and a low scoring 
_ Broup (Edwards, 1957b). There is evidence 
(Edwards, 1957b) to show that the results 
: ned with either the 79-item or the 39-item 
Scale are comparable and the distinction 
_ Between these two SD scales need not concern 
P However, with but few exceptions, the 

search I shall be reporting upon was done 
With the 39-item SD scale. 
the T pete out that all of the items in 
BL are keyed for socially desirable 
E 5. we consider the question of what 
D. Scale is measuring, then, it seems to 
“a Eu pet statement to be made is that 
E... es a measure of the tendency of sub- 
id OMM Socially desirable responses in 
Fins ordi lon under the standard instruc- 
E. M Used with personality inven- 
Bus etermine the degree to which other 
this Eu Scales might also be measuring 
oit endency, I correlated scores on the 
eem SD scale with a ber of oth 
Personality Sai number of other 
etal, the es (Edwards, 1953a). In gen- 
Personali orrelations I obtained with other 
i Y scales of the True-False type were 
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of substantial magnitude. Furthermore, the 
signs of the correlations were in the direction 
predicted by the social desirability hypothesis. 
For example, if a high score on a given scale 
indicated a socially undesirable trait, that is, 
if it was necessary to make a large number of 
socially undesirable responses to obtain a high 
score, then the correlation of this scale with the 
SD scale was negative. On the other hand, if a 
high score on a given scale indicated a socially 
desirable trait, that is, if it was necessary to 
make a large number of socially desirable 
responses to obtain a high score, then the cor- 
relation of this scale with the SD scale was 
positive. These results indicated to me that if 
the SD scale was measuring the tendency to 
give socially desirable responses, as I believed 
it was, then the scores on the other scales I 
had investigated were, to a remarkable degree, 
reflecting the same tendency. 

Correlations subsequently obtained by 
Fordyce (1956) between the 79-item SD scale 
and the clinical and validity scales of the 
MMPI and by Edwards, Heathers, and 
Fordyce (1960) between the 39-item SD 
scale and 11 “new” MMPI scales, described 
by Hathaway and Briggs (1957), are also con- 
sistent with the social desirability hypothesis. 


ACQUIESCENCE HYPOTHESIS 


Two articles by Cronbach (1946, 1950) on 
response sets have done much to arouse in- 
terest in the particular response set which 
Cronbach called, as had Lentz (1938) before 
him, acquiescence. Acquiescence, according to 
Cronbach, refers to the tendency of a subject 
to agree with or respond True to a test item 
when he is in doubt as to the appropriate or 
correct response. It has been suggested that 
items in personality inventories are of a kind 
likely to evoke doubts as to the appropriate 
response since subjects are required to inter- 
pret the content of the item along with the 
meaning of such terms as few, seldom, fre- 
quently, occasionally, sometimes, and often 
(Allport, 1937; Benton, 1935; Watson, 1959). 
If the interpretative ambiguity ofa personality 
item does produce doubts as to the appropriate 
response, then the item is also likely to evoke 
acquiescent tendencies. Thus, if we have a 
personality scale in which a majority of the 
items are keyed False, and if these items also 
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arouse doubts as to the appropriate response, 
then it follows that a highly acquiescent sub- 
ject is apt to respond True to these keyed 
False items. The highly acquiescent subject, 
in other words, may be expected to obtain 
a lower score on the scale than he otherwise 
would if acquiescence were not involved. Thus, 
instead of measuring the trait we think the 
scale is measuring, the scores may be reflecting 
the differing acquiescent tendencies of the 
subjects. 


ACQUIESCENCE AND THE SD SCALE 


The argument for the acquiescence hypoth- 
esis, as I have presented it, can be applied 
to the 39-item SD scale in which 30 of the 
39 keyed socially desirable responses are False. 
Assuming the argument to be valid, we would 
expect the more acquiescent subjects to ob- 
tain low scores and the less acquiescent sub- 
jects to obtain high scores on the scale. It is a 
question of some importance, therefore, to 
ask whether a high score on the SD scale 
is primarily a measure of the tendency to give 
socially desirable responses or of the tendency 
to be nonacquiescent. I was quite aware of this 
possible double interpretation of scores on the 
SD scale, and my reasons for rejecting the 
acquiescence interpretation in favor of the 
social desirability interpretation are, I believe, 
clearly stated in my monograph. 

Let us look once again at Figure 1. Suppose 
we take only items from the socially desirable 
end of the continuum and key all of these 
items for the True response. Then a high 
score on this SD scale may be either a measure 
of a strong tendency to give socially desirable 
responses or of a strong tendency to be ac- 
quiescent or it may, of course, reflect both 
tendencies. If the original 39-item SD scale is 
primarily a measure of acquiescence and if this 
is also the case with the all-True SD scale, 
then, from the acquiescence point of view, 
scores on the two scales should correlate nega- 
tively, since the acquiescent subject will ob- 
tain a high score on the all-True SD scale and 
a low score on the 39-item SD scale in which, 
we recall, 30 of the 39 items are keyed False. 
On the other hand, if the original 39-item SD 
scale is primarily a measure of the tendency 
to give socially desirable responses and if this 
is also the case with the all-True SD scale, then 
the correlation between the two scales should 
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be positive. Thus, I believe the obtained cor 
relation between these two SD scales would 
provide a basis for choosing between the two 
alternatives I have mentioned. In actual fact, 
the correlation was positive (.70) and consist. 
ent with the social desirability hypothesis, as 
I reported in my monograph. In addition to 
this correlation, I reported upon the correla- 
tions of the all-True SD scale—and another 
SD scale in which the True-False keying was 
balanced—with two MMPI scales, the D 
or Depression scale in which a majority (67%) 
of the items are keyed False and the Sc or 
Schizophrenia scale in which a majority (787) 
of the items are keyed True. The correlations 
obtained were consistent with the social 
desirability hypothesis, whereas this was not 
so in the case of the acquiescence hypothesis 
It was on the basis of these, and other, findings 
that I rejected the acquiescence interpretation 
of the SD scalein favor of the social desirability 
interpretation. 
At the time my monograph was in press, 
Fricke (1956) published an article in which he 
presented results which he interpreted as 
supporting the notion that the imbalance m 
the True-False keying of the various clinical 
and validity scales of the MMPI made them 
most susceptible to the acquiescence response 
set. In correspondence with me, since the pub 
lication of my monograph, Fricke has indi- 
cated that he believes this is also the ca? 
with the SD scale. Wiggins (1959) has 
taken the position that the imbalance In the 
keying of the 39-item SD scale results in 1 
being a measure of acquiescence. Thus, Wig 
gins believes that the various corres 
that have been reported between the SD sci? 
and the MMPI scales should be appropriate 
considered as evidence of the influence 
acquiescence in the MMPI rather than 0 
social desirability. He cites, in support of t 
belief, a negative correlation he obtain 
between a “response bias” scale, develo 
by Fricke (1957), and the SD scale. Ji 
and Messick (1958), in their review of Ws. 
sets, also state that the correlations e 
the SD scale and the MMPI scales may 1°% s 
acquiescence, since the SD scale coni 
disproportionate number of items keyed Fa à 
I have already cited certain evidenc. " 
indicate why I do not agree with the acqui 
cence interpretation given the SD sc 
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Wiggins, Jackson, and Messick. Nor 
hare their apparent conviction that the 
nce in the True-False keying of the 
[ scales makes these scales susceptible 

escent tendencies. And, as I shall 
later, the correlation between the SD 
Fricke's (1957) response bias scale 
he same sign and just about the same 
tude that would be predicted by the 
desirability hypothesis. What is at 
then is quite clear. It is this: Can the 
ions between the SD scale and the 
‘scales be more adequately accounted 
terms of the acquiescence hypothesis 
social desirability hypothesis? 


EYIDENCE BEARING UPON THE Two 
HYPOTHESES 


cence Hypothesis 


€ presenting some new evidence bear- 
on these two alternative hypotheses 
that there may be no misunderstand- 
t me summarize briefly the argument 
acquiescence hypothesis as it has been 
by Fricke, Wiggins, Jackson, and 
k. If a majority of the items in an 
scale are keyed False, as is also the 
th the SD scale, then the more acquies- 
bjects should obtain low scores on both 
with the less acquiescent subjects ob- 
high scores on both scales, assuming, 
itse, that both scales are measuring 
scence. Thus, the correlation between 
Scale and this MMPI scale, according 
acquiescence hypothesis, should be 
€. On the other hand, if a majority of 
ms in an MMPI scale are keyed True, 
the acquiescence hypothesis is correct, 
$ scale and the SD scale should cor- 

"negatively. 
"Us accept, for the moment, the state- 
Fricke, Wiggins, Jackson, and Messick 
Possible to evaluate the role of ac- 
ence in a scale by considering the per- 
Et ltems keyed True (or False) and 
S percentage as an index of the ac- 
nce set. I have obtained this index 
h of the 43 MMPI scales? that is, for 
"abbreviated code names for the 43 MMPI 
Rr by Dahlstrom and Welsh (1960), are 
S, Pa qd D, D-S, D-O, Hy, Hy-S, Hy-O, 
1.0, & dio Pa, Pa-S, Pa-O, Pi, Sc, Ma, 
» Dy, St, At, Ho, P», Do, Re, A, R, 


Es, Eo, No, Nu, Pn, Cn, Ca, Ne, and B. 
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PROPORTION OF KEYED F ITEMS IN MMPI SCALE 

Fic. 2. Correlations of 43 MMPI scales with the 

SD scale as a function of the proportion of keyed False 
items in the MMPI scales. 


each scale I obtained the proportion of items 
keyed False. Figure 2 shows the SD correla- 
tions? plotted against the proportion of keyed 
False items in each of the 43 MMPI scales. 
It is clear that there is some tendency for the 
SD correlations with the 43 MMPI scales to 
be related to the imbalance in the True-False 
keying of the scales. The product-moment 
correlation between the two variables is .55. 
Thus, approximately 30% of the variation in 
the SD correlations can be accounted for by the 
imbalance in the True-False keying. 


Social Desirability Hypothesis 

Let us now see how well the social desira- 
bility hypothesis accounts for the same SD 
correlations. Knowing thedistribution of social 
desirability scale values of the items in a per- 
sonality test it is possible to classify the items 
in the scale in terms of whether the keyed re- 
sponse is a socially desirable or a socially unde- 
sirable response. According to the social desira- 
bility hypothesis, if the scale contains a large 
proportion of keyed socially desirable re- 
sponses, then the correlation between this scale 
and the SD scale should be positive. On the 
other hand, if the scale contains a Jarge propor- 


tion of keyed socially undesirable responses, 

3 All of the correlations are with the 39-item SD 
scale. The correlations between the SD scale and the 
subtle and obvious scales of the MMPI are from an 
unpublished study by Fordyce and Rozynko reported 
by Edwards (1957b). The SD correlation with the B 
scale is reported by Wiggins (1959). The remaining 
correlations are based upon a sample originally tested 
by Merrill and Heathers (1956). 
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Fic. 3. Correlations of 43 MMPI scales with the SD 


scale as a function of the proportion of keyed socially 
desirable responses in the MMPI scales. 


then the correlation with the SD scale should 
be negative. If the keying of the items is rela- 
tively balanced for socially desirable and so- 
cially undesirable responses, then the correla- 
tion with the SD scale should be low. Social 
desirability scale values for the MMPI items 
have been obtained by Heineman (1953) and I 
have used these scale values to classify the 
items in each of the 43 MMPI scales in terms 
of whether or not the keyed response is a 
socially desirable or a socially undesirable one. 

I should point out that whether or not the 
keyed response is a socially desirable response 
is not necessarily a fixed characteristic of an 
MMPI item for all scales. For example, a 
given item may appear in more than one 
MMPI scale. In some scales the True response 
may be keyed and in other scales the False 
response may be keyed. Thus, in some scales 
a given item may be keyed for a socially de- 
sirable response and in other scales keyed 
for a socially undesirable response. This, of 
course was also the case when we considered 
the imbalance in the True-False keying of the 
MMPI items. 

Figure 3 shows the SD correlations plotted 
against the proportion of keyed socially de- 
sirable responses in the 43 MMPI scales. 
There can be no doubt that we have a much 
closer relationship in this instance than when 
we plotted these same correlations against 
the proportion of keyed False items in the 


4 Fricke's response bias scale is located in Figure 3 
by the point with coordinates .40 and —.59 and is in 
accord with the trend for the other points shown in 
the figure. 
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scales. The product-moment correlation be 
tween the two variables in Figure 3 is 92 
and we can account for approximately 85% 
of the variance in the SD correlations with the 
43 MMPI scales in terms of the proportion 
of keyed socially desirable responses in the 
scales. 


Partial and Multiple Correlations 
We recall that, by considering the propor 
tion of keyed False items in the MMPI scales, 
we could account for only 30% of the variance 
in the SD correlations. Even this figure of 
30% is spuriously high, as we shall now see, 
Consider the relationship shown in Figure 4 } 
The X or horizontal axis gives the proportion 
of keyed False items in each MMPI scale and 
the Y or vertical axis gives the proportion of 
keyed socially desirable responses for the 
scales. It is evident that there is some rela- 
tionship between the imbalance in the True 
False keying for a scale and the imbalance 
in the social desirability keying for the same 
to have a large proportion of keyed False 
keyed socially desirable responses. The prod- 
of proportions shown in Figure 4 is .42. 
happens to the correlation of .55 between a 


scale. In other words, those scales which tend 
items also tend to have a large proportion of 
uct-moment correlation between the two sels 

It is of interest, therefore, to consider what 
proportion of keyed False responses in a scale 
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and the correlations with the SD scale, when 
we partial out the proportion of keyed socially 
desirable responses. As shown in Table 1, the 


- partial correlation is .46. So, you see, if we 


hold constant or partial out the proportion of 
keyed socially desirable responses, the correla- 
tion between the proportion of keyed False 
responses and the SD correlations drops from 

.55 to .46. On the other hand, if we obtain the 
partial correlation between the proportion of 
keyed socially desirable responses and the SD 
correlations, holding the proportion of keyed 
False responses constant, we find that this 
partial correlation is .91 and this value does 
not differ greatly from the original zero-order 
correlation of .92. It, thus, appears that the 
variation in the correlations of the MMPI 
scales with the SD scale can be more ade- 
quately accounted for by the fact that the 
MMPI scales vary in the degree to which 
they are keyed for socially desirable responses 
than by the fact that these same scales also 
vary in the degree to which they are keyed 
for False responses. 

We can examine the same point in a slightly 
different way by looking at the multiple cor- 
relation between the SD correlations and the 
Proportion of keyed socially desirable responses 
and the proportion of keyed False responses. 
The zero-order correlation between the SD 
correlations and the proportion of keyed 
socially desirable responses in the MMPI 
Scales is 92, The gain in efficiency by taking 
Into account also the proportion of keyed 
i ht responses in the MMPI scales is very 
is ES The multiple correlation, for example, 
Fue 7, 94. Thus, the proportion of keyed 
Ele accounts for very little of the 
bo sem in the SD correlations 
TE ave considered the proportion of 

Socially desirable responses in the scales. 
think basis of the evidence presented, I 
E Ee to conclude that the im- 
mde "yen True-False keying of the items 
aE T scales is of relatively little im- 
the aes with the imbalance in 
ees esirability keying in accounting 
alse of the MMPI scales with 

e: 


the sD 
Conctupinc REMARKS 


T : 
Pome in which the acquiescence hy- 
à5 been applied to personality scales, 
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TABLE 1 
CORRELATION BETWEEN SD SCALE CORRELATIONS OF 
MMPI SCALES AND ITEM PROPERTIES OF 
THE MMPI Scares 


cem — | Mapes | pag 
Zero-order 55 +92 
Partial .46 .91 
Multiple .94 


little attention has been given to the problem 
of what kinds of items are likely to evoke 
acquiescent tendencies. For example, in a 
recent paper, Couch and Keniston (1960) 
define acquiescence as the tendency to give 
Agree (or True) responses to items regardless 
of the item content. We have seen, however, 
that the probability of a True response to a 
personality item is an increasing linear func- 
tion of the social desirability scale value of the 
item. If acquiescence exists as a personality 
trait, in the sense defined by Couch and Kenis- 
ton, then the probability of a True response for 
an acquiescent subject must be greater than 
the corresponding probability for a nonacquies- 
cent subject for all items, regardless of the 
social desirability scale values of the items. In 
other words, we would have two regression 
lines for probability of item endorsement, one 
for acquiescent subjects and one for nonacqui- 
escent subjects, with that for acquiescent sub- 
jects having the greater Y intercept. This is an 
interesting possibility but one for which as 
yet there is no supporting evidence. 

Rather than regarding acquiescence as 
independent of item content, I would agree 
with Cronbach that response sets operate 
only when items are difficult or ambiguous. 
Response sets, as he points out, do not operate 
apart from the items which evoke them. It is 
only when an item is in some way unclear so 
that the subject is in doubt as to the appropri- 
ate or correct response that response sets be- 
come of importance. 

In an achievement test, for example, a 
difücult item is, by definition, one to which 
only a few subjects know the correct response. 
Thus, a difficult item can be said to evoke 
doubts as to the correct response among more 
subjects than will an easy item. Difficult 
items, in other words, provide greater oppor- 
tunities for acquiescent responses to occur 
than do easy items. If a test consists of a 
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series of very easy items, then we may expect 
few of the total responses to reflect acquiescent 
tendencies. As the difficulty level of the items 
increases, then we may expect an increasing 
number of responses to reflect acquiescent 
tendencies. 

Suppose, for example, we divide the items 
in an achievement test into those that are 
difficult and those that are easy. Then scores 
based only upon the easy items are apt to be 
relatively free from acquiescent tendencies, 
simply because easy items provide relatively 
few opportunities for acquiescent responses 
to occur. On the other hand, scores based only 
upon the difficult items may be very much 
influenced by acquiescent tendencies because 
these items provide many opportunities for 
acquiescent responses to occur. 

I believe the situation with respect to per- 
sonality tests is somewhat similar. Scores on 
personality tests will be influenced by acquies- 
cent tendencies only to the degree to which the 
test contains items which are difficult or am- 
biguous and, thus, provide opportunities for 
acquiescent responses to occur. But we must 
consider not only the interaction between 
the items and acquiescent tendencies but also 
the interaction between the items and social 
desirability tendencies, There is evidence to 
indicate that the tendency to give socially 
desirable responses is so powerful, so dominant, 
that if it is evoked by an item, then acquies- 
cent tendencies are of little importance. The 
kinds of items which evoke the social desira- 
bility set are those with socially desirable or 
socially undesirable scale values. I would 
assume, therefore, that if a personality test 
contains mainly items with socially desirable 
or socially undesirable scale values, scores 
on the test will be little influenced by acquies- 
cent tendencies. With neutral items the socially 
desirable response is not obvious and the 
social desirability set cannot operate in the 
same way in which it does with nonneutral 
items. I would assume, therefore, that re- 
sponses to neutral items may be much more 
influenced by acquiescent tendencies than 
responses to nonneutral items. 

If Iam correct in this analysis, then it is the 
neutral items in a personality test which will 
provide opportunities for acquiescent tend- 
encies to influence scores on the test, If the 
neutral items are consistently keyed for True 
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responses, then we may expect the mor 
acquiescent subjects to obtain higher scores 
on the test than nonacquiescent subjects, 
If the neutral items are balanced in ther 
True-False keying, then we may expect 
acquiescent tendencies to have little influence 
on the total test scores. 

If most of the items in a test have socially 
desirable or socially undesirable scale values 
and only a few items have neutral social desira- 
bility scale values, then acquiescent tendencies 
should have relatively little influence upon the 
total test scores. An examination of many per 
sonality scales leads me to suggest that this 
is most often the case. Very few of these scales 
contain many neutral items in relation to the 
total number of items in the scale. Under this 
condition, we may expect scores on a scale 
to be relatively little influenced by acquiescent 
tendencies, regardless of whether or not there 
is an imbalance in the True-False keying of 
the items. The SD scale, in which all of the 
items have socially desirable or socially ur 
desirable scale values is, I believe, a case m 
point. 

If most of the items in a personality scale 
have nonneutral social desirability scale values, 
then the nature of the social desirability keying 
of the items is of importance not only for an 
evaluation of the social desirability set but also 
for an evaluation of the acquiescence set. Ac 
quiescence, for example, has sometimes been ir 
vestigated by obtaining the correlation be 
tween the separate scores on the keyed Trut 
and the keyed False items in a scale. A neg 
tive or low correlation between the scotts 
based upon the keyed True items and the 
keyed False items has been interpreted 4 
evidence of the influence of acquiescence: 
wish to emphasize, however, that this is 00 
necessarily the case. For example, the so 
desirability hypothesis would predict & r 
tive correlation between the part scores ii 
True items are consistently keyed for s0c! ui 
desirable responses and the False items $% 
socially undesirable responses, or vice e 
a low correlation if the True items are const 
ently keyed for socially desirable or soë 
undesirable responses and if there is a oe 
in the social desirability keying of the F' i 
items, or vice versa; and a positive com’. 
tion between the part scores if the items d 
both sets are consistently keyed for el 
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“socially desirable or socially undesirable re- 
. Thus, what may on the surface ap- 
pear to be the influence of the acquiescence 
may, upon closer examination, be the result 
‘of social desirability. 

Finally, I should like to say that for the 
reasons I have given in my address, I regard 
the attempt to evaluate acquiescent tendencies 
dn terms of an imbalance in the True-False 
keying of all of the items in a test as completely 
inadequate. To determine the imbalance in 
the True-False keying is a very easy approach 
to the problem, but one which overlooks the 
a of the problem. 
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THE MOTHER-CHILD RELATIONSHIP IN BRONCHIAL AS' 
MARVIN MARGOLIS? 


Michigan State University 
INCE the publication of French and 
Alexander’s (1941) now classic mono- 
graph, “Psychogenic Factors in Bron- 
chial Asthma,” the mother-child relationship 
has been increasingly implicated as an etio- 
logical factor in the genesis of childhood 
asthma. French and Alexander conceived ex- 
cessive dependence on and longing for the 
mother as being at the very core of the psycho- 
dynamics of asthma. This fear of losing ma- 
ternal love may arise from the threat of an 
actual separation or from the patient’s anxiety 
that the exposure of his aggressive and sexual 
fantasies would evoke the mother’s rejection. 
French and Alexander reported that most of 
their cases were characterized by marked and 
early maternal rejection. 

Numerous clinical reports have since con- 
curred with this formulation (Coolidge, 1956; 
Dunbar, 1938; Gerard, 1946; Jessner, Lamont, 
Long, Rollins, Whipple, & Prentice, 1955; 
Monsour, 1960; Sperling, 1949), suggesting 
that the mothers of asthmatics often carry 
into their relationship with their children their 
own unresolved childhood conflicts. Sperling 
(1949) noted that these mothers have an un- 
conscious need to keep their asthmatic off- 
spring in a helpless and dependent state to a 
degree found only among mothers of psychotic 
children. Monsour (1960) was so impressed 
with the extent of the psychological conflicts 
of these mothers that he stated that “treatment 
of the mother seems inescapable in the cases of 
young asthmatic children.” 

There have been few controlled, objective, 
experimental studies of the mothers of asth- 
matic children. Miller and Baruch (1951, 1957) 
reported studies of 201 allergic children (in- 
cluding asthmatics) and their parents: 97% of 


1 This article is based upon a dissertation submitted 
in candidacy for the degree of Doctor of Philosophy at 
Michigan State University, August 1959. The writer is 
indebted to the members of his dissertation committee: 
Albert I. Rabin (chairman), Charles Hanley, and 
Donald M. Johnson for their invaluable advice and 
encouragement. 

2 Now at Alcoholism Treatment Center, Highland 
Park Health Department, Highland Park, Michigan. 
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the mothers were described as having “y 
expressed their rejecting attitude towaj 
children”; only 37% of control 
(mothers of “problem” children) e 
similar feelings of rejection. While these. 
ences were statistically significant, the 
apparently based on subjective evaluatior 
Miller and Baruch themselves. More stand 
ized, objective measuring procedures 
seem more desirable. Another objection 
type of research lies in the lack of a ch 
ill but nonpsychosomatic control grou 
out such controls, the alleged rejectin 
tudes of the allergic mothers may be at 
to their reactions to caring for a chroni 
child, often an oppressive burden, bo! 
nomically and psychologically. D 
and often irritable, such children requi 
mother to make many personal sacrifi 
properly care for them. Many mothers 
personally responsible for their child's. 
and this added strain often aggravates oth 
wise quiescent conflicts, introducing @ 
active factor." 
Cutter (1955) reported studying 33 m 
of asthmatic children by means of a qui 
naire, measuring parent-child interact 
“warmth,” “freedom,” and “control. 
tistically significant differences were not 
between experimental and control 
Fitzelle (1959) studied a group of 100 m 
of asthmatic children, using the MM 
the USC Parent Attitude Inventory. Ag 
significant differences were found. Thes 
studies, then, raise doubt about the 
clinical claims regarding the pathogen! 
maternal attitudes in the genesis of bro 
asthma, 3 
Some clinicians, however, may qu 
whether propositions derived from : 
analytic observations, involving intenst 
character of psychological conflicts, 
adequately tested by instruments basica 
signed to measure attitudes regarding | 
rearing behavior (e.g., the USC Parent 4 
tude Inventory). Clinical reports ha 
gested that some mothers of asthmatic CA 
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overindulge their children, whereas others are 
openly hostile and rejecting. Both groups, 
however, may share the same core conflicts, 
centered on unresolved dependency relation- 
ships with their own mothers. Their observable 
behavior and their attitudes regarding child 
rearing could, therefore, be quite different and 
have the effect of cancelling the extremes. As 
a result, experimental and control groups may 
show similar scores on attitude scales. It is also 
apparent that the intensity of psychosexual 
conflicts cannot be easily inferred from MMPI 
indices, which are not directly relevant to the 
testing of hypotheses by psychoanalytically 
oriented clinicians who have studied asthma. 

This study has attempted to answer some of 
these objections. A projective test based upon 
psychoanalytic constructs (Blacky Pictures 
Test) was included in the test battery. All tests 
employed could be objectively scored. A 
chronically ill but nonpsychosomatic control 
group was provided to aid in evaluating the 
Teactive factor. Such a design may more 
adequately test the notion that the mother- 
child telationship plays a primary, pathogenic 
tole in the etiology of bronchial asthma. It was 
hypothesized that mothers of asthmatic 
oe appear more emotionally disturbed on 

€ tests selected than mothers of comparable 
control subjects, 


METHOD 
Sample 


nd ee, for this study was drawn from the 
Detroit, M „Patients treated at Children’s Hospital in 
aimo, ichigan.3 The experimental group consisted 
(A Sd of asthmatic children in the Allergy Clinic 
chile, a. A group of mothers of chronically ill 
no iu selected from the cases in the Rheumatic 
matic cont ic is Serve as a chronically ill, nonpsychoso- 
Selected for ¢ ieee (RH mothers). No women were 
Bs r this control group if any of their children 
auh eie disease. Because rheumatic fever 
lia (Fri usually equal to or greater in severity 
represented riedberg, 1956, p. 858), the RH group 
Ra Ape, Conservative, chronically ill controls. 

nd control group was selected from the 


x JA 
of estan 4s expressed to the following members 
in ol tainin, " Children's Hospital for their assistance 
(consultin g subjects for this study: Joseph Fischhoft 
Rheumatic ip chlatrist); Manes S. Hecht (Director of 
ergy ae Clinic); Samuel J. Levin (Director 
lini); Charles N. Weller (Director of 


Surge "EM 
in id) nios and Paul V. Woolley, Jr. (Pediatrician- 
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mothers of relatively healthy children. Nineteen 
mothers were chosen from the Surgery Clinic. Their 
children were in the hospital for such routine operations 
as appendectomies. Six mothers were selected from the 
Outpatient Department (OPD), where the patients 
were being treated for minor cuts and burns. Mothers 
were tested several days after their childrens' operation 
or clinic treatment so that any situational anxiety 
attendant upon the hospital procedures might be 
dissipated. Only those mothers were included who had 
no allergic or chronically ill children. This group is 
referred to as the surgery-o.p.d. (SOPD) mothers or 
healthy controls. 

Table 1 indicates that the three groups were drawn 
approximately from the same socioeconomic, religious, 
and racial groups. In general, these mothers were 
lower middle to upper lower in status, The three groups 
were matched with regard to race because of Negro- 
white differences in the role of the mother in family life 
(Frazier, 1951). That this was a necessary precaution 
will be seen in a subsequent section of this paper.* 


Procedure 


Mothers were asked to participate in the study by 
either their physician or by the investigator. All mothers 
who appeared at their respective clinics during a 
period of several consecutive weeks were asked. In 
almost every case they agreed to participate. They 
were told by the investigator that this study of the 
relationship of maternal attitudes to illness was being 
conducted in many clinics throughout the hospital. 
They were tested at the hospital in groups of three to 
five mothers. Five to six mothers in each group, who 
volunteered to participate but subsequently were 
unable to come to the hospital for testing, were tested 
at home. Therefore, neither refusal to participate nor 
inability to complete the testing procedure constituted 
a serious source of bias in this study. 


Test Battery 


The Blacky Pictures were selected as the major test 
in the test battery because it is specifically based upon 
psychoanalytic constructs. It can also be group 
administered and objectively scored. Each of the 
Blacky cartoons is concerned with either a. particular 
stage of psychosexual development or type of object 
relationship and can be scored for intensity and char- 
acter of conflict in that particular area. The subject's 
total performance on each card is summed up in the 
Overall Dimensional Score. A “fairly strong" or “very 


‘The background data obtained from the mothers 
in regard to ordinal position of their patient-child is 
also worthy of attention. Many have speculated about 
a possible link between ordinal position and asthma; 
they have observed that asthmatics are often oldest 
children. Jessner, Lamont, Long, Rollins, Whipple, and 
Prentice (1955) found that 19 out of 28 asthmatics in 
their sample were oldest or only children. This was not 
the case in this study, for only 11 out of 25 cases were 
oldest or only children, the differences between experi- 
mental and control groups in this regard were not 


statistically significant. 
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TABLE 1 
CHARACTERISTICS OF EXPERIMENTAL AND CONTROL SUBJECTS 
Race Religion Education Income Marital status 
Group s TN 

Negro | White | Protestant| Catholic | eats, | Totalfamily Living with | Separate 
Asthma mothers 15 10 1f 8 MIT $3,921.28 19 6 
RH mothers 15 10 19 6 11.0 $4,063.00 22 3 
SOPD mothers 15 10 24 d 11.5 $4,134.60 22 3 
strong" overall dimensional score reflects conflict. A RESULTS 


“weak” score indicates a relative lack of conflict in 
that area. The Revised Scoring System for Research 
Use (Female form) by Blum (1951) was used.5 Code 
numbers were assigned at random to all protocols so 
that scoring was done without knowledge as of whether 
a particular test belonged to an experimental or control 
subject. 

'The second test in the battery was the Parental 
Attitude Research Instrument (PARI), devised by 
Schaefer and Bell (1955). Form IV of the PARI was 
used, consisting of 23 five-item scales measuring 
attitudes toward child rearing and family life. This 
instrument was selected because it would provide data. 
complementary to the Blacky Pictures in its assessment 
of relatively more conscious, attitudinal material, and 
also because it is comparable to the other attitude 
measuring devices used in earlier studies. 

PARI items are measured on a four-point scale 
ranging from strongly agree to strongly disagree. The 
majority of the scales are so constructed that less 
desirable child rearing attitudes are associated with 
agreement. This type of scale construction has been 
criticized because scores can be distorted on the basis of 
acquiescence as a response bias. Therefore, an 
Acquiescence scale was constructed to control for this 
factor. This 30-item scale was based on 15 items drawn 
from the F Scale and 15 reversed content F Scale items, 
prepared originally by Jackson and Messick (1957). The 
acquiescence items were interspersed among the 115 
items of the PARI. 


The following two hypotheses were tested 
in this study: 

1. More mothers of asthmatic children are 
characterized by intense psychosexual conflicts 
than are the mothers of controls. This is mani- 
fested by “stronger” Overall Dimensional 
Scores on the Blacky Pictures Test. 

2. The mothers of asthmatics exhibit more 
psychologically damaging attitudes regarding 
child rearing and family life than the mothers 
of control subjects. This is manifested by a 


higher score on the pathogenic scales of the 
PARI. 


* The writer is grateful to Gerald S. Blum for his 
supervision in scoring the Blacky protocols. 


Table 2 consists of 26 comparisons made 
between the experimental and control mothers 
on their Overall Dimensional Scores on the 
Blacky Pictures Test. The chi square technique 
was used to analyze the data. The Yates cor 
rection was utilized whenever the expected 
frequencies were below 10. Three of the tests 
were statistically significant; in each case, the 
differences were in the predicted direction. On 
all tests, the asthma mothers had higher Over- 
all Dimensional Scores, the indices of greater 
psychosexual conflict, than the control mothers. 
"Three significant tests out of 26 are slightly 
more than would be expected by chance alone. 
Therefore, the first hypothesis is essentially 
supported by these data. r 

An analysis of the content of the statistically 
significant tests afforded hints as to the nature 
of the possibly major conflict areas of the 
asthma mothers. They were most disturbed in 
their Oedipal relationships. On Card IV (Oedi- 
pal Intensity) they had higher scores than the 
mothers of both the chronically ill and healthy 
controls. Moreover, the differences on Card 
between the A and SOPD mothers were greatet 
than the differences between the A and 
mothers, as expected from considerations 0 
the reactive role of physical illness in children 
in generating maternal conflicts. The A ee 
also had significantly higher scores than UP 
SOPD mothers on Card I (Oral Eroticism) 
The RH mothers had higher scores than 
SOPD but lower scores than the A mothers M 
this dimension. In comparing the RH a 
SOPD mothers with each other, one statist 
cally significant test was found on Card a 
(Sibling Rivalry). The RH mothers had 5 
higher Overall Dimensional Scores, suggest 
greater psychopathology. 


A separate analysis of the Overall pr 
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E TABLE 2 
be OVERALL DIMENSIONAL ScorEs FOR TOTAL SAMPLE 
= (n = 75) 
= 
& A RH SOPD 
" B lacky mothers. mothers mothers x nee AS between 
v» SR 3 8 mothers pue 
ES T Qua F ob | 4|] c 
T. Oral Eroticism 14 | 121 | 12 | 13 | 6 | 19 32 ^" 
"IL, Oral Sadism 6 | 19 | 9 | 16 | 10 | 15 138 E 
: TIT. Anal Sadism (Exp.) 10 15 8 17 9 16 .10 0.00 
— TIL Anal Sadism (Ret.) i3 "742^ |213: 201 dS ed 0.00 0,00 
"IV: Oedipal Intensity 22 3 | 16 9 | 14 | t1 2.74* 4.86** 
E. "Masturbation Guilt 9 16 12 13 10 15 74 0.00 
(VI. Penis. Envy 0 25 2 23 3 22 +52 1.42 
_ VIL Identification Process 19 6 16 9 16 9 38 .38 
‘VIL Sibling Rivalry 14 | 32 | 18 | 7 | 10 | 15 E 1.28 
TX. Guilt Feelings 8 17 13 12 11 14 2.06 .34 
X: Ego Ideal. : 10 | 15 | 10 | 15 8 | 17 0.00 0.10 
5 Narcissistic Love Object 18 7 23 2 19 6 2.18 0.00 
et Anaclitic Love Object 17 8 12 13 14 11 2.06 0.36 


L Notem All x? values represent one-tailed tests. 


ay "i ” “fai E, 
3 represents “very strong” and “fairly strong” Overall Dimensional Scores. 


PO represents “weak” Overall Dimensional Scores. 
p 05-025. 
"p 025-01. 


mensional Scores along racial lines was made. 
‘The same trends continued: A mothers, in 
both the white and Negro groups, demonstrated 


- Breater conflict on both Cards I and IV. 


Additional perspective can be obtained by a 
lr, more qualitative analysis of some of 

the components of the Blacky data that are 
fon up by the Overall Dimensional Score.’ 
Ta Strong" spontaneous stories of the A 
Hs lers to Card I were evasive, either ignoring 

M minimizing the fact that the mother dog is 

: nursing Blacky. In contrast, the second largest 
‘ group Wrote frank stories expressing an intense 


| desire for food on Blacky's part, describing 


Macky as being “a little greedy” and as one 
Who "loves to eat." It would P then, that 
me mothers were quite concerned about their 
| Utance needs. While they typically at- 
1 P to handle them by evasion, there 
i. me cases in which the conflicts were so 

neat Or the defenses so weak that urgent oral 


.. Concerns were quite openly revealed. 


Wy asion was also characteristic of the 

P" Stories produced by the A mothers 
The s p 

subjects Best Dimensional Score is based on the 

= Pontaneous story produced in response to 


; his responses to multiple-choice Inquiry 


3 dim » related comments in regard to that particular 


-dimensi 
Tatings 9n produced anywhere in the protocol, and on 


oi 
f cartoons as to preference. 


to Card IV. This card depicts an angry Blacky 
watching his mother and father making love. 
One A mother wrote: “Blacky has found 
Mama and Papa kind of spooning.... She 
likes these little episodes to occur as it makes 
her happy too." Such data suggest that the A 
mothers are attempting to contain their con- 
flicts with primitive denial mechanisms. The 
second largest group of strong stories produced 
by the A mothers was quite candid about their 
jealousy, e.g.; “Blacky is jealous because she 
is in love with Papa more than with Mama, 
She doesn't like the attention that Papa is 
giving Mama. ... She's unhappy and jealous." 

This qualitative analysis can be further pur- 
sued by considering those Inquiry items which 
approached or achieved statistical significance 
as in Table 3. Item IV, 3a and b, pointed up 
the greater involvement of the A mothers in 
Oedipal conflicts than the RH mothers. The A 
mothers seem to be more closely identified 
with their own mothers and, at the same time, 
tend to see their mothers as the disciplinary 
figure in the household more than is the case 
with the RH mothers (VII, 1a and c; VII, 3a 
and c). The A mothers also seem to be less 
concerned with masturbatory conflicts than 
the RH mothers (V, 2a), but they do not seem 
as confident about Blacky’s possibilities of 
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TABLE 3 
Inquiry CHOICES FOR TOTAL SAMPLE 
(n = 75) 
Card Question and answer selected AS) x penea 
IV. | 3. Which one of the following makes Blacky most unhappy? 
keeping P: Il to herself. A>RH 3.06 | .10-.05 
GE RUE Ad AS SOPD | 3.00 | .10-.05 
b) The idea that Mama and Papa seem to be ignoring her on RH>A 5.18 | .05-.02 
purpose. SOPD > A ]|5.18 | .05-.02 
V. | 2. How might Blacky feel about this situation when she is older? 
2) Happy without a care in the world. |; A > RH 2.82 | .10-.05 
VII. | 1. Who talks like that to Blacky—Mama or Papa or Tippy? 
a) Mama A> RH 2.94 | .10-.05 
b) Tippy RH» A 6.60 | .02-.01 
VIL | 3. Whom is Blacky imitating here? Mama or Papa or Tippy? 
a) Mama A> RH 3.36 | .10-.05 
c) Tippy d RH>A 4.78 | .05-.02 
VIII. | 3. Who does Blacky feel is paying more attention to Tip) 
b) Papa e Rete in A>SOPD | 4.72 | .05-.02 
X | 5. Actually, what are Blacky’s chances of growing up to be like 
the figure in her dream? 
a) Very good RH > A 4.13 | .05-.0 


* Significance levels represent two-tailed tests. 


TABLE 4 


PARI Scores or A Mormaers, RH MOTHERS, AND 
SOPD MOTHERS ror NEGRO SAMPLE 


(n = 45) 


Scale Analysis of variance | Analysis of 
is 4.10* 2.32 
10. 3155* 1.84 


* F ratio significant at .05-.01 level. 


TABLE 5 
TOTAL PATHOGENIC Scores or NEGRO MOTHERS 
AS COMPARED WITH WHITE MOTHERS 


(n = 15) 
Analysis of Analysis of 
variance covariance* 
F Ratio 9.95 1 
Level of significance .01 ns 


* Adjusted for acquiescence. 


growing up to be like her ego ideal as do the 
RH mothers (X, 5a). 


As compared with the SOPD mothers, the 
A mothers also appear more conflicted in the 
area of Oedipal relationships (IV, 3a and b). 
There is also evidence of greater sibling rivalry 
on the part of the A mothers (VIIT, 3b). 
There seems to be an Oedipal flavor to these 
feelings, inasmuch as the father is the parental 


figure toward whom the rivalry for affectionis 
directed. 

The PARI analysis allowed a test of Hy- 
pothesis 2. Analysis of covariance allowed for 
controlling the effect of acquiescence on the 
PARI scores. The mothers were first compared 
in terms of their total Pathogenic scores. When 
no significant differences were found, they were 
then compared by analysis of variance on eat 
of the 23 PARI scales. A significant P rato 
was obtained only on Scale 23 (Dependency af 
Mother scale). Since one significant F co 
have been obtained by chance in 23 tests, the 
hypothesis was not confirmed. n 

The PARI data were also analyzed by racial 
groups. The total Pathogenic scores were again 
not found to be significantly different. The 
data for the racial groups were also anal; 
separately for each of the 23 component 
scales. No significant differences emerged. 

Interestingly, these analyses would have 
been altered had there been no control for 
acquiescence response set. This can be seen 1 
Table 4. Two significant F ratios disappe^" 
when the means were adjusted for acqur 
escence. A more striking example was Er 
when the Negro and white samples Mo 
compared on total Pathogenic scores. In Tal 
5 it can be seen that the Negro mothers 


e 
significantly higher Pathogenic scores when 
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analysis of variance results alone were con- 
sidered. However, when adjustments were 
made for acquiescence, the differences in total 
Pathogenic scores vanished. 


DISCUSSION 


An overall evaluation of the Blacky and 
PARI results indicates some support for the 
psychoanalytic formulations of the psycho- 
dynamics of the mother-child relationship in 
asthma, In this population, mothers of asth- 
matic children appeared to be more emotion- 
ally disturbed than mothers of nonasthmatic 
children. However, the reported differences 
were very slight in relation to the claims of 
some clinicians regarding the etiological in- 
volvement of psychological factors. In view of 
the small number of subjects in this study and 
the slight differences in test performance of the 
experimental and control groups, considerable 
Caution must be exercised in interpreting the 
findings. Replication of this study is essential 
before one may conclude that the psychoso- 
matic hypothesis has been genuinely confirmed. 

Those who state that much of the psycho- 
logical distress of the mother of the asthmatic 
child is reactive to the child's illness can draw 
Some comfort from these data. The mothers of 
the chronically ill children appear to be more 
Psychologically disturbed than those of healthy 
children, However, this reactive factor does 
not account for all of the variance; the A 
ee appeared to be more emotionally dis- 
RM than the RH mothers. Nevertheless, 
h og factor has been too often ignored 
ie e psychosomatic literature. It may account 
Ne major portion of the pathology which 

clans have observed in the mothers of 
Psychosomatic patients. 
ee oct of the mothers’ psychosexual 
A p is also of interest, The fact that the 

SUR ers had higher scores on the Oral 
ARE dimension is consistent with the 
Sus dg speculations. Mothers of asth- 
E ci 2 dren have been described as being 
(m ependent and having inordinate 
GER Ue needs by Sperling (1949), Jessner 
bility his 8 Coolidge (1956). The possi- 
Me area of no are particularly disturbed in 
tithe a edipal relationships, as is hinted 
ae pede ent study, while implicit in all of 

ngs, has not been made a point of 
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special attention. The Inquiry data point to 
complex ramifications of the A mothers' possi- 
ble Oedipal conflict. For example, they ex- 
pressed their sibling rivalry in terms of compe- 
tition for the father's rather than the mother's 
attention. They also were more closely identi- 
fied with their mothers and did not expect to 
realize their ego ideals. These latter two trends 
in the Inquiry data are consistent with height- 
ened Oedipal conflicts, Identification with the 
mother is assumed to be an integral step in a 
girl's attempt to resolve her Oedipal problem; 
an exaggerated attempt at identification can 
follow from an exacerbated Oedipal conflict. 
If a woman renounces her ability to achieve 
her ego ideals (e.g., to become an attractive 
woman, desired by men), she cannot be 
“accused” of being a serious rival of her 
mother's in bidding for her father's affections. 
Tn this fashion, anxiety over Oedipal wishes is 
allayed. 

Finally, in view of the large claims of clini- 
cians regarding the “asthmatogenic mother,” 
it may be useful to consider some different in- 
terpretations of the very modest trends sup- 
porting the psychosomatic position. First, it is 
possible that the present findings are limited 
to a lower income group. Perhaps larger dif- 
ferences in favor of the psychosomatic hy- 
pothesis would be found in higher socioeco- 
nomic groups. This is especially likely since 
psychoanalytic formulations have been based 
primarily on work with such patients. Another 
possibility lies in the fact that the mother may 
not be as potent an etiological factor in asthma 
as is often believed. Psychological causation 
may be a more general phenomenon. In one 
case, it may be the father who plays the pri- 
mary role; in another, it may be the siblings. 
In still a third, it may be the interaction of the 
entire family. Consequently, perhaps small 
differences are all that can be expected when 
attempts are made to isolate single etiological 
factors which are presumed to be responsible 
for large, complex phenomena. It is also possi- 
ble that dynamic involvement is primary in 
only a few cases and the reactive factor in 
many. Future studies could profitably compare 
severely asthmatic with mildly asthmatic 
groups to determine whether psychological 
processes vary, as some allergists believe, with 
severity of illness. 
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Some of the most important implications of 
the present study are methodological. The 
PARI analysis indicated that an acquiescence 
response bias drastically alters findings with 
this instrument. The Negro group, for example, 
was far more acquiescent in this study than 
the white sample. Therefore, their Pathogenic 
scores were spuriously higher than the whites’ 
indices. There are pitfalls in using attitude 
scales with multiple-choice responses along an 
agree-disagree continuum for purposes like the 
present ones. 

It has also been demonstrated that a chroni- 
cally ill control group is very necessary in this 
type of psychosomatic research, Without such 
a group, the reactive factor cannot be properly 
assessed. A large majority of psychosomatic 
studies do not provide for such a control group, 
and differences reported in such studies are un- 
warrantedly attributed wholly to dynamic 
processes. 

The fact that the Blacky test was able in 
some degree to differentiate the clinical groups 
while the PARI was not is also worthy of note. 
Perhaps projective tests are more appropriate 
for research involving psychodynamic phe- 
nomena of the kind considered here. As was 
previously indicated, the same underlying con- 
flicts may accompany totally different person- 
ality types. The conscious attitudes of indi- 
viduals may differ although the underlying 
pathology is identical. The performance of such 
varying types of personalities on attitude 
questionnaires would therefore not be in a 
given direction, and as a group, such subjects 
may appear similar to controls. 

Finally, this study represents some advance 
in the validation of the Blacky Pictures. Since 
the Blacky test was able to differentiate the 
asthmatic mothers from control groups, more 
or less along lines predicted by clinicians, this 
can be interpreted as another confirmation of 
its discriminative power. While the evidence is 
by no means entirely persuasive, at this stage 
of development in projective test validation, 
the Blacky seems to represent one of the 
better methods. 


SUMMARY 


The etiological role of the mother-child rela- 
tionship in the genesis of bronchial asthma has 
long been implicated in psychosomatic theory. 


Marvin MARGOLIS 


An attempt was made to examine this notion 
by studying the psychosexual conflicts and the 
attitudes towards child rearing and family life 
of 25 mothers of children with bronchial 
asthma. They were compared with 25 mothers 
of chronically ill children and 25 mothers of 
healthy children. The test battery consisted of 
the Blacky Pictures and the PARI. 

The mothers of asthmatic children appeared 
more emotionally disturbed than the control 
mothers on the Blacky, manifesting greater 
disturbance in the areas of Oral Eroticism and 
Oedipal Intensity. 

The reactive role that the child's chronic ill 
ness plays in aggravating the psychological 
conflicts of the mothers is suggested by the fact 
that in both Cards I and IV, while A mothers 
gave evidence of being the most psycholog- 
cally disturbed, the healthy control group ap: 
peared least disturbed. The chronically ill con- 
trol group fell between the two. 

There were no significant differences in the 
PARI performance of the experimental and 
control groups. 

An acquiescence response bias was shown to 
influence the scores obtained on the PARI 
This bias seems to limit the use of this instnr 
ment. 

In general, this study gives a very small 
measure of support to the psychosomatic hy- 
pothesis that the mother-child relationship 
plays a primary etiological role in the genet 
tion of asthmatic symptoms. 
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OBJECTIVE COMPARED WITH SUBJECTIVE MEASURES OF THE 


SAME BEHAVIO. 


ERNEST B. GURMAN 
Mississippi Southern College 


Ww REVIOUS studies in the research program 
from which the present paper originates 
have shown that the objective measures 

of social interaction described by Bass, Gaier, 
Flint, and Farese (1957) are reliable (Bass, 
1959b; MacDonald, 1960). In addition, the 
objective measures of stability and coalescence 
of opinion from before to after discussions 
have been shown to covary meaningfully with 
each other (Bass, 1955b). Further, Flint and 
Bass (1957) have demonstrated the construct 
validity of the objective measures of influence 
or leadership by noting that they are associated 
with ability, esteem, and subsequent merit. 
The present study also explores properties 
of these objective measures of social inter- 
action, which are based on the rank-difference 

correlations of judgments of members of a 

group before and after a discussion about the 

judgments. It examines the agreement between 
the objective measures of interaction and 
observers’ and participants’ ratings about what 
occurred during and after a group’s interaction. 

If extremely high correlations were to be 
found between the objective and subjective 
assessments, either assessment could serve in 
the place of the other. If no correlation were 
found, it would put into question the meaning- 
fulness of what small group observers report 
and/or of the objective measures. If, as 
expected, moderate correlations should emerge, 
the objective and subjective assessments of 
such social concepts as successful leadership 
would appear to share some common variance, 
although reliance on observers and participants 
alone would be inadequate to describe the 
behavior completely, if behavior, per se, were 
the focus of study. Similarly, if observers’ or 
participants’ perceptions should be the focus 


1 This work was aided by funds from the Louisiana 
State University Council on Research and Contract N7 
ONR 35609. Parts of the report were included in a 
thesis of the senior author partially fulfilling the 
requirements of a master’s degree at Louisiana State 
University. 

?Now at the University of California, Berkeley, 
1961-62. 
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BERNARD M. BASS? 
Louisiana State University 


of a study, objective assessment alone would 
be inadequate. 


METHOD 

Subjects 

The subjects consisted of 42 male and 18 female 
undergraduates from an introductory psychology 
course. The 60 students were all volunteers and were 
randomly assigned to 12 groups of five members each, 

Each subject was to be given an innocuous report 
after the completion of the experiment, With only this 
offer as an incentive, the groups were expected to have 
relatively low motivation to perform, a desirable 
condition in view of the higher reliabilities found for 
low motivation groups by Bass (1959b). 


Procedures 


Five case histories including a practice problem 
were discussed by each group. Five possible solutions to 
each problem were provided. f 

Figure 1 shows such a case. The five alternatives atè 
not necessarily the five best solutions but rather five 
from a much larger list which were originally judged by 
a large sample of subjects to be fairly close in quality 
but still reliably different from each other in value 
Closeness in quality serves to produce a discussion 
problem of considerable difficulty and conflicting 
opinion, yielding maximum opportunities for AN 
personal influence and change. As expected theoretict y 
(Bass, 1960b, p. 136) and confirmed empirically oa i 
& Flint, 1958), there is a high correlation between hos 
ambiguous or difficult the problem confronting & group i 
is, and how much influence occurs in the group. " 

The alternatives are shown here by worth e j 
ing to two graduate psychology classes of 25 stu en! 
each. Interclass agreement was .9. a 

The objective measures of social interaction Be 
collected using the analog computer described by 
et al. (1957). The computer indicates directly the f: 
difference correlation of the various sets of ru 
of merit of the five solutions to a case history e us 
by each subject before and after discussion of thal 
history and its five solutions. housing 

Each of the five members has his own panel, ei P 
two columns of five-position selector switches. desired 
left column (X), the member sets his initial E 
rank for each alternative, using the topmost E i 
for the first alternative, the second switch s ae 
second, and so forth. If a subject ties ranks by b 
a light on the experimenter's panel Ee 
signals the experimenter, who notifies the suy Rs 
he must correct the error. In the right COU" ite 
Switches, the member sets his final (Y) rani 
discussion. The experimenter's panel contains 


the 


d 


f 
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Case History A 


The Case History of Paul 


Paul, a sophomore at a state university, knew that 
a certain group of boys had bribed a person in the 
mimeograph office and obtained an important exam. 
He knew that if he exposed the bribe, the exam would 
be changed, but the people involved, many of whom 
he knew quite well, would be caught and since the 
university enforced the rules against cheating very 
strictly, would probably be suspended from the school, 
or at least be given an F in the course. Such an action 
would obviously make Paul extremely unpopular with 
the students during the rest of his stay. He could afford 
to go nowhere else. 

Paul was an average student, but a series of personal 

problems last semester has affected his studies and 
caused him to be put on probation. He had to pass this 
yery rough course to stay in college, and he was just on 
the borderline between passing and failing. 
i The gang with the stolen exam offered to cut Paul 
in on it, since they knew Paul had seen a copy in the 
hands of one of the fellows in the dorm; but Paul had 
strong moral feelings against cheating and had turned 
down the offer. But since the course was graded on the 
curve, he felt the added advantage of the others would 
be sufficient in such a small class to cause him to fail. 
What should he do? 


A. Try to convince the other fellows not to use the 
exam to study by. 

B. Paul could inform officials that the exam had been 
passed among the students; he could do this in a 
letter, hence would not involve himself. 

C. Consult with teacher. 

D. Keep mum and take test as is. 

E. Seek aid to problem from minister. 


Fic. 1. Human relations problems: 
Solutions provided. 


dis Íor recording the group decision (G) and 
Á Mo rennes (R) when these are available. 
im p ding door cover on each subject’s panel prevents 
res rom observing the X and Y columns simul- 
eously, 
Bee patentee can read on a calibrated ammeter 
Pee oe correlation between any of the 12 
Duc h s (Xi, Xs - + +, G, R) by closing the appro- 
UD mmeter switch on his own panel He can 
ro Ad communicate these readings back to members’ 
fide ird as desired, although such was not done 
ent study. 
lé participants were given 3 minutes on the 
ee PUES and 10 minutes for each case history 
dey xum , B, C, and D are reported in Bass, 1960a). 
H mh eee of the four cases was randomized. 
groups r due was considered sufficient since most 
time limit. final decisions before reaching the 
E E D member first read the discussion problem 
eating tanked the solutions on his panel before 
ig the discussion (X). 
BUD then was asked to enter into a discussion 
ve at a decision as to the best rank ordering. 
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After the group decision (G) was reported to the experi- 
menter, each subject, without referring to bis first 
rankings, once again ranked the solutions in the order 
he preferred (Y). 

Two observers rated the groups after each case 
history on each of the same variables purportedly 
assessed objectively, In addition, each participant 
made one rating on each variable of all members 
(including himself) at the termination of each discussion 
session. 


Objective Measures 


Stability. The rank-difference correlation between 
each subject's rank judgments before (X) and after (Y) 
a discussion provided an objective measure of that 
subject's stability of opinion for that discussion. This 
rho correlation is symbolized hereafter as XV. 

Initial agreement with group decision. This was the 
rank-difference correlation between a subject’s initial 
judgment (X) and the group decision (G), symbolized 
as GX. 

Acceptance of the group decision. This was the rank- 
difference correlation between a subject’s final opinion 
(Y) and the group decision (G), symbolized as GY. 

Initial agreement with others. This was each subject's 
mean correlation initially with all other members of his 
group, symbolized as XX. 

Final agreement. This was each subject’s mean 
correlation after discussion with all others in his group, 
symbolized as YY. 

Altempled. leadership. This was the total amount of 
time spent talking by a subject during a discussion 
(T). (For a discussion of the validity of this objective 
measure, see Bass, 19592.) 

Public success as a leader (L). Public successful 
leadership of a subject was the extent the group decision 
correlated with that subject's initial ranking rather 
than with other members’ judgments and the extent 
the subject agreed more with the group decision relative 
to his initial agreement with others. Algebraically, the 
public successful leadership of member 2 was equal to 
the amount the group decision agreed with member i's 
initial decision (GX), less the amount the group decision 
did not agree with the other members" initial rankings 
(GX), plus the amount member #’s final decision agreed 
with group decision (GY), less the average amount he 
agreed initially with the other members (XX). (Subse- 
quent research had indicated that L is not as valid a 
measure of successful leadership as ^, relative success 
as a leader, Bass, 19592.) 


Subjective Measures 


of each other’s di ion behavior on the basis of what 


they saw or heard and on 
(One possible additional complex 
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by the clicking of the switches. For example, one might 
infer rightly, or sometimes wrongly, that much clicking 
meant much changing of opinion.) To indicate how 
much a particular behavior was exhibited by each dis- 
cussant, raters used a five-point scale as follows: 40, A 
great deal; 30, Fairly much; 20, To some degree; 10, 
Comparatively little; 0, Not at all. 

The correspondence, by definition, between the ob- 
server and participant-rated behavior and objective 
measures is as follows: 


OBJECTIVE SYMBOL BEHAVIOR RATED BY OB- 
MEASURE SERVERS AND BY 
PARTICIPANTS 
Stability (XY) He maintained his initial 
decision. 
Initial agreement (GX) The group decision agreed 


with group de- with his initial private 


cision decision. 

Acceptance of (GY) His final private decision 
the group de- agreed with the group 
cision decision. 

Initial agreement (XX) He agreed privately with 
with others the rest of the group ini- 


tially. 
He agreed privately with 
the rest of the group 


Final agreement 


(Y) 


finally. 
Attempted lead- (T) He attempted to influence 
ership others. 
Public success as (L) He successfully influenced 
a leader others publicly. 
ResuLTS 
Reliability of Observers 


The two observers’ ratings of the 60 subjects 
were correlated for each variable on each 
problem. The resulting 28 correlations were 
corrected by the Spearman-Brown formula to 
provide an estimate of the reliability of a 
composite observation based on the two obser- 
vers. The four reliability estimates for a 
measure were averaged via Fisher’s z trans- 
formation, and this average was then corrected 
with the Spearman-Brown formula to obtain a 
final estimated reliability of a score based on 
two independent observations of a total of 
four problems, Table 1 shows these estimated 
reliabilities of the observations pooled for two 
observers across four problems. 

The estimated reliabilities ranged from .63 
to .94 for the seven variables. Observations 
were more reliable when two observers rated 
overt rather than covert forms of behavior. 
The highest reliability of .94 occurred when 
observers rated T. Next most reliable were 
observers’ ratings of L. The most unreliable 
observer ratings were of XX before any 
discussion began. This is understandable since 
the observers could only estimate the amount 
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TABLE 1 

CORRECTED SPLIT-HALF RELIABILITIES oF 

OBSERVER RATED MEASURES OF 
INDIVIDUAL PERFORMANCES 


Problem 
Behavior rated by observer or 
participants 


He maintained his initial decision. |.30 |.58 |.36 |.62 
(xv) 
The group decision agreed with his |.23 |.36 |.60 |.44 
initial private decision. (GX) 
His final private decision agreed with |.62 |.56 |.64 |.45 
the group decision. (GY) 
He agreed privately with the rest of|.47 |.26 |.44 |.00 
the group initially. (XX) 
He agreed privately with the rest of |.42 |.56 |.42 |.06 
the group finally. (YY) 
He attempted to influence others. (T)|.82 |.81 |.73 |.79. 
He successfully influenced others |.61 |.70 |.67 |.53 
publicly, (L) 


of XX from the direction of action and possible 
opinions expressed during the progress of thé 
discussion. 


Reliability of Objective Measures 


Objective performance on a particu. 
variable such as XY on each problem wis 
intercorrelated with similar performance 0 
every other problem. The Spearman-Browl 
formula was used by MacDonald (1960) to 
estimate from this average intercorrelation y | 
reliability of a test four times the length of # 
single problem. 

Each variable was treated in the same wayi 
The results were generally lower than had been. 
attained for 10-12 shorter problems about the 
familiarity of words or the size of cities requi 
ing the same overall testing time (Bass, 1959 b), 
The estimated reliabilities for combina 
scores based on the four human relation 
problems are as follows: XY = .31, Gx E 
20, GY = .69, XX = 45, YY = 3L T7. 
93, L = .40. 

These reliabilities were reduced cons 
by one inconsistent case (Case D). They “a 
have been lowered further by differences JE 
problem content and difficulty. Altho E 
extrinsic reward was equal for all subi 
intrinsic motivation associated with bo f [: 
and the difficulty of the problem may adl 
varied. Absence of feedback may also have ios 
an adverse effect on consistency. In prevo 


iderably 
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reliability studies, feedback was given as to 
the correctness of the solutions. Since many 
of the solutions were arbitrary, the lack of 
feedback in the present study may have 
reduced the consistency of performance. 


Correlation of Observer Ratings and Objective 
Measurements 


As Bass (1960b, Ch. 2) suggested, the same 
construct may be defined by a variety of 
operations. The empirical associations among 
these operations indicates the validity of the 
construct that they define. A correlation of 
definitional validity indicates the amount of 
association between two different measure- 
ments, both of which are related to the same 
construct by different operations. Thus, the 
definitional validity correlation coefficient of 
10 between the observers’ rating of how much 
participants talked and the clock reading of 
time the discussants talked provides evidence 
that both measurements can be regarded as 
sense impressions” tied to the same construct. 
As McClelland (1951) has noted “. . . A con- 
cept is useful theoretically (1) to the extent 
that the person using it makes explicit its 
operational meanings and (2) to the extent 
that those operations cut across and are con- 
firmed in different areas of behavior” (p. 158). 
ae observer ratings and all objective meas- 
i intercorrelated (by the product- 
ana qmm to provide measures of 
bd ona validity of the seven variables of 
omm Interaction, These definitional corre- 
All of a presented in Column 1 of Table 2. 
s x correlations of definitional validity 
th Pt that of XY and XX were significant at 

4 01 or .05 level. 
the Se publie the variable, the higher was 
rn dou between objective and sub- 
eiue US Thus, T, the most public 
E ea ibited a definitional validity of 
Heitor od those measures with least 
cae aa dity were mainly about private 
5 RR zd i and XY, which produced 
tively. The a Ed .21 and —:09, respec- 
standable in rrelation of .21 of XX is under- 
had to na of the fact that the observer 
the discussi d prediscussion agreement from 
correlation OY followed. The very low 
Correlation, indic for XY, the only negative 
completely una Hee that the observers were 
e to sense who actually shifted 
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TABLE 2 


CORRELATIONS OF OBSERVER, PARTICIPANT, AND 
SELF-RATINGS WITH OBJECTIVE MEASURES 


à Patt 9 
Measure | ^ Observer-objecti pantob- | objet 
server abjective ety | arrela 
tions tions 
xY —.09 (—.018)* .20 .16 
GX Taas .99) .12 18 
GY .34** .59) iol .20 
XX .21 74) .08 BAY fe 
YY z494 ( .93) 10 24 
ac ed b .58** a5 8m 
L 431%) Ae 053) :325 EK te 


* Coefficients corrected for attenuation by the unreliability of 
the correlated measurements. 
* p < 05 when r = (.25). 
** p < 01 when r = (33). 


his opinions, But observers did seem to be more 
accurate in judging final private agreement. 

Estimates of the true definitional validities 
of the measures, assuming perfectly reliable 
measuring instruments and observers, are 
shown in parentheses in Column 1 of Table 2. 

The definitional validity obtained may have 
been due in part to the fact that the two 
measures related by definition are positively 
correlated with a third measure not related by 
definition. For this reason, Table 3 was con- 
structed to examine the extent to which the 
correlations reflected some generalized observer 
rating not directly associated with the behavior 
the observers were supposed to be rating. This 
type of examination has been advocated by 
Campbell and Fiske (1959), who propose that 
in addition to being able to measure constructs 
by two different methods, “Measures of the 
same trait should correlate higher with each 
other than they do with measures of different 
traits involving separate methods.” 

Table 3 compares the definitional validity 
of a measure with the correlation of observers’ 
ratings with all other objective measures not 
related by definition. Only in one instance, 
XX, was the average correlation of observers’ 
ratings with all other irrelevant objective 
measures equal to the correlation with the 
relevant objective measure related by definition, 
and in no case did any objective measure 
correlate on the average with all other irrele- 
vant ratings higher than it did with the 
definitionally relevant rating. 
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TABLE 3 


AVERAGE CORRELATIONS OF ALL OBSERVER 
RATINGS AND ALL OBJECTIVE 
MEASUREMENTS 


(Fisher's z transformation used for averaging) 


Definitional va- | Average correla- | Average correla- 
lidity (der tion o SUE tion o poner 
Measure eem diem all other irrele- Hob 
and relevant ps vant (nondefini- (nondefini- 
jective measure | tional observer | tional) objective 
(by definition) ratings) measures 
XY —.09 .03 07 
GX 38 .21 19 
GY -34 +21 17 
XX .21 «13 +21 
YY 49 .16 .20 
ib 70 37 15 
L 31 -16 29 


Correlation of Self and Participant Ratings with 
Objective Measures 


Table 2 lists the correlations between 
participant ratings and objective measure- 
ments (Column 2) and between self-estimates 
and objective measurements (Column 3). 
It can be seen that generally the observers’ 
ratings had higher definitional validities than 
the validities of participant or self ratings, 

Like the observers, the Participants were 
able to recognize the overt measures more 
accurately. Again, T yielded the highest 
correlation with its relevant objective assess- 
ment while XY and XX, both private events, 
produced the lowest correlations of definitional 
validity. 

Participants were about as close to objective 
measurements as were observers in gauging 
GY, T, and L. There were five participants 
whose ratings could be averaged compared to 
only two observers, This Suggested that the 
ratings based on participant opinion were 
likely to be more reliable than those drawn 
from observers. Yet participants were less able 
to sense GX, YY, and T as objectively defined, 
Personal involvement seemed detrimental to 
perceiving who actually talked the most, who 
finally was most in agreement with others, and 
whose initial opinions were best matched by 
the group decision. 

Likewise, participants were less accurate in 
perceiving their own objective performance 
measures, GX, YY, and T, than were obser- 
vers. Moreover, each participant seemed less 
accurate in discerning how much he himself 
really accepted the group decision (GY). On 
the other hand, each participant was the most 
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accurate reporter of his own XX, in contra 
to the ratings of observers and participants, 


Discussion 


A number of inferences may be drawn from 
these results. First the results for all th 
measurements, except XY and XX, suppot 
the argument that something meaningful- 
differentially so in the sense of Campbell ant 
Fiske (1959)—is being assessed by the meas 
ures, GX, GY, YY, T, and L. 

Second, observers and participants were 
unable to sense objective private events, 
particularly XY—the stability of the d 
cussants from before to after discussion 
Contributing to the inability of members to 
sense private stability are coercive effects 
within the group itself, producing public but 
not private agreement. Coercion is brought 
about by the desire of the discussants for the 
discussion to move along smoothly and reach) 
a group decision in the time allowed. The 
individual members are thus likely to indicit 
their willingness to “go along" with the group 
to avoid being a stumbling block in the path 
to the group goal, while still holding to their 
original opinions. Coercion also arises when! 
some cases there is little chance among mem- 
bers to come to agreement and the group 
decision is reached by a majority vote. In thi 
case, the out-voted members sometimes take 
the “PIl show them" attitude and again !' 
the solutions privately exactly as they 7 
them initially, even though other membes 
were able to see other points of view and rei 
a common decision, Participants and observers 
may therefore see more final agreement than 
actually occurs. 

Third, despite their fewer numbes 
observers tended to estimate objective po 
formances more accurately than di à 
participants. Several reasons may be an p 
Participants only experience their own 8 
and their own personal ranking wl jeg 
group. They have no absolute know’ 

z E obse 
across groups such as is available to an n 
ver. It is more difficult to be an er 
participant-observer than only an oM tis j 

Fourth, the lack of understanding of W tis 
actually happening in the group Si^ 
points to the potential value of feeding i 
to the members the true situation w 
participating in as it is taking place (se ipet | 
1960a). The participants were unable p 


ly the extent of agreement among the 
after the close of the discussion 


in small group behavior are open to 
question. No simple conclusion about 
ter is possible for first of all investiga- 
may be particularly interested in mem- 
ceptions, per se, not what actually 
ed, Moreover, our data suggest that 
, participants, and self-appraisers, 
he order named, are fairly good estimators 
talked the most in the group. But, 
is, investigators interested in what 
appens in a group would find it most 
e to rely on observers rather than the 
ts themselves for gauging public 
objective assessments are not avail- 


in this study. For private events, 
ators would do well to seek objective 


, if these data have generality, then 
Specific practices bear re-examination. 
iple, a common criterion of group 
eness is the self-rating by members of 


‘data suggest that this particular self- 
does not even correlate significantly 
Zero at the .05 level with objective 
of how much each member agreed 
y with the group decision. Observers and 
rticipants even seem better at making 
ticular estimate. 
y should participants, as a whole, and 
have been more accurate in assessing 
age individual’s actual acceptance of 
TOU decision than the individual himself 
iting his own degree of acceptance? The 
vet may be a matter of statistics, but the 
ences are there, nevertheless. The 
„accuracy of the other participants 
tsmply have been due to the greater 
lity of the pooled judgment of four 
pets about a fifth than the fifth member 
imself. The superior accuracy of 
may have been due to their ability 
end group differences. Typically, we 
po and intercorrelated differ- 
groups in these problem situations 
1955b). Observers had an opportunity 
luate both members and groups differ- 
Y augmenting the total correlation of 
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their observations with the performance of the 
differing members in the differing groups. 
Individuals could only assess themselves based 
on their experience in one group. Any correla- 
tion between groups could not inflate the 
individual raters’ overall accuracy. 

Regardless of the reason, the absolute 
correlation of only .20 between members’ 
stated acceptance of the group decision and 
their actual acceptance suggests increased 
caution concerning the utility of subjective 
estimates as criteria of group effectiveness, This 
low correlation of .20 for the 12 groups of 60 
subjects is consistent with a product-moment 
correlation of only .13 found for 19 similar 
groups of 95 college subjects in the extent they 
actually accepted the group decision (GY) on 
10 trials and the extent they rated themselves 
attracted to their group (Bass, 1955a). If 
rated satisfaction with group decisions does not 
really reflect actual acceptance of the group 
decisions, what does such subjective satis- 
faction measure? A review of conference 
research studies suggests that rated satisfaction 
with group decisions is much more closely 
associated with a member's satisfaction with 
his own work at the conference than with 
group performance per se. For example, 
satisfaction with the group outcomes was found 
to correlate much more highly with. personal 
matters than with such group indices as 
number of agenda items completed (Peter- 
man, 1951). Conferees were less satisfied with 
decisions that took a longer time to make, yet | 
“rapid decision-making may reflect apathy, 
not efficiency” (Bass, 1960b, p. 46). 

Seventh, the generality (or lack of it) of 
these data should be considered in the light of 
other possible subjective and objective assess- 
ments and modes of analysis. For example, the 
objective data were unrestricted in degrees of 
freedom by problem or group. In the same 
way, the observers were free to assign all 
members of one group higher ratings than all 
members of another group. Yet, psychologi- 
cally, participants were restricted in knowledge 
to what happened in their own group. Their 
anchor points were based on their own group 
only. Statistical results might have been some- 
what different if all subjective assessments had 
been within-group comparison judgments with 
constant means for each problem and each 
iginal data of this study, 


group, Or, given the original. stud 
correlations between objective and subjective 
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data could have been run problem-by-problem 
and group-by-group, and then averaged. This 
would no doubt have favored participants over 
observers as accurate estimators of objective 
performance. However, these results, although 
of interest theoretically, would have lacked 
practical implications, for they would have 
been results to be generalized only to situations 
where all groups are equal in performance on 
every problem. An analysis apportioning the 
covariance between each objective and sub- 
jective measure would permit estimating the 
independent effects of problem, group, and 
individual and the covariance between objec- 
tive and subjective measurement. Resources 
permitting, this may be done. 


SUMMARY 


The purpose of this study was to investigate 
the relationships between subjective and 
objective assessments of group behavior, 
Discussion behavior was measured objectively 
by a social analog computer, while subjective 
assessments were made by observers and 
participants aided by a check list. 

There tended to be agreement between the 
two different methods of measurement, not 
accounted for by mutual correlations with 
irrelevancies, suggesting that both objective 
and subjective measurements were concerned 
with the constructs, 

Observers and participants had difficulty in 
inferring private events that took place during 
the discussion, The difficulty of rating private 
events suggested that observers’ ratings are 
most appropriate for overt events. Yet the 
successful study of covert events Tequires 
reliance on objective measurement, 
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1960) theory of cognitive dissonance 
states that if a person simultaneously 
holds two cognitions that are psychologically 
inconsistent with each other (dissonance), he 
- attempts, in some way, to make them more 
consistent (dissonance reduction). One deriva- 
tion from this theory is that if a person (or a 
fox) expends effort in an attempt to reach a 
goal (eg., a bunch of grapes) and fails, he 
experiences dissonance. His cognition that he 
has tried hard to reach the goal is dissonant 
with his cognition that he did not reach it. 
Probably the simplest way for him to reduce 
dissonance in such a situation is by leaving the 
situation and convincing himself that he did 
not really want the goal in the first place (e.g., 
the grapes were sour anyway). But suppose he 
cannot readily leave the situation, and thus 
continues to expend effort in order to reach the 
goal. Each try he makes at attaining the goal 
strengthens his cognition that he wants the 
dg io an attempt to convince himself that 
hun really want the goal becomes quite 
E. S S and, therefore, is not an adequate 
AK : E. uce dissonance. Under these circum- 
E n s " must reduce dissonance by some- 
A Dr EE the expenditure of effort in 
Du oie ru failure to achieve his avowed 
ui e. That is, he must find something else 
Situation to which he can attach value— 

E yaa event, 
m E ne theory leads to the follow- 
SERA P on: if à person continuously 
ieee M toattain a goal and is unsuccess- 
Rari i associated with this experience 
ore attractive to him. This prediction 


1 This a 
in pir esas is based upon a dissertation submitted 
of Doctor B De of the requirement for the degree 
, and Ns Philosophy at Stanford University in 
Brant from A research was partially supported by a 
minister the National Science Foundation, ad- 
The Y Leon Festinger. 
to the SEPA. gratefully acknowledges his indebtedness 
to Robert RS of his committee, Leon Festinger, and 
Served as men Sears and Douglas H. Lawrence who 
members of his committee. 


Poot: (1957; Festinger & Aronson, 
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appears to be contrary to inferences that might 
be drawn from theresults of several experiments 
involving the concept of secondary reinforce- 
ment (Bugelski, 1938; Cowles, 1937; Melching, 
1954; Mitrano, 1939; Mote & Finger, 1942; 
Saltzman, 1949; Skinner, 1938; Wolfe, 1936). 
Generally, in these experiments, animals 
learned to associate previously neutral stimuli 
with the attainment of a reward. Subsequently, 
these stimuli came to function as rewards in 
and of themselves. 

These studies do not make clear exactly 
what processes are involved in secondary 
reinforcement. Strictly speaking, all that can 
be said is that a previously neutral stimulus 
temporarily serves the function of a reinforcer, 
It seems reasonable to infer, however, that 
through association with reward, the secondary 
reinforcer actually undergoes an increase in 
attractiveness. If this inference is correct, we 
have two theories that yield somewhat different 
predictions: reinforcement theory suggests that 
stimuli associated with reward gain in attrac- 
tiveness; dissonance theory suggests that 
stimuli associated with “no reward" gain in 
attractiveness. These predictions, although 
nol mutually exclusive, can be discriminated. 

The crucial discriminating variable seems to 
be the expenditure of effort. The prediction 
from dissonance theory holds only if a person 
has expended effort in an attempt to attain the 
reward; if he has not expended effort there is 
no cognitive inconsistency, hence nothing to 
justify, hence no gain in the attractiveness of 
associated stimuli. Reinforcement theory sug- 
gests no such distinction—the implication 
being that stimuli associated with the attain- 
d become more attractive to a 


ment of a rewart à 
person regardless of the effort involved. The 


following experiment was designed to investi- 
gate the inferences from these theories by 
systematically manipulating the attainment 
of reward and the expenditure of effort under 
controlled laboratory conditions. Our hypoth- 
esis is that, as effort increases, stimuli associ- 


ated with lack of reward become more attrac- 
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tive. To be more specific, since any effects 
due to secondary reinforcement should remain 
constant regardless of effort, stimuli associ- 
ated with lack of reward should become more 
attractive (relative to stimuli associated with 
reward) as effort increases. 


METHOD 


General Design of the Experiment 


To test the hypothesis, the following conditions 
were necessary: To have some subjects perform a 
difficult, effortful task, while others performed an easy, 
effortless task, in order to receive a reward; to have the 
reward occur only occasionally; and to provide a 
distinctive difference between the goal stimuli during 
the rewarded and nonrewarded trials. 

Subjects could then be asked to rate the relative 
attractiveness of the two goal stimuli prior to the 
experiment and again after having undergone the 
experiment. In this way it would be possible to ascertain 
the relative effects of reward and nonreward on attrac- 
tiveness as a function of the amount of effort expended 
during the performance of a task. 


Description of the Task 


The task involved “fishing” for containers (which 
were actually pocket-sized flashlights) in order to 
obtain money that was inside some of them. The bulbs 
and batteries were removed from the containers and 
small metal rings were inserted in the bulb sockets, 
‘The containers were identical in all respects except for 
color; some were red and some were green. Those of 
one color contained either two, three, or four dimes 
wrapped in tissue paper; those of the other color con- 
tained only tissue paper. The containers were arranged 
randomly along one wall of the experimental room, 
covered by a narrow strip of cardboard in such a 
manner that only the metal ring was visible, not the 
color. Thus the subjects could not see the color of a 
container until after they had removed it from under 
the cardboard covering. 

In the Easy condition, subjects were instructed to 
Pull the containers out from under the cardboard 
covering by attracting the metal ring with a horseshoe 
magnet. This was an extremely easy task; subjects took 
an average of 14 seconds to pull out and open each 
container. In the Effortful condition, each subject 
was given a hook that was tied to one end of a string. 
The subjects were instructed to hold the string at a 
specified distance from the hook and “catch” the 
container by inserting the hook into the metal ring. 
Subjects in the Effortful condition took an average of 
52 seconds to pull out and open each container. 

For obvious reasons, it was necessary to disguise the 
purpose of the experiment. The investigator therefore 
introduced the experiment as an attempt to ascertain 
the effects of reward on fatigue. Subjects were told 

that in this experiment people were asked to perform 
one of several tasks that varied in their degree of 
tediousness. The experimenter explained that 50% 
of the subjects received a reward Periodically during 
the performance of their task, while the other 50% 


ELLIOT ARONSON 


received nothing. “The purpose of the experiment i 
simply to determine whether getting a reward. fr 
doing a more or less tedious task will counteract 
effects of fatigue, and hence, result in more efficien 
performance on the part of the people who receives]. 
reward." All subjects, regardless of condition, we 
told that they were among the people who would bel 
rewarded for their performance. They were told thal 
several of the containers held money, and that they 
could keep all of the money they could find in them, 
In order to justify the fact that only containers o 
one color would be rewarded, the experimenter informed | 
the subjects that they would be rewarded accordin 
to a schedule of “differential reinforcement,” 
“That is,” the experimenter said, “there are 
colors; red and green. In 85% of the contain 
one color, some money has been placed; 15% t 
empty. In those of the other color, just the rever | 
is true; there is money in only 15% of them, wh 
85% are empty. The reason for this is that we 
to see how long it takes you to discover which 
is more likely to contain a reward. We'll ask yon]. 
about this after the experiment. Even after you art | 
convinced that one color is more apt to contain af 
reward than the other, it is still to your advantal 
to examine the contents of the other, since 15% 
them will contain money." | 
Actually, of course, all of the containers of one colit 


did. Subjects were given false ratios so that they wo 
open all containers and thus be exposed to each for all} 


into a box before going on to the next one. The Eo 
was used in order to limit the subject’s exposure #4 
each color. 


Obtaining Measures of Color Preference 


In order to avoid contamination of the rests 
was important to make it difficult for subjects M. 
the relationship between the fact that they were 1! 
the color of the containers for attractiveness an a 
fact that some of the containers held money. This m 
accomplished in the following way. The experimen! 


the nature of the task and the schedule of veinforcemen | 
he suddenly changed his demeanor and enti ee 
began to explain “an extremely interesting and 1 
side-finding.” 
While conducting the experiment, I happ® 
notice that it doesn’t take a person prec tained | 
same amount of time to pull out each com tt 
there is quite a bit of fluctuation. I want f chan 
whether this fluctuation was just a matter 0 hing 
or whether it might be lawfully related to some | 
else in the experiment which I could measure ifo 


I uncovered was a very intriguing relationship: 


ned to | 
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a large extent, fluctuations in a person's performance 

time are a function of his color preference. It looks 

as though the experience of seeing one’s preferred 

color builds up a person’s motivation, or gives him a 

feeling of well being, or inspires him—I don’t know 

exactly what. But whatever it does, it leads to more 

efficient performance on the very next try. What I 

would like you to do, then, is to examine these two 

containers carefully and tell me which of the two 
colors you think is prettier. 
After the subjects inade their selection, the experi- 
menter showed them a scale and asked them to rate the 
relative magnitude of their preference from 0 (virtually 
equal in prettiness) to 5 (a very large difference in 
prettiness). 

Tt was necessary to obtain from each subject two 
Tatings of the relative attractiveness of the colors—one 
immediately before the performance of the task and 
one immediately afterward. An important problem 
was justifying the necessity of the post-experimental 
rating without arousing the subjects’ suspicions con- 
cerning the true nature of the experiment. This was 
accomplished in the following manner. The experi- 

menter reminded the subject about the aspect of the 

experiment that he, personally, was interested in—the 
eflect of color preference on fluctuations in performance 
time. He explained, 

This is an extremely difficult thing to assess for the 

following reason. When people are first shown these 

containers, they are asked to rate which one they 

think is prettier. Chances are, they haven't had a 

great deal of contact with these two specific colors 

prior to the experiment. Hence their rating of color 
preference is probably not a very stable one, since it 
is not based upon much experience. Then, after 
they've rated their preference, they spend the next 
oe minutes coming into repeated contact with 
ur colors—picking up the containers, handling 

HM opening them, closing them, etc. Due to this 

eer etion of experience with the colors, most of 

i T jects undergo a shift in their color preference 

e course of the experiment. This is a rough 
poze for me; since I’m trying to determine the 

RU of color preference on performance at every 

ave te experiment, it would be nice if I could 

e at a person's color preference remains 

UE at every point in the experiment, But I 

mae ne this; the subject's color preference is 

"à e ly changing. I need some way of obtaining 

Bey mate of each person's color preference at 

d Paine in the experiment in order to get an 

P Rue of its effect upon performance time. 

Rie ets m € feasible for me to ask the subjects to 

Denn lor preference every time they see one of 

aes iners, This would be far too disconcerting. 
T pe been doing, with some success, is this. 
tad of th subjects to rate the colors again at the 
rating Lica I then compare their second 

Renee their first rating, and, by applying a 

See formula, I can plot an estimate of 

Mincir 5 color preference at every point in the 

D ENSA Th line with this, I'd like you to look at 

y Beie ane and tell me which one you think 

SWA and by how much, according to this scale. 
to think of your previous rating. That is, 
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don’t consciously try to give the same rating you 

gave the last time just to show me what a consistent 

person you are. On the other hand, don’t consciously 

attempt to report a change if there was no change. 

After the subjects had given their post-experimental 
rating, they were asked to account for any shift in their 
initial rating. Virtually all subjects referred to the hue 
and brightness of the object itself. For example, “The 
green simply looks much brighter than it did before”; 
or “The red looks much richer.” 


Subjects and Design 

e.The subjects used in the experiment were 60 under- 
graduate women. In order to be certain that money 
would be an important reward, the experimenter 
recruited the subjects from the university employment 
service and informed them that the money which they 
found in the containers would constitute their salary. 
Thirty subjects were tested in each condition. The 
rewarded and nonrewarded colors were counter- 
balanced. 

The instructions given to the subjects in the Easy 
and Effortful conditions were identical except for the 
description of the tasks. In the Effortful condition, 
subjects were told that theirs was an extremely tedious 
task; in the Easy condition, subjects were told that 
theirs was not a particularly tedious task. The task was 
then described and demonstrated. 

"There were a total of 51 containers in the population 


- from which each subject fished. Seventeen of these 


contained money while 34 were empty. All subjects 
were told that they would be working for a limited, 
but unspecified period of time. Hence the more quickly 
they worked, the more money they would have a chance 
to make. Actually, they were allowed to work until 
they had fished out 16 nonrewarded containers, Since 
the population consisted of twice as many nonrewarded 
as rewarded containers, each subject received approxi- 
mately eight rewarded containers. There was an 
average of $.25 in the rewarded containers; thus each 
subject earned approximately $2.00. 

At the close of the session, the experimenter ex- 
plained the true nature of the experiment to each 
subject and discussed the need for the deception. 


RESULTS AND DISCUSSION 
For each subject, a score was calculated 
to indicate the change in her ratings of the 


2 The number of subjects actually tested was 68. 
Three of the subjects were dropped from the experiment 
because they indicated that they had guessed the 
purpose of the experiment. That is, they suspected that 
the presence of money in the containers was supposed 
to influence the perceived attractiveness of the colors. 
Two of these were in the Effortful condition, one was In 
the Easy condition. In addition, prior to the experiment, 
the experimenter decided to discount the results of all 
subjects whose jnitial color preference was four or 
higher. Such an extreme initial preference meant, for 
all practical purposes, that any subsequent shift in color 
preference would be limited to the direction of the 
initially unpreferred color. The results of five subjects 
were discarded for this reason; two were In the Effortful 
condition, three were in the Easy condition. 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF THE SHIFT 
IN RATED ATTRACTIVENESS OF COLORS 


Easy Effortful 
g y| y y| g 

3 | 2/2) 2) 3/2 

ga s| 5 |ss s| 2 

E oE B8 $E | E 

SE 35 8 |$5|35| 8 
Initial preference | M -F.21| +.88) 4-.52| —.71|4-.17 | —.30 
forrewarded color | c — 1.49| .79| 1.24| 1.06| .85 | * 1.07 

NT 6 13 7 6 13 
Initial preference | M --1.19|--1.42|-1.31| 4-.12]4-.40 | +.27 
for nonrewarded | c 1.22} .76| 1.03 82| .54 .T0 

color N 8 9 17 8 9 17 
Combined initial | M - +.73|+1.20| +.97| —.27|4-.31 | +.02 
preference € 1,42) .81| 1.18| 1.03| .69 92 

N 15 15 30 15 fis 30 


Note.—A plus sign indicates a shift in the direction of the 
rewarded color; a minus sign indicates a shift in the direction of 
the nonrewarded color. 


relative attractiveness of the colors. The 
ratings can be considered as points on a con- 
tinuum from “Red 5” to “Green 5,” with a 
midpoint of 0. If the change was in the direc- 
tion of the rewarded color, it was given a plus 
sign; if the change was in the direction of the 
nonrewarded color, it was given a minus sign. 
For example, if green was rewarded, and a 
subject’s pre-experimental rating was “Red 2” 
and her post-experimental rating was “Red 
1,” this is a change of one unit in the direction 
of the rewarded color and, therefore, her score 
would be 4-1. 
The means and standard deviations of these 
‘scores for the subjects in the Easy and Effort- 
. ful conditions are presented in Table 1, sepa- 
rately for those subjects who initially preferred 
the rewarded color and for those who initially 
preferred the unrewarded color. An examina- 
-tion of the table shows that in the Easy con- 
dition, subjects shifted their color preferences 
in the direction of the rewarded color. In the 
Effortful condition, however, no such shift 
occurred. "This general trend can be seen most 
clearly when one examines the combined 
initial preferences. The significance of this 
difference was tested by an analysis of variance. 
This technique was used for the following 
reason. Table 1 shows that an individual’s 
initial color preference was a source of vari- 
ability; that is, within a given condition, 
those subjects who originally Preferred the 
nonrewarded color tended to shift their pre- 
ference more toward the rewarded color than 
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did those who initially preferred the rewards |: 
color. An analysis of variance was used in 
order to eliminate the effects of this know 
source of variance. 

Separate analyses of variance were compute 
for the red rewarded and green rewarded 
situations. In both situations, the interaction 
variance was considerably smaller than the 
within cells variance. The F ratios, therefore 
were computed by dividing the variam 
between conditions by the variance within| 
cells. With red rewarded, F = 9.64. With 
and 26 degrees of freedom, this ratio exceeds | 
the .01 level of significance. With green te | 
warded, F = 4.84, With 1 and 26 degrees ot : 
freedom, this ratio exceeds the .05 level d 
significance. 

This finding suggests that in the Easy con: 
dition, secondary reinforcement was operating 
to increase the attractiveness of the rewarded 
color. Why was there no such increase in the 
Effortful condition? Does secondary reinforce 
ment not operate when effort is involved! 
There appears to be no reason to assume this 
Our interpretation of the data is that in both 
the Easy and the Effortful conditions, ther 
was an effect due to secondary reinforcement. 
This resulted in an increase in the perceived 
attractiveness of the rewarded color. In the 
Effortful condition, however, dissonance wi 
also involved. In this condition, every m 
the subjects obtained a nonrewarded containtr 
their cognition that the container held ™ 
money was dissonant with their cogniti | 
that they had worked hard to obtain it, By 
finding something about the empty conia 
to which they could attach value, they C? n 
reduce dissonance by justifying the expe" 
ture of effort for obtaining an otherwise wot! 
less object. The rewarded and nonrewardé 
containers were identical in all respects ex? 
for their color. Thus a convenient way for. d 
subjects to reduce dissonance was by s i 
ing themselves that the nonrewarded C? 
was more attractive than they had previo it 
thought. In the Effortful condition, the, 
appears that two opposing forces Welt 
operation; one enhancing the value ° ale 
rewarded color, the other enhancing the M je 
of the nonrewarded color. In the Easy i ; 
tion subjects expended very little effo 
performing the task; consequently, E 
little or no dissonance involved when This 
obtained an unrewarded container. 
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subjects in the Easy condition found the 
rewarded color to be more attractive. 


Some Theoretical and Methodological Problems 


Although the differences between experi- 
‘mental conditions occurred as predicted, it is 
‘somewhat inelegant to explain the fact that 
‘there was no change in the Effortful condition 
‘by positing two equal and opposing forces. 
"The results would have been more striking if, 
in the Effortful condition, the scores had been 
markedly in the direction of the nonrewarded 
color, Perhaps this effect could have been 
achieved by reducing the size of the reward; 
this would have reduced the attractiveness of 
the secondary reinforcer, and hence, might 
have resulted in a greater relative gain in the 
attractiveness of the unrewarded color. But 
decreasing the size of the reward might have 
decreased the amount of dissonance also. For 
example, a person probably would experience 
m dissonance if he failed to obtain $.01 than 
ithe failed to obtain $10.00. 

-.À possible alternative method of bringing 
about a greater increase in the attractiveness 
of the nonrewarded color might be to increase 
the amount of effort involved in the Effortful 
Po That is, a greater expenditure of 
E vain would probably result in more 
Ei ae ao hence a greater gain in the 
A ctiveness of the nonrewarded color. 

E. oing this, one might also introduce 
Bark ; into the rewarded trials; the subject 
all the Ex iu: obtaining $.25 is not worth 
MN NM : er occurred, it would tend to 
ESO ated attractiveness of the re- 
Bitte Ss well as that of the nonre- 
Weculations ` ai emphasized that these 

ertheless " yet, have not been tested. 

TUM his d „Seems reasonable to assume 
Bid seconda esign the effects of ‘dissonance 
dent; dud E o, are interdepen- 
Probab] pting to manipulate one would 

ably affect the other. 


pre Alternative Explanation 
Bly tds of this experiment can alter- 
| (1950) a interpreted in terms of White’s 
in the Eno of competence. Since the task 
à E zw ful condition was a difficult one, 
AM aes have served as a challenge 
d Jects in that condition. Thus each 


e 
they Succeeded in hooking a container, 


379 


they may have experienced a feeling of efficacy 
or mastery. This positive feeling may have 
then generalized to the containers—red and 
green alike. The feeling of efficacy may have 
been so great that, by comparison, the positive 
effect due to the presence of money may have 
been negligible. That is, although the presence 
of money served to make the rewarded color 
more attractive in the Easy condition, its 
presence in the Effortful condition may have 
been too small to be apparent in the face of 
the effect of a very great feeling of efficacy. 
Such an interpretation would account for the 
fact that the relative attractiveness of the two 
colors remained virtually constant in the 
Effortful condition, while in the Easy condi- 
tion, there was a significant shift in the direc- 
tion of the rewarded color. 

There is evidence, however, that casts some 
doubt upon this interpretation. If the effect of 
competence in the Effortful condition had 
been great enough to dwarf the effect of secon- 
dary reinforcement, one might expect subjects 
in the Effortful condition to have derived 
more enjoyment from their task than subjects 
in the Easy condition. Some data were gathered 
that are relevant to this problem. After per- 
forming the task, each subject was asked to 
rate his feelings about the task on an 11-point 
scale, from O (it was an extremely dull, un- 
enjoyable task) to 10 (it was an extremely 
interesting, enjoyable task). The mean rating 
of subjects in the Effortful condition was 3.91, 
while that of the subjects in the Easy condition 
was 3.27. Thus both tasks were considered 
about equally dull. Although negative evi- 
dence is not conclusive, it nevertheless weakens 
an interpretation of the results in terms of 
competence. & 


Secondary Reinforcement or Scarcity | 
The experiment was designed in such a way 
that each subject received about twice as m: 

nonrewarded as rewarded containers. T 
reason for this imbalance was to increase 

number of “dissonant” experiences for eac 
subject. But the fact that the rewarded col 
was also the scarcer color provides a source 
ambiguity in the interpretation of the result: 
In the Easy condition, the rewarded color ma: 
conceivably have undergone a gain in attra 
tiveness—not because it contained a rewar 
but because people, perhaps, have a tendenc; 
to value things that are scarce in the environ: 
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ment. To eliminate this ambiguity, 60 addi- 
tional subjects were run under exactly the 
same conditions, except that there was money 
in every one of the containers. If the results 
in the Easy condition had been a function 
of scarcity rather than secondary reinforce- 
ment, one would expect the same gain in the 
attractiveness of the scarcer color under con- 
ditions of equal reward. No such gain occurred. 
In both the Easy and the Effortful conditions, 
there was a very slight change in the direction 
of the scarcer color. The mean of the change 
in the relative attractiveness of the two colors 
in the Effortful condition was .02, while in 
the Easy condition it was .13, Therefore, the 
results previously reported cannot be attrib- 
uted to the effects of scarcity. The results 
indicate that, at least with human subjects, 
secondary reinforcement does involve a gain 
in the attractiveness of a previously neutral 
stimulus, 


SUMMARY AND CONCLUSIONS 


A laboratory experiment was conducted 
which demonstrated that the degree of effort 
a person expends in attempting to achieve a 
reward has an effect on the relative attractive- 
ness of stimuli associated with rewarded trials 
and stimuli associated with nonrewarded 
trials, The hypothesis was that, as effort in- 
creases, there is a corresponding increase in 
the relative attractiveness of stimuli associated 
with nonrewarded trials. 

To test the hypothesis, the experimenter 
had subjects perform a task that involved 
fishing for containers in order to obtain money 
that was inside some of them. For some sub- 
jects, the task was made easy (Easy condi- 
tion); for others, the task was made effortful 
(Effortful condition). The rewarded containers 
were of a different color from that of the 
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nonrewarded containers. Subjects were asked 
to rate the relative attractiveness of the two 
colors before and after performing the task, 

The results were in accord with the predic. 
tion. In the Easy condition, there was an 
increase in the relative attractiveness of the 
rewarded color. In the Effortful condition, 
however, there was no change in the relative 
attractiveness of the two colors. The difference 
between the subjects’ ratings in the two com 
ditions was highly significant. The results were} 
interpreted in terms of an interaction between | 
the effects of cognitive dissonance and those di 
secondary reinforcement. : 
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| HE present study is concerned with the 

complex effects of participation in a 
presumably vicarious aggressive activity 
upon subsequent aggressive behavior. A num- 
ber of studies have demonstrated that the 
expression of aggression—whether directly or 
symbolic form—results in a lowering of 
Subsequent aggression (Berkowitz, 1960; 
Teshbach, 1955; Pepitone & Reichling, 1955; 
Rosenbaum & de Charms, 1960; Thibaut & 
Coules, 1952). However, there is also experi- 
mental evidence to the effect that aggressive 
activity has a stimulating effect upon the 
manifestation of other aggressive acts (Fesh- 
bach, 1956; Kenny, 1953); that is, aggression 
may breed aggression. 

Since both possibilities—reduction and stim- 
ulation—have been experimentally observed, 
the pertinent issue then is under what con- 
ditions a vicarious aggressive act increases 
and under what conditions it decreases the 
probability of subsequent aggressive behavior. 
One such condition suggested by differences 
in procedure between the studies that ob- 
ps evidence of a cathartic effect and those 

'emonstrating a stimulating effect is the emo- 
tional State of the subject at the time the 
E act is performed; that is, if the 
3 ject is angry at the time he engages in the 
E activity, he can then use the act 

Satisfy and thereby reduce his hostility. 
E hypothesis is suggested that in 
E activity to have drive reducing 
b components of the drive must be 
Bs: pu evoked during performance of the 
Eu i that is, there must be some func- 
‘and ae between the vicarious act 
While it rs drive instigating conditions. 
eae undoubtedly true that the vicissi- 
be direct] € will arouse hostilities that cannot 
[rur dede jt does not follow 
Property d rect aggressive act will have the 
evoked ne hostility that has been 
Sta er markedly different circum- 
rances, According to thi i 
Child's an, g to the present view, à 

ger toward its mother will not be 
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tude; 


reduced by an aggressive act toward a doll 
figure unless its anger toward the mother is 
aroused when the aggressive act is performed, 
The evocation of anger may not be a sufficient 
condition—the doll figure may have to be 
similar to the mother—but it is probably a 
necessary condition for drive reduction to take 
place. 

If the subject is not hostile at the time of 
participating in an aggressive act, his sub- 
sequent aggressive behavior will not merely 
remain unaffected but is very likely to be 
increased. An increase in aggression following 
a vicarious aggressive act could result from a 
number of different processes: a reduction 
in inhibition or aggression anxiety, reinforce- 
ment of aggressive responses, and finally 
conditioned stimulation of aggressive drive 
and/or aggressive responses. 

On the basis of the foregoing considerations, 
the following hypotheses are proposed: Partici- 
pation in a vicarious aggressive act results in 
a reduction in subsequent aggressive behavior 
if aggressive drive has been aroused at the 
time of such participation; if aggressive drive 
has not been aroused at the time of participa- 
tion in a vicarious aggressive act, such partici- 
pation results in an increase in subsequent 
aggressive behavior. 


METHOD 


mental procedure consisted of arousing & 
drive before pafticipation in a 
act or before participation in a 
g measures of aggression 


The experi 
subject's aggressive 
vicarious aggressive 
neutral act and then obtainin, sures r 
subsequent to these interpolated activities. The varia- 
tion in level of aggression was accomplished by means 
of an insult versus noninsult condition and the variation 
in the interpolated activity consisted of exposure to a 
fight film versus a neutral film. 


Subjects 

ale college student volunteers 
ne of the four treat- 
imental vari- 
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small groups by the experimenter so that nine experi- 
mental sessions in all were held, three for the Noninsult 
Fight Film condition and two sessions for each of the 
other experimental conditions. 


Procedure 


Insult versus Noninsult condition. Subjects assigned 
to the insult groups were subjected to a number of 
unwarranted and. extremely critical remarks. These 
comments essentially disparaged the intellectual 
motivation and the emotional maturity of the students.! 
Previous studies (Feshbach, 1955; Gellerman, 1956) 
have provided abundant evidence that this technique 
successfully arouses hostility toward an experimenter. 
Subjects assigned to the Noninsult condition were given 
standard test instructions. 

Aggressive Film versus Neutral Film condition. 
Subjects in the Insult and Noninsult groups then 
witnessed either a Fight Film or a Neutral Film. The 
Fight Film consisted of a film clip of a rather exciting 
prize fight sequence taken from the motion picture 
Body and Soul while the neutral film depicted the 
consequence of the spread of rumors in a factory. The 
duration of each of the films was approximately 10 
minutes, 

As a rationale for the presentation of the film, 
the subjects were told before the film was presented 
that they would be asked to judge the personality of 
the main character in the film. Following the completion 
of the film, each subject indicated his impression of the 
personality of the hero of the film on a questionnaire 
provided for that purpose. 

Dependent Measures of Aggression. All subjects were 
given a modified word association test which, in a 
previous study (Gellerman, 1956) had been shown to 
be sensitive to differences in the arousal of aggression. 
The test involves the presentation of five aggressive 
words interspersed among six neutral stimuli as follows: 
wash, choke, travel, massacre, walk, murder, relax, 
stab, sleep, torture, listen. The subjects are asked to 
give in written form a series of associations to each 
word, The stimuli are presented orally and also visually, 
the experimenter holding up a 5" X 8" card on which 
the stimulus word is printed. The subject’s Aggression 
score is based on the number of aggressive word asso- 
ciations among the first 10 responses to each of the 
aggressive stimulus words. The maximum score that 
can therefore be obtained on this measure is 50. 

Subsequent to the administration of the word 
association test, the first experimenter left the room 
having presumably completed the study. A second 
experimenter then entered and informed the subjects 
that the psychology department wished to assess 
students’ opinions of the value of participating in 
psychological experiments. A questionnaire was then 
administered dealing with the subjects’ attitudes toward 
the experimenter and with their evaluation of the 
experiment. The questionnaire which consists of six 
items, each of which has six alternatives, is described in 
more detail in a previous study (Feshbach, 1955). It is 


+The author wishes to express his gratitude to 
Abraham Wolf for his competence and courage in 
carrying out this phase of the experiment. 
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scored so that the least aggressive choice for a particular 
item is given a score of 1 and the most aggressive choi 
a score of 6. 


RESULTS 


By hypothesis, it was predicted that th 
Insult group exposed to the Fight Film woul 
manifest less subsequent aggression on ead 
of the two measures of aggression than thy 


to the Neutral Film. The word associatio 
data bearing upon these predictions are pe 
sented in Table 1. The mean differences are inf 
accordance with expectation, the Insult-Figll 
(IF) Film group responding with fewer ag 
gressive associations than the Insult-Neutil 
(IN) Film group and the Noninsult-Fight 
(NIF) Film group responding with mot 
aggressive associations than the Noninsult) 
Neutral (NIN) Film group. The results of at 
analysis of variance of the data indicate thit 
the interaction between the Insult and th 
Film variable is statistically significant. Thy 
difference between the IF Film and the N 
Film groups falls short of the 5% confident 
level, the value of / being 1.9. The different. 
between the NIF Film and NIN Film group 


^d TABLE 1 
A. Mean Accresstve WORD ASSOCIATION 
RESPONSES OBTAINED UNDER EACH 
EXPERIMENTAL CONDITION 


Film (F) 
n Fight (N) Neutral (N) 
21) 
Insult 24.5 (25) 28.9 
Noninsult 27.7 (29) 25.3 05 


p oF 
B. Summary or ANALYSIS OF VARIANCE 


AGGRESSIVE WORD ASSOCIATION RESPONSES 
r 
Source SS df MS E 
3 
D 8.93 1 8.9 
F 38.43 1 38.43 | gr 
DF 291.59 1 291-9 
Within | 6,111.80 | 96 : 
Total | 6,450.75 99 
ored due P 


® The word associations of one subject were nof $° 
illegibility. 
*$ < 05. 


Vicarious AGGRESSION: STIMULATION OR CATHARSIS? 


TABLE 2 
DISTRIBUTION OF AGGRESSIVE WORD ASSOCIATION 
RESPONSES FALLING ABOVE AND BELOW 
THE MEDIAN AS A FUNCTION OF INSULT 
Frcur FILM AND INSULT NEUTRAL FILM 
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TABLE 4 


DISTRIBUTION OF AGGRESSION QUESTIONNAIRE 
Scores FALLING ABOVE AND BELOW THE 
MEDIAN As A FUNCTION OF INSULT 
FicHTr FILM AND INSULT NEUTRAL 


TREATMENTS Firm TREATMENTS 
Treatment «n >27 Treatment «ans E 
Insult Fight Film 17 8 Insult Fight Film 
Insult Neutral Film 10 19 Insult Neatral Film i 2 


T Note.—x? = 6.02; p < .05. 


TABLE 3 


A COMPARISON OF MEAN SCORES ON THE 
i AGGRESSION QUESTIONNAIRE 


Tosult-Fight | Insult-Neu- | Noninsult- | Noninsult- 
EY It New- | N Neutral 
wan) | say dm | ND, 
M 14.6 19.5 13.7 1 
A i 2 5.0 
p 3.72 3.90 2.52 2.95 


Note IF-IN = 4.7; p < 01. 
One subject failed to complete questionnaire. 


yields a / value of approximately 1 which is 
Clearly not significant. 

‘The contrast between the IF Film and IN 
Film groups is more sharply delineated by a 
E. median split. The chi square for the 

urfold table presented in Table 2 is 6.02 
Which yields a p value of <.02. The word 
‘sociation data, then, indicate that under 
E of anger-arousal, witnessing a 
a t film results in a lowering of aggression. 
“Owever, the hypothesized stimulating effect 
k an aggressive film under nonaroused condi- 
Í E 1s not borne out by the data. 

Bs oi onnie data are presented in 
E ecause of the lack of homogeneity 
d Variance between the IN and NIF Film 
} b Separate comparisons were made 
sere pertinent groups and, in these com- 

Tm S, the variances of the respective dis- 
the e are not reliably different. As was 
IF Fi E with the word association data, the 
Pion group displays significantly less ag- 

í on the questionnaire than does the 
lis oup. The difference between the Non- 
Due: groups is not in the predicted direction 

is small and unreliable. 
Bu ree’ in subsequent aggressive 
TA etween the insulted group exposed 
ght film and the insulted group exposed 


Note.—x? = 15.1; p < .001. 


to the neutral film is further illustrated by a 
simple median split. The chi square for the 
fourfold table presented in Table 4 is 15.1, 
which is significant at less than the .001 level. 


Discussion 


The experimental results are consistent 
with the hypothesis that the drive reducing 
effect of a vicarious aggressive act is dependent 
upon the aggressive state of the subject at the 
time of the vicarious aggressive activity. 
Witnessing the prize fight film resulted in a 
significant relative decrement in aggression 
in comparison to witnessing the neutral film 
only for those subjects in whom aggression 
had been previously aroused by the insulting 
comments of the experimenter. The predicted 
increase in aggression for the noninsulted 
subjects following exposure to the fight film 
did not occur, however. Each of these two 
outcomes warrants further comment. ' 

With regard to the difference between the 
two Insult groups in subsequent aggression, à 
possible alternative to a catharsis or drive 
reduction hypothesis is one that assumes that 
guilt or revulsion stimulated by the fight 
film is the primary mechanism ‘responsible for 
the lowered aggression. Berkowitz (1958, 
1960) has strongly argued for such an explana- 
tion of a reduction in aggressive behavior 
following an aggressive act. However, it must 
be noted that the evidence for a guilt or in- 
hibition process is most indirect and inferential. 

With regard to the present study, the guilt 
alternative is certainly possible, although, 
for various reasons to be suggested below, not 
a likely one. If guilt arousal were a ubiquitous 
process, occurring whenever people are given 
the opportunity to indulge in aggressive 
fantasies, then the fight film should similarly 


384 


have inhibited the aggressive response output 
of the Noninsult group. The possibility still 
remains that guilt arousal can account for the 
aggression reducing effects of fantasy under 
conditions where aggression has recently been 
stimulated, as in the Insult condition. As a 
check on whether the lowered aggression of the 
IF Film group was due to some inhibitory 
factor, the word associations were scored for 
defensiveness. A previous study of the effects 
of inhibition upon aggressive word associations 
has shown that when inhibition is experimen- 
tally aroused, the number of aggressive re- 
sponses decreases while the number of defen- 
sive responses increases (Gellerman, 1956). 
While, in the present study, a difference was 
observed in the number of aggressive associa- 
tions, the difference between the two Insult 
groups in the number of defensive associations 
was negligible and insignificant. The absence 
of an increment in defensive responses, while 
not decisive since the experiment cited em- 
ployed an inhibition procedure more closely 
resembling fear rather than guilt, is more 
consistent with a drive reduction rather than 
guilt explanation of the decrease in aggression 
following the exposure of the insulted subjects 
to the Fight Film. 

The problem remains of accounting for the 
failure to obtain the expected increase in 
aggression in the Noninsult group. One possible 
reason is the limitation of the questionnaire 
instrument as a measure of aggression, Al- 
though one's preference for or attitude toward 
another person is frequently used as an index of 
aggression, as was the case in the present ex- 
periment, dislike and aggression are not 
equivalent dimensions. At the extreme, aver- 
sion and aggression are likely to be strongly 
correlated but within moderate ranges of 
feeling, the association between dislike and 
aggression may well be negligible. For this 
reason, the word association measure is prob- 
ably a better instrument than the attitude 
questionnaire for detecting changes in aggres- 
sion in the noninsulted groups. However, al- 
though the relative increment in aggressive 
associations following exposure of the Non- 
insult group to the Fight Film was in the pre- 
dicted direction, it was not statistically 
significant. Whether this failure to obtain 
evidence of a stimulating effect of a vicarious 
aggressive activity under relaxed emotional 
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conditions is due to inadequacies in the theo. 
retical analysis or to limitations in the methods 
utilized cannot be ascertained from the present 
data. : 
On the other hand, the data consistently 
reflect the dependence of the drive reduction 
effect upon the arousal of aggression at the 
time the subject is engaging in the vicarious 
aggressive activity. Presumably vicarious 
aggressive acts do not willy-nilly serve as 
outlets for aggressive motivation. This latter 
process warrants further attention. Aggression 
is not an ever-present tension system pervading | 
all of an individual’s activities. Like other 
acquired motives, its appearance is very much 
dependent upon situational factors; and, the 
more specific the category of objects toward! 
which the aggression is directed, the narrower 
is both the range of stimuli that can elicit 
the motivation and the range of situations 
that can serve as substitute outlets for the) 
aggression. i 
What would appear to be a relatively simple 
matter—the effects of a vicarious aggressive 
activity upon subsequent aggressive behavior 
—is in actuality a quite complex process. The 
present study has examined the influence 0 
the drive state of the organism upon this 
process. Beyond the requirement of replication 
in a variety of situations, further research is 
needed to establish the extent to which other | 
variables determine the effects of so-call 
vicarious aggressive activities and to establish | 
the precise mechanism by which the Pe 
formance, direct or vicarious, of an aggressi 
act influences subsequent aggressive behavior. 


SUMMARY 


Studies of the effects of a presumably kr. 
ious aggressive activity upon subs 
aggressive activity suggest that under cer 2 
conditions the activity will tend to increas 
and under other conditions decrease, he pi 
ability of subsequent aggressive b d 
The purpose of this experiment was E d 
the effects of one such condition navem do 
vicarious aggressive activity 
Specifically, it was proposed th: 
aggressive activity results in a 
subsequent aggressive behavior } 
is emotionally aroused at the time 
gaging in this activity, but if anger 
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been aroused, the activity results in an increase 
in subsequent aggressive behavior. The two 

independent variables manipulated in the 
study consisted of an Insult versus Noninsult 
| condition and an Aggressive Film versus 
- Neutral Film condition. One hundred and one 
college students wereassigned at random to the 
- four treatment groups generated by the two 
experimental variables. The subjects met the 
experimenter in small groups so that nine 
experimental sessions in all were held. Sub- 
jects assigned to the Noninsult condition were 
given standard test instructions while sub- 
jects in the Insult groups were subjected to a 
number of unwarranted and extremely critical 
remarks. The subjects then witnessed either 
an Aggressive Film or a Neutral Film. The 
- former consisted of a film clip depicting a prize 
fight sequence while the latter depicted the 
consequences of the spread of rumors in a 
q 


factory. They were then administered a word 

association test and under the guise of a 

departmental assessment of the value of 

students’ serving as experimental subjects, a 

Second experimenter administered a question- 

Maire dealing with the subjects’ attitudes 

toward the first experimenter and with their 

evaluation of the experiment. The degree of 
- Aggression manifested on the attitude question- 
. aire and the number of aggressive responses 
On the word association test constituted the 
dependent measures. 

A Significant interaction in the predicted 
direction was obtained for the Word Associa- 
tion Measure—the Insult-Aggressive Film 
E aie with fewer aggressive associa- 
Eu ^m the IN Film group, and the Non- 
= -Aggressive Film group responding with 

Aggressive associations than the NIN 
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Film group. A similar significant difference 
between the two Insult groups was found on 
the attitude questionnaire, but the difference 
between the two Noninsult groups on this 
measure was unreliable and was not in the 
predicted direction. 

The results were interpreted as being con- 
sistent with a drive reduction theory, although 
an inhibitory process (guilt arousal) cannot be 
excluded by the evidence at hand. The de- 
pendence of the aggression reducing effect of 
exposure to a film depicting violent activity 
upon the prior or simultaneous arousal of 
aggressive drive was stressed. 
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THE INFLUENCE OF LANGUAGE ON THE DEVELOPMENT OF 
CONCEPT FORMATION IN DEAF CHILDREN! 


HANS G. FURTH? 
Catholic University of America 


TTEMPTS to appraise the contribution 
A: language? in the development of 
thinking are made difficult by the 
fact that ordinarily language and thinking 
develop together. However, children born 
with profound deafness, or afflicted with it 
at a very early age, do not learn ordinary 
language as the usual by-product of living; 
through them we may study the influence of 
language deficiency on the development of 
cognitive functions and clarify the role of 
language in cognition. There appears to be 
wide agreement that the average deaf person 
is inferior to hearing persons in all activities 
requiring thinking in abstract terms (Levine, 
1950). Ewing and Stanton (1943), Templin 
(1950), Myklebust and Brutten (1953), and 
Oléron (1953) among others have held that 
the "conceptual retardation” of deaf people 
is intrinsically related to their lack of language 
experience, 

The purpose of this study is to demonstrate 
that the capacity of deaf people to deal with 
conceptual tasks may not in fact be generally 
retarded or impaired. Cognitive capacity, it 
is proposed, develops naturally with living, 
whether or not spoken language is part of 
the child's experience, and the role of language 
is restricted and extrinsic: e.g., familiarity with 
certain words may increase the efficiency with 
which the solution of certain problems may be 
reached. It follows that deaf children should 
not differ from hearing children in their per- 
formance on conceptual learning tasks where 
it can be assumed that specific language ex- 
perience does not favor the hearing child, 
However, on conceptual tasks where one has 


1 This report is based on a PhD dissertation sub- 
mitted to the University of Portland, and part of this 
paper was read at the APA convention, Chicago, 1960. 
This investigation was supported by a fellowship, MF 
9902, from the National Institutes of Health, United 
States Public Health Service. 

2 The writer is deeply indebted to Nissim Levy for 
his guidance and critical review of the manuscript. 

3 The meaning of language in this article is restricted 
to the ordinary spoken language of a society. 


reason to assume that specific language famili 
arity gives the hearing child an advantage, 
deaf children should be inferior to hearing 
children. 

Following a lead of Levy and Cuddy (1956), 
three concept learning tasks, differing in| 
relation to the language repertoire of the two 
samples, were chosen to measure the basic 
cognitive potential of the children and estab- f 
lish norms for various age groups. The tasks | 
were nonverbal and consisted of the opera- 
tional attainment of a concept or a principle 
according to which a subject's choice could 
be consistently correct. For the first two tasks 
the correct principle or concept could be 
assumed to be as familiar or unfamiliar to tle 
deaf as to the hearing children. The idea of 
“same,” involved in the first task, is so primi- 
tive that workers with deaf children report 
that there is no deaf child in school who doss 
not have at least some gesture for this idet 
On the basis of Levy and Ridderheim's (1958) 
study, partially replicated here, it seems that 
hearing children before the age of 12 do not 
have the concept of "symmetry" readily 
available and, therefore, also should have no 
advantage over the deaf children on the Sym 
metry task. 

The concept of "opposition," however 
should be quite familiar to hearing children 
beyond 6 years old, as a study of Kreezer ra 
Dallenbach (1929) showed. Our languag 
employs many dimensions in terms of b 
sites: hot-cold, good-bad, long-short. As i 
child learns the words denoting extremes b A: 
he learns the words characterizing the M d 
sion as such. In such linguistic contexts & C à 
becomes naturally acquainted with the a a 
of opposition and when he has reach í 
age of 6-can readily grasp the meaning A eff 
word "opposite." In distinction from the itd 
ing child, the deaf child, without the be Hy 
this specific language experience, finds ! site 
tively difficult to learn the concept of oPP™ 
for one dimension and then gener x die 
concept to other dimensions. Teachers ° 
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“deaf report that their pupils are being 
“taught” the meaning of “opposite” when they 
are in the intermediate grades or about 14 


years old. 
| METHOD 
- Subjects 
All pupils aged 7-12 years from the three schools 
— for the deaf located in one state, with an additional 44 
— subjects from a neighboring state, made up the deaf 
— sample, Excluded were those few who did not have an 
early hearing loss of at least 50 decibels in the better 
tar. There were 30 subjects in each of the six age groups. 
Since the majority of the deaf sample consisted of 
| practically all deaf children of the desired age living in 
thestate at one time, it can be reasonably assumed that 
a relatively unbiased and representative sample of deaf 
children was achieved. The hearing sample consisted of 
180 subjects randomly selected from five different grade 
Schools and classified into six age groups, 7-12 years, 
— of 30 subjects each. The hearing sample had an equal 
number of boys and girls; however, the deaf sample at 
ages 7, 9, and 12 showed a slight imbalance in favor of 
- boys. 
The procedure of matching the hearing and deaf 
group only on age and sex and permitting other possibly 
P, pertinent variables to vary randomly was judged 
Superior to an attempted control of IQ, institutional 
education, etc. With regard to IQ in particular, it is 
known that tests, standardized on a hearing sample and 
i ae on an average experiential and educational 
A oe are of questionable value with deaf 
E The deaf group had a somewhat lower overall 
VB ‘onomic rating than the particular hearing sample 
"m this study. In the light of the reported relation- 
etween the intelligence of children and parental 
Reo (Terman & Merrill, 1937), the somewhat 
ho Tm Socioeconomic distribution in the two samples 
uld bias the results, if at all, against the deaf and, 
erefore, against the main hypothesis. 


Tasks 


Es c tasks employed in the study were these: 
a aang task. This task consisted of a series of 40 
s end of round tin covers with two simple 
M nup on each cover. The two figures on one of 
De. ue identical, on the other the two were 
checker is nder the cover with the identical figures a 
bice The Placed indicating to the child the correct 
- oa ES criterion for success was 10 consecutive 
teri ‘ices. The trials were terminated after the 
; mn ore with the first error after Trial 
| ‘lied stimulus presentation’ was 

Used m this and the following task. i. 
eae task. Forty different pairs of 7 X 9 inch 
awn in aea pared for this task and simple figures were 
One card eranl black ink on a white background. On 
| Symmetric: Aus pair the figure to be rewarded was a 
. The one, while on the other it was asymmetri- 
“used eres and criterion were the same as those 
the Sameness task. 
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3. Opposition task. This task was in two parts. (a) 
Opposition Acquisition: From a set of eight wooden 
discs ranging in diameter from .5 inch to 2.25 inches and 
hidden from the subject’s view, four were selected in a 
fixed order and randomly thrown on the table in front 
of the child. The experimenter either pointed to the 
largest or to the smallest disc. If the experimenter 
picked the largest, the subject’s task was to discover 
that he had to pick the smallest; if the experimenter 
picked the smallest, the subject had to pick the largest. 
Subjects were given a maximum of 36 trials. The 
criterion was six consecutive correct choices, (b) 
Opposition Transfer: One uncorrected trial was given 
only to those subjects who succeeded on Opposition 
Acquisition, on each of the following six Transfer 
dimensions: Volume, Length, Number, Brightness, 
Position, and Texture. While the experimenter pointed 
to the stimulus on one extreme of the continuum of 
four or five stimuli, the child showed transfer of concept 
by pointing to the opposite extreme. 

The performance of the two groups was compared 
in terms of the proportion of subjects reaching the 
criterion. Other methods of measuring performance 
yielded similar trends and are not reported here except 
in the case of Opposition Acquisition. For Opposition 
Transfer the total number of nontransferred responses 
was used for computational purposes. 


RESULTS 


The results for the three Acquisition tasks 
are presented in Table 1. The chi squares are 
based on the actual number of passing and 
failing subjects, a total of 30 for each age group 
and 180 for each entire sample. 

The comparative results for the Sameness 
and Symmetry tasks were fairly similar. The 
hearing group showed no superiority over the 
deaf; on the contrary, there was a tendency 
for the younger deaf children to surpass the 
hearing children of comparable age. Also, the 
overall comparisons for all ages combined on 
the Symmetry task was significantly in favor 
of the deaf group. í 

Both groups showed a significant change of 
proportional success with age: on the Sameness 
task the chi square (df = 5) for the hearing 
group was 18.27 (p < .01) and for the deaf 
group 13.75 (P < .02); on the Symmetry 
task the respective values were 23 04 and 29.00 
(p « .01). While this change was consistently 
in a positive direction for the hearing children, 
the 11-year-old deaf group reversed the trend 
for the deaf sample: the chi square, comparing 
the successes of the 11- and 10-year-old deaf 
children for the Sameness task was 5.42 (p 
< .05) and for the Symmetry task 4.27 (p 


< .02). 
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TABLE 1 
NUMBER or SUCCESSFUL HEARING AND DEAF SUBJECTS on CONCEPT Tasks IN EACH AGE Group 
(30 subjects in each age group) 
Sameness Symmetry Opposition Acquisition 
Age 
Hearing Deaf xt Hearing Deaf x Hearing Deaf x 
7 4 7 44 6 4 12 29 14 16,09" 
8 6 16 5.81* 4 11 3.20 26 19 3.20 
9 12 16 -60 8 19 6.73** 28 23 2.09 
10 11 20 4.27* 12 21 4.31* 30 28 31 
11 13 11 .07 14 11 -27 30 28 B 
12 22 15 2.54 21 19 .08 30 27 140 ] 
"Total 68 85 2.91 65 85 4.13* 173 139 26,18" 
*p = 05. 
p= 01 ! 
Hearing children were significantly superior TABLE 2 


to deaf children on the Opposition Acquistion 
task, as indicated by the chi square testing for 
overall differences. The consistency of this 
superiority at each age level is more clearly 
revealed in Table 2, The median number of 
errors for the combined samples was 3.05. 
Analyzing these data for improvement of 
performance with age, the chi square values 
of 27.67 for the hearing group and 14.17 for 
the deaf group were found to be significant 
at the .01 and .02 levels, respectively, (df = 
5). 

Table 3 provides the relevant data for the 
Transfer test, As the nontransferred Tesponses 
were fairly normally distributed they were 
subjected to an analysis of variance. For this 
purpose 34 hearing subjects were randomly 
eliminated so that the JV of corresponding 
cells for each age group was equal. The analysis 
yielded an F of 24.02 for the Hearing-Deafness 
variable, an F of 9.42 for the Age variable, 
and an F of 4.50 for the interaction, The signifi- 
cant (p < .01) interaction and differences 
obtained between the two samples at each age 
level indicated that the hearing children were 
superior, the degree of superiority varying 
with age. 


Discusston 


The facilitative influence of language was 
highlighted in the present study when one 
considers that the same deaf children who 
demonstrated their equality on the Sameness 
and Symmetry tasks were consistently below 
the same hearing children on the Opposition 


NUMBER or Supyects Maxine LESS THAN 
MEDIAN Errors ON OPPOSITION 


ACQUISITION 
Age Hearing Deaf Chi square 
5 5.79" 
i $ 3 3.75, 
9 16 7 5.71% 
10 17 : ae pn 
11 19. 
12 25 10 15.49 
Total 94.5 38 36.19" 
*$- 05. 
“p= 01 
TABLE 3 


MEAN NONTRANSFERRED Responeya or 
DEAF AND HEARING SUBJECTS O: 
Opposition TRANSFER IN EACH 


Acr{GrouP 
Hearing Deaf 

E N Mean N iud 
responses Ec 

7 2 | 20 | 14 | 2% 

8 26 1.54 19 241 

9 28 1.25 23 161 

10 30 43 28 1.78 

11 30 -60 2 1.48 
12 30 -57 CUM 

00 

Total 173 1.10 139 2 
the del | 


task. On the two former problems, ually 
children in the lower age range Were ‘phis 
superior, perhaps because of their less ae 
cated approach to the problem § 


LANGUAGE AND CONCEPT FORMATION 


Tnsofar as the problem required the attainment 
of one simple concept, the hearing child's 
greater store of available categories may have 
been distracting. 

Regarding the developmental trend in the 
hearing group, the findings on the Symmetry 
task are in excellent agreement with the results 
of Levy and Ridderheim (1958) on an entirely 
different sample. The drop in the relative 
"performance of the 11- and 12-year-old deaf 
“group is somewhat puzzling, if it is not a 
‘sampling artifact. 

— Successful performance on these concept 
“learning tasks, it should be understood, is 
related to but in no way identified with the 
"knowledge of the concept or of the word. Al- 
though both deaf and hearing children at the 
“youngest age level knew the word or the con- 
cept of sameness, only few of them succeeded 
“on the Sameness task. At the other extreme, 
‘Neither deaf nor hearing 7-year-old children 
“Were familiar with the word “symmetry,” 
"yet a few 7-year-old children succeeded on that 
particular task, 


SUMMARY 


— Contrary to widely accepted conclusions 
that deaf people are inferior in conceptual 
‘thinking and the theories proposed to link 
Conceptual inferiority and language retarda- 
: tion, the present study suggested that the in- 
—Püence of language on concept formation is 
B sc and specific. According to this view, 
l E guage experience may increase the efficiency 
iM Concept formation in a certain situation, 
: Ut is not a necessary prerequisite for the 
“velopment of the basic capacity to abstract 
“4nd generalize, 
" To test this assumption, 180 deaf and 180 
faring subjects, 30 subjects for each age 
Soup from 7 to 12 years, were given three 
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nonverbal concept learning tasks differing 
with respect to the relevance of the language 
experience of the two samples. With regard to 
the Sameness and Symmetry task, the groups 
were assumed to be equivalent in relevant 
language experience. In contrast, concerning 

Opposition Acquistion and Transfer, specific 

language experience was assumed to give 

hearing children an advantage over the deaf, 

Accordingly, it was predicted that the hearing 

subjects would not be superior to the deaf 

subjects in their performance on the Sameness 
and Symmetry tasks, yet would be superior 
on Opposition Acquisition and Transfer prob- 
lems. The results confirmed the experimental 
predictions and gave support to the proposed 
theory. 
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INTERPERSONAL SENSITIVITY AND MOTIVE STRENGTH 


DAVID E. BERLEW! 
Wesleyan University 


NTERPERSONAL sensitivity implies em- 
pathy, understanding, ability to judge 
others, sensitivity to other people, and 

other similar concepts. Efforts to investigate 
interpersonal sensitivity as a personality trait 
have been frustrated by the lack of a satisfac- 
tory technique for measuring sensitivity, In 
the last decade the general procedure has been 
to ask judges (Js) to predict the responses of 
others (Os) to a questionnaire of some sort, 
and then to compute accuracy scores by com- 
paring each J’s predictions with the actual 
responses of the Os. Recently a number of 
studies have demonstrated that accuracy 
scores obtained in this way are more the result 
of statistical artifacts and response sets than 
of any differential predictive accuracy on the 
part of the Js (Bender & Hastorf, 1953; Cron- 
bach, 1955; Crow & Hammond, 1957; Gage & 
Cronbach, 1955; Gage, Leavitt, & Stone, 1956; 
Hastorf & Bender, 1952). Several investigators 
have suggested procedures for measuring vari- 
ous aspects of interpersonal sensitivity which 
yield scores apparently free from response set 
influence (e.g., Cronbach, 1955; Gage, 1952), 
Bronfenbrenner and his co-workers (Bron- 
fenbrenner, Harding, & Gallwey, 1958) at 
Cornell University used such a procedure for 
obtaining interpersonal sensitivity scores that 
were apparently reliable and which related 
significantly to personality (behavior) vari- 
ables, 

Using Bronfenbrenner et al. (1958) technique 
for obtaining interpersonal sensitivity scores 
free from artifactual components, the present 
study investigated the relationship between 
motivation and sensitivity to others. The 
hypothesis was that moderately motivated Js 
would make more accurate judgments of 
Os than either highly motivated or relatively 
unmotivated Js. 

If judging others is viewed as a complex 
problem solving task, certain empirical findings 
are relevant to the hypothesis, Most rele- 
vant, perhaps, is the Yerkes-Dodson principle 

1 This study was carried out while the author was a 


graduate student in the Department of Social Relations, 
Harvard University. 
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(Yerkes & Dodson, 1908) that moderate | 
motivation is optimal for efficient performance 
on complex tasks. Since Yerkes and Dodson 
made their discovery in 1908, other studies 
have demonstrated that their conclusion ap- 
plies to a variety of cognitive processes essen- 
tial to efficient problem solving. Motive in- 
tensity and attention to motive related cuesare 
positively related (Atkinson & Walker, 1956; 
Wispé & Drambarean, 1953), but at the upper i 
extremes of motive intensity there is some loss 
of perceptual objectivity (Allport, 1955, Ch, 
13). Moderate motivation facilitates “generic! 
or “rule” learning, learning which has maxi- 
mum transfer effect in other similar situations 
(Bruner, 1957). Birch (1955) indicates that 
moderate motivation is optimal for "insight- 
ful problem solving.” Taken together, these 
results suggest that moderate motivation 5 
optimal if a J must perceive a variety of mo- 
tive related cues and perceive them veridically, 
have learned something about the meaning 
of identical or similar cues in past situations, 
and process such information to arrive at a | 
accurate solution or judgment. 

Estimating the motivational level of the Js 
is crucial to any test of the hypothesis. Me 
Clelland and his co-workers (Atkinson, 1958; 
McClelland, Atkinson, Clark, & Lowell, 1953) 
have had considerable success inferring pe; 
strength of various motives from imaginative 
stories. Atkinson (1953), however, has pointe i 
out that motivation inferred from p 
analysis of fantasy is probably a. bo 
characteristic of personality which is mal’ 
fested in behavior only when engaged oF Br 
ported by appropriate environmental cue i 
(p. 387). To estimate the intensity of a p 
motive in a given situation, one must kn d 
in addition to the strength of the motive m 
disposition, the extent to which the situ? 
engages or arouses the motive. vide 

Fortunately, there are data that p 3i 
information as to the motive engageme 
arousal properties of task oriented a 
groups. Factor analytic studies of such gr 
indicate that group members perceive ed 
rate each other in terms of three basic 
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sions—task competence, influence, and affec- 
tion (Tagiuri, 1958). Thus we can assume 
that information or social cues built into the 
groups are relevant to these three dimensions 
to the relative exclusion of cues relevant to 
other dimensions, Since the motives n Achieve- 
ment, n Power, and n Affiliation correspond 
very closely to task competence, influence, 
and affection, respectively, it can be argued 
that these three motives particularly should 
be engaged or aroused by the small group 
situation. 

The problem of obtaining estimates of the 
intensity of the subjects’ motives in the group 
situation still remains. One solution is to 
attempt to design a group situation that 
arouses all three motives equally. If this can be 
accomplished, the contribution of the situa- 
tion to motive arousal can be treated as a 
constant across all three motives, and the 
fantasy measure of motivation (or motive 
predisposition) used as an estimate of motive 
Strength. This is admittedly a gross method 
for obtaining estimates of motive intensity in a 
Particular situation, but the desirability of 
Studying several motives simultaneously seems 
to warrant a departure from more conventional 
Methods of controlling motive intensity even 
at the expense of some rigor. 

In light of the foregoing reasoning, it was 
Predicted that Js with a moderate n Achieve- 
aN n Power, and n Affiliation would make 
Eus judgments of Os in a task 
high small group situation than either 

eny motivated or relatively unmotivated Js. 


a METHOD 
Subjects and Setting 


The 
Coll 
to 


bd CP were male members of a Harvard 
particioat relations class, The subjects were invited 
Preparati pate in small informal discussion groups as 

on for an approaching examination. Eight six- 


ee groups were scheduled. Except for the 
lot be m S Students who were acquainted could 
allowed n ers of the same group, subjects were 
convenient ee up for any group that met at a time 
€ groups x them. Due to absentees, only three of 
While the s With a full complement of six members 
group met S five groups had five participants. Each 
ES and discussed the course material for 2.5 

jr... ing this period they were not aware that 


their sensitiv: 
n 
tested, sitivity to other group members was to be 


Fact 
Bote yee to moderately engage each of the 
Discuss; "E Studied were present in the situation. 


Course material in preparation for an 
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examination should engage the subjects’ n Achieve- 
ment; the facts that the test in question was an “hour 
exam" and not a final, and success on it did not neces- 
sarily depend on what they contributed to or got 
from the group discussion, were expected to mollify 
what otherwise might be a highly arousing situation. 
Similarly, the lack of a designated chairman or leader 
for the discussion, and the fact that group members 
were meeting each other for the first time, were designed 
to moderately engage the power and affiliation motives, 
respectively. 


Measure of Interpersonal Sensitivity 


Following the discussion period, each group member 
was'asked to rate himself from 0 to 6 on the series of 12 
adjectives, half representing desirable and the other 
half undesirable qualities, originally used by Bronfen- 
brenner et al. (1958, p. 51)? The instructions were: 
“Describe youself with respect to the way you acted 
during the discussion in which you have just partici- 
pated." After he had rated himself, each group member 
was asked to predict how each other group member had 
rated himself on the same list of adjectives. 

Interpersonal sensitivity (IS) scores were computed 
using Bronfenbrenner's “method of differential com- 
parison.” The method of computing such a score for a 
single subject consists of: expressing his prediction of 
each other group member’s self-rating on a specific 
item as a deviation (xı) from the mean of his predictions 
for all the other group members on that item, expressing 
each other group member's self-rating on a specific item 
as a deviation (yı) from the mean of all the other group 
members’ self-ratings on that item, and then computing 
the simple correlation (ram) between xi and y, over 
all items? The resulting correlation coefficient reflects 
the subject's ability to recognize individual differences 
among other persons in their responses to each item. 
Since both the criterion ratings for evaluating the 
accuracy of predictions and the predictions themselves 
are expressed as deviations from the respective means 
for all members of the group, the measure is independent 
of the subject's similarity and sensitivity to the gen- 
eralized other. After a statistical analysis of sensitivity 
scores obtained using the “method of differential 
comparison,” Bronfenbrenner et al, (1958) state: “It is 
our belief that the only way in which a judge can obtain 
a high score (i.e., a high correlation) with this type of 


2 The “desirable” or “favorable” adjectives, adopted 
from Bronfenbrenner, Harding, and Gallwey (1958), 
were helpful, influential, interesting, observant, reason- 
able, and warm. The “unfavorable” adjectives were 
domineering, immature, shy, submissive, unimagina- 
tive, and worried. T 

3 According to Bronfenbrenner, Harding, and 
Gallwey (1958), this index corresponds to the “average 
Jation” in conventional analysis of 


within-group corre 
covariance and can be treated in exactly the same way 
as the ordinary correlation coefficient except for the 
fact that the usual methods for evaluating statistical 
significance of r cannot be applied directly. Because 
successive paired values cannot be assumed independ- 
ent, the exact number of degrees of freedom associated 


with 7 is not known. 
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index—except, of course, through errors of measure- 
ment—is by being aware of objective differences among 
the persons he is asked to judge” (p. 49). 


Measures of Motivation 

Six-card, self-administered Thematic Apperception 
Test (LAT) booklets were completed by the subjects 
during a regular class hour prior to the scheduling of 
the small group meetings. The subjects were not aware 
of any association between the TAT and the group 
meetings. The six TAT pictures used comprise a 
standard set recommended for use when distributions 
of scores for the motives, n Achievement, n Affiliation, 
and n Power, are desired (Atkinson, 1958), The TAT 
protocols were scored for the three motives prior to the 
small group meetings using coding systems developed 
by McClelland and his co-workers (Atkinson, 1958). 
The experimenter, who scored the protocols, has 
demonstrated reliability of over .90 with the practice 
materials included in the three scoring manuals. In 
addition, reliability coefficients between the experi- 
menter and a second experienced scorer were .95, .93, 
and .87 for n Achievement, n Affiliation, and n Power, 
respectively, on a sample of one-third of the protocols 
used in the study. 


RESULTS 


The IS scores, in the form of correlation 
coefficients, ranged from —.31 to +.50, with 
a mean of .23, a median of .29, and a standard 
deviation of .21. As noted previously (see 
Footnote 3), the usual methods for evaluating 
statistical significance of r cannot be directly 
applied to the IS scores because the exact 
number of degrees of freedom cannot be de- 
termined.‘ 

The eight discussion groups did not differ 
with respect to mean and standard deviation 
of IS scores. For purposes of analysis, Js were 
designated as Sensitive or Insensitive according 
to whether their IS scores were above or below 
the median of the entire distribution of scores, 

The distribution of scores for each motive 
was divided as nearly as possible into thirds 
and each J classified as high, moderate, or 
low on each motive. In Table 1 Sensitive and 
Insensitive Js are classified according to the 
strength of each motive, 

The predicted curvilinear relationship be- 
tween the motives, n Power and n Affiliation, 
and IS scores is significant. Sensitive Js tend 
to be characterized by moderate n Power and 
n Affiliation, whereas Insensitive Js tend to 


* Treating IS scores as r's to obtain a rough idea of 
how successful the subjects were in making judgments, 
27 of the 43 subjects exceed chance accuracy at the 
-05 level, with 17 of these 27 subjects attaining scores 
significant at the .01 level. 
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TABLE 1 
NUMBER OF SENSITIVE AND INSENSITIVE 
ING Low, MODERATE, OR HiGH 
ON n ACHIEVEMENT, n AFFILIATE 
AND n PowER 


n Achieve- | s Agiliation [im 
In nal 
5 
HHEE 
Sensitive Js 6197/5 
(N = 22) 
Insensitive Js 9|4|8]|8 
(N = 21) 
Chi square* 2.43 4.25 
> (two-tailed) :20 105 


* Computed with one df by combining H and L cell 
2 X 2 table. 


have either high or low n Power and i 
tion. The relationship between n A 
and IS scores, although in the predicte 
tion, is insignificant. In conside 
results, it is important to note that 
motives are not positively related; 
there is a significant negative correla 
tween n Power and n Achievement (r 
P < .01). While this negative rela 
increases the probability that particul 
tensity patterns relative to the two 
will occur (i.e., moderate-moderate, a 
low), there is no reason to believe tl 
tributes spuriously to the relationship 
motive intensity and interpersonal s 
To test the relationship between 1 
sonal sensitivity and general level of 
tion, each motive score was conve 
z score representing its distance in $ 
deviation units from the mean of the? 
Score distribution. A total z score 
puted for each J by summing the 2 S¢ 
each motive, disregarding signs. Thus 
total z score represents extremes 1D 
intensity (either high or low) and 4 los 
z Score represents moderate motive mi 
across all three motives. r 
The mean total z scores obtained 
sitive and Insensitive Js were 2.075 ant 
respectively. The mean difference 
yields a ¢ of 3.000 which is significan 
.005 level, using a one-tailed test of sig? 
An attempt to assess more sensitivi 
covariation of motive intensity (total 


| and IS scores yielded an r of — .20, just signifi- 
| cant at the .05 level. 


f DISCUSSION 


i The results clearly support the hypothesis: 
moderately motivated Js are distinctly su- 
perior to highly motivated and unmotivated 
Js at judging Os in a task oriented small group 
situation. The same trend is apparent whether 

. one motive or all three motives are used as 
estimates of motive intensity. 

Since the group sessions were neither ob- 
served nor tape recorded, it was impossible to 
determine objectively whether or not there 

| was roughly the same amount of social inter- 
action relevant to each of the three dimensions, 
task competence, influence, and affection. 
However, the experimenter got the general 
impression that there was from interviewing 

- the subjects. The fact that a significant pro- 
portion of the variance in IS scores can be 
attributed to differences in Js' motivation 
also Suggests that the attempt to control 
situational factors was relatively successful. 
Itis doubtful whether the predicted curvilinear 
telation would have obtained for each motive 
and sensitivity had one motive been aroused 
disproportionately. For example, a situation 
extremely arousing to a particular motive 
would serve to moderately arouse or motivate 
even those Js with a relatively weak motive 
Predisposition, thereby enhancing their ability 
to Make accurate judgments of Os. In the 
a Situation, Js with a moderately strong 
te would become highly aroused 
while oe be relatively poor at judging Os, 
ae ; with a strong latent motive would 
8 3 e inept as Js. In this hypothetical 
En the. relationship between latent 

nu) oe and sensitivity would be 
n uen negative rather than curvilinear. 

Blüte jd with a very slight tendency to 

uu. arouse a motive might conceivably 

[em rid the opposite effect—a positive 

e Pme lonship between latent motivation 
0 lüvity. 

the E m that might be asked is whether 

iine «d E ues cannot be accounted for 
telationshi p. d way, without reference to the 

(cf, sche, motive Intensity to sensitivity 
ing is that " 1958). One point worth reiterat- 

im e estimates of motive intensity 

Eu M Sensitivity are experimen- 

pendent. Thus except for the pos- 


tally 
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siblity of measurement error, the empirical 
relationship between the two variables can 
be accepted as genuine. The meaning of the 
results is quite another problem. One might 
ask, for example, whether it is not possible 
that persons with low motivation use only the 
middle of the rating scale, whereas highly 
motivated subjects concentrate their ratings 
at the extreme ends of the scale, resulting in 
low IS scores for both groups? The answer to 
this and similar questions is that subjects 
differing in motive intensity may indeed use 
the rating scales in consistently different 
ways, but that is part of the reasoning under- 
lying the prediction the study was designed 
to test. Subjects with low motivation were 
expected not to be attentive to interpersonal 
cues, and as a result either refuse to differ- 
entiate or be forced to make wild guesses, 
Highly motivated subjects, on the other hand, 
were thought to be predisposed by their needs 
and expectations to see Os in certain ways, 
so their judgments should reflect a tendency 
not to differentiate, or to make only gross, 
dichotomous discriminations among persons, 
In either case, motivation affects the pattern- 
ing of responses, and the patterning of re- 
sponses is one factor that affects IS scores, 
The study does not attempt to separate the 
effect on interpersonal sensitivity of a tend- 
ency not to differentiate among Os and 
efforts to differentiate that are inept. Both 
factors operate to determine the sensitivity 
of a person’s interpersonal perceptions. 

It should be pointed out that while subjects 
were encouraged by the instructions to try to 
discriminate among Os, the form of the test 
instrument did not force them to do so. This 
study does not answer the question as to how 
well subjects characterized by different degrees 
of motivation might be able to judge Os if 
actually forced to differentiate. However, if 
the arguments just presented have any valid- 
ity, the same hypothesis should apply. 

Several comments concerning the interpre- 
tation of the results of this study are in order. 
Ability to judge Os’ self-percepts may eventu- 
ally prove to be a convenient index of inter- 
personal sensitivity, but for the present it 
must be considered one discrete skill that may 
or may not be related to other aspects of sen- 
sitivity to others. Assumptions regarding the 
level of motive intensity of subjects in the 
group situation are tenuous, at best; more 
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rigorous means of controlling the contribution 
of the situation to motive arousal should be 
developed or adopted. Finally, no conclusions 
can be drawn concerning the relationship 
between the intensity of à specific motive and 
ability to make certain kinds of judgments 
(e.g., judgments related to that motive). 
As Cronbach (1955) has pointed out, no sen- 
sitivity score can be thoroughly understood 
until it has been broken down into discrete 
and meaningful component parts. 


SUMMARY 


On the assumption that judging other people 
is a special case of complex problem solving, 
the Yerkes-Dodson (1908) principle concerning 
the relationship between problem solving 
efficiency and motive intensity should apply. 
The hypothesis tested was that Js with mod- 
erate n Achievement, n Affiliation, and n 
Power would make more accurate judgments 
of Os in a task oriented small group situation 
than either highly motivated or relatively 
unmotivated Js, 

Six-card TAT booklets were completed by 
43 male undergraduates and coded for n 
Achievement, n Affiliation, and n Power. The 
subjects then met in groups of five or six to 
prepare for an undergraduate examination. 
At the end of 2.5 hours of discussion, each 
group member was asked to rate his behavior 
in the group, and then to predict how each 
other group member had rated himself, Inter: 
personal sensitivity scores free of response 
set influence were computed using Bronfen- 
brenner's “method of differential comparison." 

The results supported the hypothesis: Js 
with moderate motivation as reflected by an 
index combining the three motive scores ob- 
tained higher sensitivity scores than either 
highly motivated or relatively unmotivated 
Js. Need for Power and n Affiliation, taken 
separately, showed a similar relationship to 
interpersonal sensitivity scores despite the 
fact that the distributions of motive scores 
were not positively related. 


REFERENCES 


ALLPORT, F. H. Theories of perception and the 


concept of 
structure. New York: Wiley, 1955. 


BERLEW 


ATKINSON, J. W. The achievement motive and recall 
of interrupted and completed tasks, J, m 
Psychol., 1953, 46, 381-390. | 

Arxinson, J. W. (Ed.) Motives in fantasy, action, and 
society. Princeton, N. J.: Van Nostrand, 1958, 

ATKINSON, J. W., & WALKER, E. L. The afilliation 
motive and perceptual sensitivity to faces, J, 
abnorm. soc. Psychol., 1956, 53, 38-41. 

BENDER, I. E. & Hastorr, A. H. On measuring 
generalized empathic ability (social sensitivity). 
J. abnorm. soc. Psychol., 1953, 48, 503-506. 

Bcs, H. G. The role of motivational factors in 
insightful problem solving. In D. C. McClelland 
(Ed.), Studies in motivation. New York: Appleton: | 
Century-Crofts, 1955. Pp. 428-439. | 

BRONFENBRENNER, U., HARDING, J., & GALLWEY, M. 
The measurement of skill in social perception, In 
D. C. McClelland, A. L. Baldwin, U, Bronfen | 
brenner, & F. L. Strodtbeck (Eds.), Talent and | 
society. Princeton, N.J.: Van Nostrand, 1958) 
Pp. 29-111. ; 

BRUNER, J. S. Going beyond the information given, Iy | 
Contemporary approaches to cognition. Cambridge 
Mass.: Harvard Univer. Press, 1957. Pp. 41-69. 

Cronsacu, L. J. Processes affecting scores on “under 
standing of others” and “assumed similarity: 
Psychol. Bull., 1955, 62, 177-193. i 

Cronpacu, L. J. Proposals leading to analytic treat: 
ment of social perception scores. In R. Tagiuri & 
L. Petrullo (Eds.), Person perception and imt 
personal behavior. Stanford, Calif.: Stanford Univer: 
Press, 1958. Pp. 353-379. "m. 

Crow, W. J., & Hawwoxp, K. R. The generality 
accuracy and response sets in interpersonal percep 
tion. J. abnorm. soc. Psychol., 1957, 64, 384 3%, | 

Gace, N. G. Judging interests from expressive behavió 
Psychol. Monogr., 1952, 66(18, Whole No. 350). 

Gace, N. G., & Cronpacu, L. J. Conceptual and 
methodological problems in interpersonal pete? 
tion. Psychol. Rev., 1955, 62, 411-422. gi 

Gace, N. G., Lzavrrr, G. S., & STONE, G. C. vil 
intermediary key in analysis of mer | 
perception. Psychol. Bull., 1956, 63, 258-2 int 

Hastorr, A. H., & BENDER, I. E. A caution Mu 
the measurement of d Sa 
soc. Psychol., 1952, 47, 574-576. / 

McCrettanp, D. C., Arxius, J. W., CLARK, Ri g 
& LoweLL, E. L. The achievement motive. 
York: Appleton-Century-Crofts, 1953. tion. In 

Tacrori, R. Social preference and its percep epi 
R. Tagiuri & L. Petrullo (Eds.), Person Peer 
and interpersonal behavior. Stanton 
Stanford Univer. Press, 1958. Pp. 316-3 iological 

WispÉ, L. G., & Dramsarean, N. C. Physio P. 
need, work frequency, and visual duration 
olds. J. exp. Psychol., 1953, 46, 25-31. elation of 

Yerkes, R. M., & Dopsow, J. D. The P mation. 
strength of stimulus to rapidity of hab 482 

J. comp. neurol. Psychol., 1908, 18, 45 


(Received October 24, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol, 63, No. 2, 395-401 


+ 


. THE CONDITIONING OF VERBAL BEHAVIOR AS A FUNCTION OF 
THE NEED FOR SOCIAL APPROVAL 


DOUGLAS P. CROWNE anp BONNIE R. STRICKLAND! 
Ohio State University 


Marlowe, 1960; Marlowe & Crowne, 
1961; Strickland & Crowne, in press), 
- the behavior of individuals displaying a socially 
desirable response set on personality indices 
has been investigated both in psychometric 
situations and on experimental tasks involving 
nontest measures as dependent variables. 
These studies suggest that it is meaningful 
to conceptualize socially desirable responding 
on personality inventories as a function of a 
need for social approval. In contrast to more 
descriptive approaches to the problem of 
stereotyped responding (Edwards, 1957), it 
| has been assumed in these investigations that 
_ self-evaluative styles in psychometric situa- 
tions reflect personality characteristics of the 
respondent (cf. Couch & Keniston, 1960; 
Jackson & Messick, 1958). The conceptualiza- 
tion of need for social approval provides a 
Predictive link between socially desirable 
Tesponding on the scale described below and 
the degree of yielding to the perceived demands 
f nontest situations. The behavioral depend- 
tnt variables that have been investigated 
M conformity in a simulated Asch situa- 
m (Strickland & Crowne, in press) and the 
ey of attitudes expressed towards 
à boring task (Marlowe & Crowne, 1961). 
4 e the postulation of a need for social 
E gives theoretical consistency to the 
sults that have been obtained, further sup- 
age motivational conception is clearly 
Es ha This experiment was designed to 
ante Aud the motivational properties of 
lied d Or Social approval and to attempt to 
A ea Meanings of the construct. 
ae Hs conditioning paradigm was chosen 
OB cem First, the verbal conditioning 
Miniatu as been. widely considered as & 
als ie and simplified model for more com- 
€tpersonal situations like interviewing 


1 
hive ae would like to acknowledge the helpful 
Shephard 1 Li Suggestions given by Reed Lawson, 
i iverant, and Delos D. Wickens in the de- 


Sign and i ` 
of this n of this study and in the preparation 


i a series of recent studies (Crowne & 


and psychotherapy (Krasner, 1958; Ullmann, 
Krasner, & Collins, 1961), and the reinforcers 
typically employed have been of a "social" 
nature (e.g., “mm-hmm,” “good,” etc.). Thus, 
this experimental situation appeared to provide 
a task particularly appropriate to investigate 
the effects of a need for social approval. Second, 
verbal conditioning constitutes a difficult 
and exacting test of the hypothesis because 
subjects are typically unaware of the rein- 
forcement contingency. Hence, the need for 
social approval would have to exert its in- 
fluence without the mediation of awareness 
of the correct response-reinforcement sequence. 
The major hypothesis of the study can be 
stated as follows: Individuals whose need for 
social approval is high increase their rate of 
response in a verbal conditioning task in which 
reinforcement involves approval to a greater 
degree than those with a weaker approval 
need. Specifically, under conditions of positive 
reinforcement, subjects with a high approval 
need, in contrast to subjects less motivated 
for approval, tend to show a greater increase 
in the reinforced response class. But further, 
it follows that approval motivated individuals 
should be more affected by negative reinforce- 
ment in interpersonal situations, since this 
connotes denial of approval and punishment. 
Thus, within the compass of the major hypoth- 
esis, it was additionally hypothesized that 
subjects with a high need for approval tend 
to show a greater decrease in the negatively 
reinforced or punished response class than 
subjects to whom approval satisfactions are of ' 


less consequence. 
METHOD 


Subjects 

The subjects in this experiment were 145 under- 
graduate students in introductory psychology classes 
at the Ohio State University, 74 males and 71 females, 
who volunteered to serve in a study of “verbal reac- 
tions.” Subjects were randomly assigned to the experi- 
mental conditions as they appeared for appointments. 
Male subjects were run by a male experimenter and 
female subjects by a female experimenter in both rein- 
forced conditions. This restriction was not placed on the 
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assignment of subjects to the four experimenters who 
ran the nonreinforced control group. 


Procedure 


The verbal conditioning task employed was the 
plural nouns paradigm of Greenspoon (1955). Each 
subject was brought to the experimental room and 
seated directly facing the experimenter. (In Green- 
spoon’s procedure, the experimenter sat behind and out 
of sight of the subject.) The subject was then informally 
asked his year in college, major, and vocational aspira- 
tions. The purpose of this brief and structured inter- 
action was to get the subject to talk about himself to 
the experimenter and to establish the experimenter as 
a source of approval satisfactions. 

The experimental task itself was then introduced, 
employing the following modification of Greenspoon's 
instructions: 

What I want you to do is to say all the words you 
can think of. Say them individually. Do not use any 
sentences or phrases. Do not count. Please continue 
until I say stop. I will be writing down some things 
as you say the words. Do you have any questions 
at this point? All right, go ahead. 

The subject then said words for 25 minutes. In the 
positive reinforcement condition, every plural noun 
uttered by the subject was immediately followed by the 
experimenter’s “mm-hmm” and a head nod. The sub- 
jects in the negative reinforcement condition elicited 
an "uh-uh" and a shake of the head for each plural 
noun spoken. A nonreinforced control group was em- 
ployed to establish the base rate of plural nouns, and 
subjects in this condition said words for the duration of 
the experimental time in the absence of any verbal re- 
inforcement by the experimenter. Forrecording purposes, 
the conditioning task was divided into five 5-minute 
periods, during which a frequency tabulation was kept 
of the subject’s responses, plural and nonplural, which 
the subject was not permitted to see. In the course of 
this procedure, the subject’s spontaneous questions 
were briefly answered only if they involved a n 
clarification of the task. The subject was reminded of 
the instructions if at any time he stopped or gave unac- 
ceptable responses (counting, using sentences, etc.). 
Tn actuality, few subjects required further elaboration 
or reminders. 

At the end of 25 minutes, the subject was Stopped 
and asked the following questions: 

1. What do you think it was all about? 

2i zy did you go about deciding which words to 

say 

3. Did you notice any change in the kind of words 

you were saying? 

4. Was there anything I did that you particularly 
noticed? (If “mm-] ” [uh-uh”] was not spon- 
taneously mentioned, the subject was then asked: 
What about my saying “mm-hmm” [“uh-uh”}? 

5. What do you think the Purpose of that was? 

These questions were adapted from those employed in 

several studies reviewed by Krasner (1958) and were 

considered to represent a reasonably careful test of the 
subject’s awareness of the response being reinforced. 

Where necessary, the experimenter attempted to clarify 

ambiguous responses by additional probing. 

As a measure of performance on the conditioning 
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task, since the sheer number of words spoken potentially 
could affect (and thereby confound) the results, eah 
subject’s score was expressed as the ratio of plug] 
nouns to the total number of words emitted, thus on. 
trolling for the effect of verbal output. 


Inventory | 

The measure of the need for social approval was tie 
M-C SD scale (Crowne & Marlowe, 1960), a 33-iten, 
true-false questionnaire. The need for approval is in- 
ferred from the extent of the subject’s endorsement t. 
the socially approved and desirable characteristia 
which the items represent. An illustrative item is, "T 
have never intensely disliked anyone.” 

Tn the experimental procedure, the M-C scale and 
the verbal conditioning task were presented in counter: 
balanced order to eliminate an order effect. It is im 
portant to note that for all subjects the personality 
inventory was not scored until after the completion o 
the entire procedure. There was, thus, no possibilty 
that the experimenter’s knowledge of the M-C SD sot | 
could influence the results. 


RESULTS 


Preliminary to the analysis of the data, !) 
subjects were discarded. One subject was 
omitted for failure to comply with the instrue 
tions (counting); 15 subjects were dropped 
from the negative reinforcement condition 
because they correctly verbalized the ren 
forcement contingency; 1 subject was exclu! i 
from the positive reinforcement group for a 
same reason. Two subjects were dropped by 
means of random selection from the positive 
reinforcement condition in order to on 
proportionality of the cell Ws for the an n 
of variance. Of the final W of 126 subjects, d 
were in each of the three conditions, Thes 
subjects were further divided by dichotomitt 
the scores on the M-C scale at the ov i 
mean (16.82) to yield the high and low net 
for-approval groups. The experimental Tal 
was, thus, a 2 X 3 factorial with ju s 
measures across the treatments. The DE. 
of the data was accomplished by m. 
Grant's (1956) orthogonal polynomi: 
analysis. 

Ta the two specific hypotheses of the study, 
it was predicted that subjects with ease 1 
need for social approval show an, xs 
plural nouns under positive renio 5i 
and a decrease in plural nouns under I! i. 
reinforcement in contrast to subjects b 
need for social approval is lower. Fes jm 
and 2, respectively, show the curves det 
high and low need-for-approval go cot 
the positive and negative reinforcem 
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5 MINUTE PERIODS 


e. 1. Proportions of plural nouns given by high 
e low need approval groups under positive reinforce- 
ent and nonreinforced control conditions. 


ditions and compare them with the curves of 

highs and lows in the nonreinforced control 

condition, From the curves in Figure 1, it 

Would appear that the high need-for-approval 

j ia shows the predicted increase in the 
fency of plural nouns, while the frequency 

: plural nouns for the low group is intermedi- 

7 p the highs and lows of the control 

D tion. In Figure 2, the high group seems 

Ee in the use of plural nouns, as pre- 

P with the low group again occupying 
Position between the nonreinforced high and 
"aed for-approval groups. 

4 p 1 presents the results of the analysis 
E pors There is a significant departure 
E zero slope (overall trend F = 3.83, p 
m ; oe curves following both linear and 

E. ctions, It is clear that change in the 
E n of plural nouns occurred in the 

a ol c experiment. Of major interest to 
Boe esis are the significant between- 
@ tue e effects, Inspection of this portion 
med reveals both that the reinforcers 

ey ae (F = 5.25, p < .01) and that 

Boson erentially effective on the need- 

05) s groups (interaction F = 6.77, p 

$ O dng first the positive rein- 
ho. ata, to isolate the differences, a 
as Tun between the means of the posi- 
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&-— —& LOW NEED APPROVAL 


PROPORTION OF PLURAL NOUNS 


5 MINUTE PERIODS 


Fic. 2. Proportions of plural nouns given by high 
and low need approval groups under negative rein- 
forcement and nonreinforced control conditions. 


tively reinforced high and low need-for-ap- 
proval groups for the first time period, yielding 
a nonsignificant value of 1.21. It appears that 
at this early stage, verbal conditioning had not 
begun to differentiate the high need-for- 
approval group from the other groups. Thus, 
the differences between the reinforced high 
need-for-approval group and the low or control 
groups are to be found later in the task with a 
consistent superiority of the reinforced high 
group during the remainder of the conditioning 
periods. The smallest of these differences, that 
between the positively reinforced high and 
low need-for-approval groups for the fourth 
5-minute period, was tested by means of t. The 
obtained value was 2.20, which is significant 
beyond the .02 level. (One-tailed tests were 
employed in this and subsequent analyses 
because directional differences were specified 
in advance of the study.) It is interesting to 
note that the cubic between-group trends 
terms of the analysis of variance reached 
borderline significance, suggesting some tend- 
ency for the curve of the positively reinforced 
high need-for-approval group to depart in 
slope and form from theothercurves. —— 

In the negative reinforcement comparison, 
t tests were also run between certain of the 
means of the reinforced and nonreinforced 
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TABLE 1 


ANALYSIS OF VARIANCE OF HiGH AND Low NrED MEANS AND STANDARD DEVIATIONS OF THE Prou] 


APPROVAL GROUPS UNDER THE THREE 
REINFORCEMENT CONDITIONS 
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Source df MS R 
A. Overall Trend (4) | -031839 | 3. 
Linear 1 «075927 | 6. 
Quadratic 1 «006956 | 0. 
Cubic 1 043719 | 6. 
Quartic 1 -000754 . 
B. Between-group means (5) | .261322 | 4. 
a. Need Approval 1 «003586 | 0. 
b. Reinforcement condition 2 «284475 | 5. 
c. Interaction 2 «367035 | 6.77*** 
C. Between-group trends (20) | .008762 | 1.05 
1, Linear (S) | -008948 | 0.72 
a. Need Approval 1 «014859 | 1,19 
b. Reinforcement condition 2 |.010737 | 0.86 
c. Interaction 2 «004203 | 0.34 
2. Quadratic (S) | .009420 | 1.16 
a. Need Approval 1 |.005375 | 0.66 
b. Reinforcement condition 2 | .017112 | 2.11 
c. Interaction 2 «003750 | 0.46 
3. Cubic (5) | .012002 | 1.89 
a. Need Approval 1 «003226 | 0.51 
b. Reinforcement condition 2 -013586 | 2.14 
c. Interaction 2 «014805 | 2.33 
4, Quartic » | 6) | .004677 | 0.74 
a. Need Approval 1 «001788 | 0.28 
b. Reinforcement condition 2 |.006786 | 1.07 
€. Interaction ‘ 2 | .o04011 | 0.63 
D. Between-individual means 120. | .054212 
E, Between-individual trends (480) | .008310 
1. Linear 120 012465 
2. Quadratic 120 «008092 
3. Cubic 120 4006363 
. 4. Quartic 120 +006319 
F. Total 629 
*p < 02. 
** p< 01 
"> < 005. 
"t$ < 01. 


groups. Comparing the high and low need- 
for-approval Broups under negative reinforce- 
ment, significant differences were obtained 
for the first, third, and fourth time periods. 
However, *uh-uh" was effective in differen- 
tiating the punished high need-for-approval 
group from its nonreinforced counterpart 
only during the fourth time period. The effect 
of the experimenter’s verbal punishment was, 
thus, not as consistent in producing between- 
group differences in plural noun usage as the 
effect of approval. 

A finding of potential interest is the consist- 
ent though nonsignificant elevation of the 
curve of the low need-for-approval group over 
that of the high group under the control 
condition. It is unlikely that subjects with a 
low need for approval are characterized by 
more pluralistic speech than approval moti- 


TABLE 2 


Noun Ratios or Hic AND Low NEED APPROVAL 
GROUPS UNDER THE THREE REINFORCEMENT 
CONDITIONS 


Group 


Positive reinforcement 
High need approval M 


SD 
Low need Approval | M 
SD 
Negative reinforcement 
High need Approval | M 
SD 
Low need Approval M 
SD. 
Nonreinforced control - 
High need Approval M 
» SD 
Low need Approval M 


attributable to sampling error, minimal 
ferences in task orientation of the two gro 
or some other presently unspecifiable variabli 
Table 2 shows the means and’ standat 
deviations of the high and low need-dor9pl 
proval groups under the three experiment 
conditions. | 


notably that of the high need-for-approw 
group under positive reinforcement. : 
curve is characterized by increase, dear 
and a terminal increase. While this obs s 
tion does not bear on the hypothesis, it ae 

the interesting problem of why the deer an 
in the proportion of plural nouns Kk. si 
occurred in the periods following the ditio 
5 minutes. Conceivably, under the con P 

of the plural nouns paradigm, | a 
asymptotic limit to the number © ot pd 
a subject can give relative to his to ure d 
ductivity. Hence, this may be in the eno | 
a partial, temporary "exhaustion oct 
enon, from which recovery appears ted thal) 
in the final period. It may also be no al group. 
the nonreinforced low need-for-apPFOV2, © faq 
tended to increase slightly in the n m 
of plural nouns from the first to E that b 
period. Similarly, Daily (1953) foun ecient 
nonreinforced group showed an incr study | 
the response class investigated in his tive & 

Tt remains to examine an altern? 
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planation of the results. It is not altogether 
impossible that the obtained differences in 
- verbal conditioning are a function of intelli- 
gence. Such a finding would occur if there 
were a relationship between intelligence and 
the M-C scale. Scores on the Ohio State 
Psychological Examination (OSPE), a scho- 
lastic aptitude test highly related to other 
measures of intelligence, were available on 
34 of the positive reinforcement subjects and 
31 of the subjects in the negative reinforcement 
condition. The mean OSPE scores of the high 
- and low need-for-approval groups were sepa- 
rately compared for the two reinforcement 
conditions. For positive and negative rein- 
forcement analyses, the obtained ¢ ratios were, 
respectively, 0.56 and* 0.23, providing no 
evidence of differences in intelligence between 
the two groups. Finally, to determine if any 
| Tation exists between intelligence and verbal 
Conditioning, correlations were run between 
OSPE scores and a total conditioning ‘score 
| obtained by summing over the time periods. 
For the positive reinforcement condition, ‘r 
E — 06, and the corresponding z for the 
Negative reinforcement condition is —.04. It 
- evident that intelligence, as measured by 


the OSPE, is not predictive of verbal con- 
ditioning, 


Discussion 


E aix of this experiment establish 
TM x jects whose need for social approval 
ag put with subjects less con- 
trol E approval and nonreinforced con- 
PN Jects, tend to increase the relative 
quency of the reinforced response class of 
lid iie under positive reinforcement and 
3 o inhibit plurals when they are followed 
E nnt. In fact, the low need-for- 
“stent ch groups failed to demonstrate con- 
Positive | in rate of response either under 
Bue vd negative reinforrement when com- 
stab the control subjects employed to 
Sa base rate of plural nouns. 
Bi lcg Suggest that, in the context 
connotin mer procedure, a reinforcer 
ing the a mag is more effective in increas- 
inhibitin € of response than is punishment in 
approva response rate for subjects to whom 
Baden satisfactions are important. This 
di cies In evidence despite the fact that 
to the x ee in general far more salient 
UbJects and for many was highly 
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anxiety arousing, as shown by their questions 
and comments, increased response latency, and 
other behavioral indications of disturbance. 
There were, however, no differences between 
high and low need-for-approval groups in 
these manifestations of anxiety as assessed by 
observational ratings of their behavior during 
the conditioning period. 

For all groups, there was considerable vari- 
ability in conditioning. While the need for 
social approval clearly seems to be one deter- 
minant of change in behavior in this experi- 
mental situation, it is undoubtedly not the only 
variable involved. Other studies of individual 
differences in “conditionability” or “respon- 
sivity" have predicted change in response 
rate from such diverse personality measures 
as manifest anxiety (Taffel, 1955), compliance 
in psychotherapy as well as test anxiety and 
fearfulness in new situations (Sarason, 1958), 
achievement via independence (Krasner, Ull- 
mann, Weiss, & Collins, 1960), and hypnotiza- 
bility (Weiss, ann, & Krasner, 1960). 
It is important to determine if these and other 
personality?variables operate in a meaningful 
constellation and whether prediction can be 
increased by employing multiple measures, It 
remains to be determined what relations obtain 
among these personality variables, and efforts 
to achieve theoretical articulation are needed. 

As to the problem posed by this study, the 
conceptualization of need for social approval 
is considerably enhanced by the findings which 
provide support for the theoretical bridge 
between an individual’s attribution of socially 
approved characteristics to himself on a per- 
sonality inventory and the inference of strength 
of approval motivation from the degree of 
this endorsement. The need for approval was 
demonstrated to conform to a major attribute 
of needs; the ability to effect modifications of 
behavior and, in particular, to influence the 
effectiveness of a reinforcer. 

It appears from the present research and 
other investigations (Allison & Hunt, 1959; 
Couch & Keniston, 1960) that response sets on 
personality measures involve more than a 
simple tendency to agree, dissimulate, or 
respond in a socially desirable fashion in re- 
stricted self-evaluative situations. There is 
increasing evidence to indicate that a subject's 
needs in a testing situation and his categoriza- 
tion of that situation affect his test responses 
(Rotter, 1960), and these needs may show 
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considerable generality beyond this context. 
The present findings and previous research 
on the need for approval are suggestive of the 
extent of this generality, as is Couch and 
Keniston’s research on agreeing response set. 
Here, as with studies of verbal conditioning, 
research to determine the relations among the 
various response-set measures is required. Of 
particular interest in this regard is Couch and 
Keniston’s characterization of the naysayer, 
who demonstrates a "denial" response set, as 
conventional and conforming in his attitudes. 
These characteristics are also descriptive of the 
individual with a high need for social approval. 

It is important to note that no tautology is 
involved in this study in the finding that ap- 
proval motivated subjects increase their rate 
of response under conditions of approval ori- 
ented reinforcement. Subjects were carefully 
screened for awareness of the correct response, 
and only those subjects who were unable to 
recognize the response-reinforcement sequence 
were retained for analysis? Thus, the effect 
of the need for approval was mediated at a 
level of awareness below that which the sub- 
ject was capable of verbalizing. In the main, 
subjects, both those high and low in need for 
approval, tended to characterize the experi- 
menter’s “mm-hmm” or “uh-uh” as gener- 
alized encouragers or discouragers, or saw these 
reinforcers as pertaining to specific content 
categories which they were to pursue or to 
abandon. There were no differences between 
high and low groups in this respect.? 


* The following comments of subjects are quoted to 
illustrate their lack of awareness of the correct rein- 
forcement contingency. 

Positive reinforcement, to Question 5: “I would 

_ Say it encouraged me more than discouraged me. I 
knew you were approving—I completed that word, 
it was successfully done." 

Positive reinforcement, to Questions 4 and 5: 
“Yes, to certain of my reactions you would say ‘mm- 
hmm,’ but I don’t know why—not unless you en- 
joyed the word.” 

Negative reinforcement, to Question 5: “I don't 
know. That really shakes you up. You seemed to 
say it whenever what I said didn’t have any associa- 
tion with something I'd said before.” 

* Eriksen (1960) has recently reviewed an experi- 
ment by Dulaney (1961) in which a relationship was 
found between the hypotheses held by subjects in a 
verbal conditioning task and their rate of plural noun 
responses. Verbal conditioning occurred only for those 
subjects who interpreted the experimenter’s “mm-hmm” 
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Generalized to the area of psychotherapy, 
the results of this study are suggestive of the 
kinds of problems that may be encountered 
by therapists attempting to deal with the 
ubiquitous problems of resistance and trans- 
ference. The resistance of some patients to the 
therapist’s efforts may be due in part to their 
low need for the approval he dispenses and 
perhaps stronger needs to defend against 
anxiety arousing thoughts and to preserve 
partially satisfying, if neurotic, modes of 
behavior. At the opposite extreme may be 
those highly approval motivated patients for 
whom the therapist has high reinforcement 
value indeed. Perhaps in psychotherapy these 
are the patients rated as good by their thera- 
pists—patients who develop a strong positive 
transference and are sensitive to very subtle 
nuances in the therapist’s behavior. 


SUMMARY 


This experiment was undertaken to compare 
the changes in response rate of subjects 
differing in the strength of need for social 
approval on a plural nouns verbal condition- 
ing task. The hypotheses were that subjects 


as an indication that they were correct in pursuing 
certain content categories. Further, it was shown 
that while there is a tendency for plurals to evoke 
other plurals, this is not true of singular associations. 
Dulaney concludes that the relationship between the 
chaining of plural nouns and the subject’s belief that 
reinforcement was pertinent to content categories is 
sufficient to account for the (spurious) increase in rate 
of plural nouns in that increase in plural noun output 
is only incidental to increase in associations along rein- 
forced content lines. The present data were analyzed to 
test Dulaney’s hypothesis. Subjects were divided into 
two groups on the basis of their responses to the aware- 
ness questioning: a group containing 18 subjects who 
verbalized a relation between “‘mm-hmm” and correct- 
ness of associations along content lines, and a group of 
26 subjects who gave any other account of “mm-hmm” 
including lack of recognition. These groups were further 
broken down into high and low need for approval sub- 
groups and their plural noun ratios compared for the 
second time period in which the maximum rate oc- 
curred. No difference was obtained between high and 
low need for approval groups in the explanation of 
“mm-hmm,” and /s computed between the mean 
plural noun ratios of the explanation groups were non- 
significant. A similar analysis conducted for the nega- 
tive reinforcement data yielded nonsignificant results. 
Thus, Dulaney’s contention that the verbal condition- 
ing of plural nouns is artifactual does not appear to ap- 
ply to the present experiment. 
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with a high need for social approval, in con- 
trast to subjects with a weaker approval need, 
tend to show an increase in the proportion of 
plural nouns under positive reinforcement and 
a decrease in the proportion of plural nouns 
under negative reinforcement. The reinforced 
groups were also compared with a nonrein- 
forced control group employed to establish 
the base rate for plural nouns. 

Results supported the hypotheses, with the 
high need-for-approval groups increasing the 
positively reinforced response and inhibiting 
the response followed by negative reinforce- 
ment. No subjects retained in the analysis of 
the data were able to verbalize the reinforce- 
ment contingency. The significant differences 
obtained were found not to be attributable to 
intelligence, nor was intelligence related to 
verbal conditioning. 

The findings were interpreted as providing 
support for the inference of need for social 
approval from a personality inventory meas- 
uring the degree of personal endorsement of 
socially approved characteristics. 
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INTERACTION AMONG RETARDED CHILDREN AS A FUNCTION OF 
THEIR RELATIVE LANGUAGE SKILLS! 


SEYMOUR ROSENBERG,’ JOSEPH SPRADLIN, anp SANFORD MABEL 
University of Kansas 


WERE has been very little direct analysis 

of the interpersonal processes associated 

with behavioral retardation. A few 
authors discuss the possibility that inter- 
personal processes play an important role in 
such psychopathology (e.g Cameron & 
Magaret, 1951, Ch. 6; Levinson, 1958; 
McCandless, 1952; Masland, Sarason, & 
Gladwin, 1958, Ch. 14). It is conjectured that 
the social environment is an important deter- 
minant in the inception of a process of be- 
havioral retardation and that interpersonal 
relationships between a developmentally 
retarded child and his social environment 
affect the degree of behavior pathology. How- 
ever, as some writers have noted, these inter- 
pretations are based primarily upon studies 
of such gross variables as institutional up- 
bringing, socioeconomic status of the family, 
regional differences, maternal care, and on 
records of a few accidental cases of extreme 
social isolation. Psychological descriptions of 
the probable nature and relevance of inter- 
personal relationships to behavior development 
are, at present, inferred largely from these 
sociological variables and from clinical 
protocols, 

There are very few specific facts concerning 
the details of natural or contrived interpersonal 
events involving the behaviorally retarded. 
There is the general suggestion that inter- 
personal behavior of a retarded person is not 
determined exclusively by his own various 
skill levels, but also by the skills of the other 
interactors. Somewhat more specific is the 
notion that the discrepancy of skill between 
participants is important in determining 
interpersonal action. The present study was 
designed explicitly to ascertain the importance 
of discrepant levels of linguistic skill between 

1 This study is part of a project of research in 
language retardation sponsored by NIMH Grant 
(OM-111). The project is a joint effort of the Bureau of 
Child Research, University of Kansas, and Parsons 
State Hospital and Training Center. 

2 Now at Bell Telephone Laboratories, Incorporated, 
Murray Hill, New Jersey. 


participants, as well as absolute levels of 
skill, in determining the amount and type of 
interpersonal behavior exhibited. 

Pairs of children, both of homogeneous and 
of heterogeneous levels of linguistic skill, 
were assembled for observation. Detailed 
quantitative records were made of inter- 
personal behavior over a series of free play 
situations. All children had been institu- 
tionalized as a result of demonstrated be- 
havioral retardation and associated psycho- 
pathology. The composition of each dyad was 
systematically varied according to the 
linguistic skills demonstrated by each child in 
preliminary individual tests. Although all 
subjects were institutional residents, a wide 
range of talent on the tests was readily found. 

The use of psychometric tests to measure 
skills has been successful both in the area of 
retardation and in social psychological re- 
search. The important relationships between 
the psychometric score (intelligence or other) 
and the interpersonal behavior of an individual 
may not always be linear, however. A number 
of empirical studies in social psychology, 
dealing with social interaction among normal 
adults (Hoffman, 1959; Rosenberg, 1957; 
Schutz, 1958), suggests that individual test 
measures obtained before interaction relate, 
but not in simple ways, either to subsequent 
Social performance by the individual or 
to subsequent group products. It is also quite 
apparent from the literature that the relation- 
ships between psychometric scores and inter- 
action measures have not been highly 
systematized. Although this body of research 
contains certain commonalities, generalizations 
are at present limited almost exclusively to 
specific measures, populations, and inter- 
personal settings (see, for example, cautions 
by Shaw, 1960). In effect, the present study 
may be viewed as an initial attempt to deter- 
mine the presence and type of an assembly 
effect (Roby, 1953; Rosenberg, Erlick, & 
Berkowitz, 1955) in dyads composed of 
pathologically retarded children. 
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METHOD 


Psychometric Tests of Linguistic Skill 


Children were tested one at a time with a standard 
set of language items? The test items were divided into 
two categories, an Intraverbal subtest and a Naming 
subtest. The Intraverbal subtest contained 26 items, 
an example of which is, “The flag is red, white, and 
." The child is asked to complete the sentence. 
The Naming subtest contained 28 items and consisted 
of various objects, ranging in difficulty, which the 
child is asked to name. A child receives one point for 
each item named correctly. A child who failed to attain 
a score of at least one on both subtests combined was 
excluded from the study. For the remaining children, 
only the scores from the Intraverbal subtest were con- 
sidered in classifying them in level of linguistic skill. 
Two levels were specified: *High" (H) children were 
those who obtained a score >20 on the Intraverbal 
subtest. “Low” (L) children were those who obtained a 
score <9 on the Intraverbal subtest. Children with in- 
termediate scores were eliminated. The verbal IQ 
scores on the WISC for the H subjects ranged from 45 
to 82. None of the L subjects could be tested with the 
verbal part of the WISC. The correlation between the 
Intraverbal subtest and the WISC verbal is .70; the 
correlation between Naming and the WISC verbal is 
.05. The low latter correlation is readily explained in 
terms of the low overlap in relative difficulty; most 
items on the WISC are empirically more difficult than 
those in the Naming subtest. These correlations are 
based on an N of 34 for whom WISC scores were avail- 
able from the case files. These 34 children are part of 
the sample of 64 to be described below. 


Subject Selection 


All subjects were selected from the institutional 
population of Parsons State Hospital and Training 
Center, Parsons’ Kansas, This institution is for “emo- 
tionally disturbed, mentally retarded children without 
multiple handicap” between the ages of 6 and 21. The 
total number of children in the institution has varied in 
recent years between 619 and 672. There is considerable 
variation within the population in characteristics such 
as self-care, length of institutional residence, psycho- 
pathological behavior, obvious biological involvements, 
and intellectual range. The institutional staff carries on 
continuous programs in education, psychotherapy, vo- 
cational habilitation, and a number of the adjunctive 
therapies. 

At the time of the study, the institution contained 
159 children between the ages of 12 years 11 months and 
15 years 0 months. From this group, 64 children (30 
boys and 34 girls) were randomly selected for testing 


* The test format and items used in the present study 
have been deposited with the American Documentation 
Institute. Order Document No. 6867 from ADI 
Auxiliary Publications Project, Photoduplication Serv- 
ice, Library of Congress; Washington 25, D. C., re- 
mitting in advance $1.25 for microfilm or $1.25 for 
Photocopies, Make checks payable to: Chief, Pho- 
toduplication Service, Library of Congress. 
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and classified according to the procedures described 
above. All tests were administered within a 6-week 
period. Eight males and 13 females were classified as H; 
6 males and 11 females were classified as L. Fourteen 
children, 8 girls and 6 boys, were randomly selected as 
potential subjects from each of two categories, making 
a total of 28 children initially selected. The final re- 
sults are based upon 20 of these children because an 
additional criterion, to be described subsequently, was 
imposed during the experiment proper. 


Experimental Design 

Seven subgroups of four children each were formed, 
Each subgroup contained two H and two L subjects of 
the same sex. Aside from these two restrictions, assign- 
ment to the subgroups was random. Three male and 
four female subgroups were thus obtained. All experi- 
mental assemblies of dyadic play groups took place 
within a subgroup. Thus each subgroup constituted 
an independent replication of the same experimental 
design. 

The two H and two L subjects in each subgroup 
were randomly assigned to a row or a column of the 
following matrix with the restriction as shown. A, A’, 
B, and B’ represent dyads formed from one row sub- 
ject and one column subject. 


d COLUMN SUBJECTS 


ROW SUBJECTS 


The four subjects in a subgroup were brought to the 
research area of the hospital at the same time. Two 
dyads, each containing one row subject and one column 
subject, were then formed and placed in two separate 
playrooms for 15 minutes. For example, dyads A and 

! were formed and each put into separate rooms. Two 
new dyads were then formed from the same four sub- 
jects after this 15-minute period. Again each dyad was 
composed of one row subject and one column subject. If 
A and A’ had been formed first, B and B’ would be 
formed for the second period of play. The four cells of 
the matrix above were thus completed. Two more play 
periods for the same four subjects followed immediately. 


The order of assembly was not necessarily the same, 


however. Two completions of the matrix constituted a 
never assembled with 


"session." Row subjects were 
Dou za Similarly, column subjects never formed an 
imental dyad. 

igure lisa Sketch of the floor plan of the playrooms 
in which two dyads of a subgroup were simultaneously 
assembled. An observation room with a one-way mirror 
to each playroom is also shown in the sketch. Playroom 
I was wired so that the subjects could be readily heard 
in the observation room. All behavior recordings were 
done with the dyad in Playroom I. Thus each dyad 
spent one play period in each playroom during a session. 
The assignment of dyads to the playrooms also satis- 
fied the following restriction: If the subject was not in 


Fic. 1. Sketch to scale of the two playrooms and ob- 
servation room. (D is a door to hall; W is an outside 
window; M is a one-way mirror; Oi, O» are locations of 
observers.) 


& dyad in Playroom I during one of the first two 15- 
minute periods during one session, he was required to 
be in a dyad in Playroom I during one of the first two 
15-minute periods of the next consecutive session. Each 
playroom contained one table, two chairs, one beach 
ball, one teddy bear, and one package of colored clay. 
The windows of the playroom were covered with drapes. 

‘Thirteen sessions were scheduled for each subgroup. 
The first 8 sessions were scheduled 2-4 days apart. If 
one or more children in a subgroup were absent for a 
session, the session was postponed until the next sched- 
uled period. The first session started 3 weeks after the 
psychometric test, and all subgroups were started in the 
same week. After the first 8 sessions, the sessions were 
scheduled once each week. The 13 sessions which were 
to have required 9 weeks actually required up to 27 
weeks in some subgroups because of illnesses and Christ- 
mas vacations. Two of the seven subgroups were 
dropped after the third session because one child within 
each of these two subgroups refused to remain in the 
playroom for the scheduled time. Some preliminary but 
imperfect screening on this characteristic had probably 
already occurred during psychometric testing. Fre- 
quently, a child made a zero score because he did not 
remain in the test room even after several attempts to 
attract him to the tests. As noted previously, such ex- 
tremely “low” children were excluded before subgroups 
were formed, 


Behavior Measures 


Two observers (O1, Os) in the observation room 
categorized the behavior of the subjects in the play- 
room. An O never observed the same subject two 
times in succession. Within a play period, one O ob- 
served a row subject and the other O observed a column 
subject. The Os were required to judge the occurrence 
of three individual behavior categories. Three electrical 
buttons, one for each category, were available to each 
O, who was instructed to depress the appropriate key 
when he observed his subject behave in the defined 
manner and to release the button as soon as the be- 
havior ceased. Records of O's judgments were made on a 
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20-pen Esterline-Angus operations recorder. The be- 
havior categories were described to the Os as follows: 
1. Vocalization: All verbalization excluding shriek- 
ing, screaming, crying, laughing, and other re- 
flexive noises. 
2. Gesture: Commonly learned gestures clearly ob- 
served by the other child. Include only: 
Beckoning with finger or hand 
Shrugging 
Pointing 
Nodding yes or no 
Waving 
3. Physical Contact: Includes actual grasping of 
part of other child with hand, touching, hitting, 
pushing, chasing, offering of toys toward other 
child, throwing ball or object toward other child. 
Mutual hugging of an object. This excludes rock- 


ing. 

The last 5 of the 13 sessions were systematically ob- 
served by the two Os. Only data from the last 4 sessions 
were analyzed. The first 9 sessions were used to allow 
the relationships in the dyads to stabilize. Effects of un- 
equal prior contact among the subjects due to institu- 
tional routine were partially neutralized this way. 

Three people were selected as Os for the study, but 
only 2 were used at a time. They were 2 junior college 
students and 1 housewife, selected from a preliminary 
group of 6 housewives and 12 students. Final selection 
of the 3 Os was made on the basis of their reported 
availability and interest after a preliminary contact 
with the task (i.e., self-selection as Os) and the accuracy 
with which they used the categories during observations 
of a film.‘ Thirteen persons were eliminated on the first 
criterion. 

On the second criterion, after preliminary instruc- 
tions concerning the behavior categories, the complete 
film was shown to the potential Os. During this period, 
they did not rate the children. This showing was then 
followed by a number of repetitions of the first 5 min- 
utes of the film, which contained more communicative 
interaction among the children than other parts. The 
potential Os were asked to observe one of three different 
children during each of three 5-minute repetitions. Ina 
fourth run, the experimenter demonstrated his judg- 
ments. Questions by Os were then freely answered. 
Three repetitions then followed with the potential Os 
assigned to a different child each time. The selection of 
Os was based on these three final repetitions. The 3 Os 
whose ratings of the children most closely approximated 
the mean ratings of the original 18 Os were selected to 
rate the sessions, Two of these Os received additional 
practice in rating the dyads of each subgroup for the 
sixth and seventh sessions. These 2 Os rated all the 


4 A 20-minute film titled, One, Two, Three, Go, p19- 
duced by Metro-Goldwyn-Mayer, 1946. The film 18 
readily available through Teaching Films Custodian, 
Inc., 25 West 43rd Street, New York, N. Y. One, Two 
Three, Go is an “our gang” type of safety film. One of 
the gang is injured by being hit by an automobile 
while chasing a ball in a sand-lot baseball game. There 
is considerable child-child interaction during portions of 
the film where the rest of the gang is making the trip to 
visit him in the hospital. 
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sessions with the exception of one session for each of two 
matrices. For these two sessions, the third O served as 
an alternate. 


RESULTS 


Three measures were obtained for each 
subject on each of the last four trials: total 
frequency of vocal responses during a 15- 
minute play period, total frequency of gestural 
responses during the play session, and square 
root transformation (Vx + .5) of the total 
frequency (x) of physical contacts during the 
play session. The transformation of frequency 
of physical contacts was necessary because of 
extreme skewness in the original frequency 
counts (Edwards, 1950, pp. 199-202). 

The individual behavior measures obtained 
for each subject were classified according to 
one of the five subgroups (matrices) and one 
of the four sessions. Furthermore, each cell of 
each 2 X 2 matrix contains two values, one 
designating the behavior of the row subject 
and one of the column subject. 

The response data of row and column 
subjects were analyzed separately. The 
separate analyses of row and column subjects 
do not constitute two independent replications 
of an experiment, however. The presence of 
one member of each dyad in one set of data 
and the other member in the other set may 
introduce systematic similarities between the 
two sets of results. The data of the row sub- 
jects were arbitrarily selected as “primary” 
data. The column subjects’ data were analyzed 
and are presented to permit comparisons of 
results from the two sets of individual be- 
havior data. 

Each response measure of the row subjects 
was subjected to a separate analysis of vari- 
ance. The statistical design was the same for 
all three measures. The nature of the design 
and the results for row subjects are sum- 
marized in Table 1. The row subjects in these 
analyses are termed “speakers” and the column 
subjects are termed "listeners." 

. The three variance sources of greatest 
interest are those attributable to speaker's 
linguistic skill (Source 1), listener’s linguistic 
Skill (Source 2), and the statistical interaction 
of listener’s and speaker's skill levels (Source 
5). Each of these effects was tested (Table 1) 
against two error terms. One type of error 
term permits generalization across trials 
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(Sources 6, 8, or 11). The second and more 
important error term permits generalization 
across individual matrices (Sources 7, 9, 12). 
Both Fs for each variance source are shown in 
Table 1. 

Table 2 presents the analyses of response 
measures obtained from column subjects. In 
general, the results are comparable with those 
obtained from row subjects. With little 
exception, the two main effects, speaker's skill 
or listener's skill, are generally not significant 
in any of the analyses. It is obvious that 
linear relationships between psychometric 
score and individual situational performance 
fail to obtain here. On the other hand, the 
statistical interaction effects in both tables are 
highly significant for vocal responses and 
generally significant for gestural responses. The 
least stable measure, in terms of significant 
interaction effects, is transformed physical 
contact. 

Figure 2 presents the data graphically. The 
three graphs on the left of the figure are data 
from row subjects; the graphs on the right are 
from column subjects. Examination of the 
graphs in Figure 2 reveals the nature of the 
sources of variances tested in Tables 1 and 2. 
The top two graphs of Figure 2 clearly show 
that vocal productivity of a subject is related 
directly to the similarity of his psychometric 
score with that of the listener or recipient with 
whom he is assembled. This is a relatively 
simple but important interaction. It is not 
removable by any transformation of the 
measure (Scheffe, 1959, pp. 95-98). The graphs 
give very little indication of any main effect. 
Tt is also interesting to note that in dyads 
composed of heterogeneous skill levels (H-L 
and L-H), the absolute frequency of vocal 
responses by either child is almost zero in all 
sessions. A 

The gestural responses which are plotted in 
the second two graphs of Figure 2 suggest a 
similar type of statistical interaction. The 
effect is not as striking as that obtained for 
vocal data, but it is apparent that a dyad 
composed of two L subjects yields as much 
gestural behavior, on the average, as does a 
dyad of two H subjects, and that main effects 
are minimal. Again, heterogeneous dyads show 
very little gestural behavior in the absolute 


sense. 
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TABLE 1 
SUMMARY OF STATISTICAL ANALYSES OF THREE RESPONSE CLASSES OF ROW SUBJECTS 
Vocal behavior Gestural behavior Physical contact DS 
soe m ———EP 
us F MS F MS F REN 
1. “Speakers” 1 289 1.25 0.0} <1 1102 | <1 6 
(Row subjects) «1 «1 5.11 7 
2. “Listeners” 1 1141 | 17.83* 5.0| <1 1436 3.97 8 
(Column subjects) «1 «1 14.37* 9 
3. Tri: 3 242 15.5 404 
4. Matrices 4 1796 48.2 851 
5.1X2 1 | 83980 | 272.66*** | 871.2 | 57.32** 3.75 11 
90.11*** 20.84* | 2566 51.30* 12 
61X3 3 232 32.8 1379 
T.1X4 4 12899 108.5 216 
8.2X3 3 30.8 363 
9.2X4 4 12292 114.1 100 
10. 3X4 12 1058 22.7 561 
11.1X2X3 3 308 15.2 684 
12.1X2X4 4 932 41.8 50 
13. 1X 3 X4 12 440 17.9 231 
14.2X 3X4 12 405 15.5 580 
15.1K2X3xX4 12 1047 21.6 564 
16. Total 79 2975 42.3 528 
*p «0. 
>< Qi. 
"> < 001. 
TABLE 2 
Summary OF STATISTICAL ANALYSES OF THREE RESPONSE CLASSES OF COLUMN SUBJECTS 
(A Nonindependent Replication) 
Vocal behavior Gestural behavior Physical contact Error 
Source aff cach We 
MS P Ms F MS F 5 
1, "Speakers" — 1 16 | «1 9.8| «1 1133 8.58 6 
(Column subjects) «1 «1 7.36 7 
2. "Listeners" 1 328 1.47 68.5 9.01 567 | «1 8 
(Row subjects) «1 «1 2.32 9 
3. Trials 3 104 25.5 651 
4. Matrices 4 4392 23.3 521 
5.12 1 57460 | 174.12*** | 336.2 5.29 | 2236 4,84 11 
442.00*** 18.37* 9.98* 12 
61X3 3 114 16.9 132 
7.1X4 4 8831 144.6 154 
8.2X3 3 223 7.6 870 
9.2X4 4 6351 80.5 244 
10.3 X 4 12 526 27.2 370 
11.1X2X3 3 330 63.6 462 
12.1X2X4 4 130 18.3 224 
13.1X3X4 12 586 20.3 476 
14.2X3X4 12 343 23.2 196 
15.1X2X3X4 12 197 16.9 260 
16. Total 79 2009 36.4 386 
*) « 05, 
**)« 0. 
*** p < 001. 


The transformed physical contact data do from the interaction effects found with the 
not conform to the patterns demonstrated by other two measures. Essentially, the inter- 
the other two measures. When tested with action effect is an increment in L-L dyads as 
Source 12, the effect is significant, but it differs ^ compared with all other combinations. Physical 
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COLUMN SS 


AVE. FQY. VOCAL 
RESPONSES 


AVE. FQY. GESTURAL 
RESPONSES 


AVE. TRANSFORMED FQY. 
CONTACT RESPONSES 


10 " 12 13 10 " 12 13 
SESSION NUMBER 


Í ` Fic. 2. Graphic summary of the results showing 
each of the three response classes for row and column 
subjects. (The symbols, e.g., H-L, refer to response data 
of a high “speaker” assembled with a “low” listener; 
other symbols are similarly interpreted.) 


contacts in the other three compositions are 
near zero (Vx + .5, when x = 0 is .7). Note, 
however, that this statistical interaction effect 
is not stable across trials, i.e., is not significant 
when tested with Source 11. 

The similarity of findings between row and 
column subjects’ data is not surprising. The 
findings are not expected to be exactly the 
same with only a few subjects in the analyses. 
Nevertheless, even with small Ns, the two 
sets of data will appear more similar to each 
other than would two independent sets if we 
make the likely assumption that the inter- 
Personal process is such that if one person is 
high or low in vocal, gestural, or other meas- 
ured behavior, the other person is equally 
productive. If, on the contrary, there is a 
systematic inequality in the response outputs 
of the two members, and if it occurs in all 
conditions, it will tend to inflate one of the 
error terms used in the present analyses. 
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However, the inflated error term does not 
render equivocal any significant findings 
obtained from such analyses. Unequal output, 
if it occurs only in some conditions, will tend 
to produce heterogeneity of variance (although 
it is by no means the only “reason” for hetero- 
geneous variance) or contribute to statistical 
interaction of speaker X listener levels. 
Significant results from analyses under such 
conditions are not artifactual, either, since 
asymmetric output in some combinations and 
not in others may be viewed as a possible 
assembly effect in a group product “measure,” 
the difference in productivity between the two 
participants. 


Discussion 


The present study is a modified replication 
of a preliminary study (Rosenberg, 1959), in 
which similarly striking results were obtained 
for vocal data, using only two matrices of 
four children each. The present experiment 
was undertaken to determine whether or not 
the preliminary results were spuriously high. 
We speculated that these findings, based on 
the first five sessions only, might have been 


“due to routines in the hospital which fostered 


or imposed more contact between children of 
similar abilities than between children of 
different abilities, and that such differential 
contact resulted in more spontaneous social 
interaction among the homogeneous children. 
There is no satisfactory way of actually 
measuring prior contact among institu- 
tionalized children, but it seems unlikely from 
a knowledge of the institution that the almost 
complete absence of action shown by the 
subjects within the high-low pairs mirrored 
with any accuracy the frequency of contact 
imposed by institutional routines. f 

One of the important changes in experi- 
mental design, therefore, was the introduction 
of the preliminary sessions among the children 
in the new sample before systematic behavior 


measurement was undertaken. One could 


assume, of course, à process whereby children 


gradually interact exclusively with those 
whom they see more frequently in their 
institutional routines. While the preliminary 
sessions did not eliminate the possibility that 
hospital routine per se imposed such a rela- 
tively immutable social structure, it did 


408 


eliminate any effects which might operate 
only during initial contacts between previously 
unacquainted children. 

If it had been possible to assure no prior 
contact among all subjects, the observation 
and analysis of all sessions would have been 
extremely interesting. However, interpreta- 
tions of initial temporal effects would be more 
or less equivocal with the subject population 
of this study. If the interaction patterns 
stabilized rather early, as the preliminary study 
seemed to indicate, it would not necessarily 
follow that strangers discriminate each others’ 
skills quickly and interact accordingly. Even 
relatively slow stabilization of interaction 
pattern (say, after 5-10 sessions) may not be 
representative of temporal effects between 
children who are known to be complete 
strangers. Research with children who are 
initially unacquainted would throw light on 
the temporal development of the interaction 
patterns as well as on the possibility of 
immutable institutional structuring as respon- 
sible for the effects obtained here. 

Even in dyads for which absence of prior 
contact is assured, the behavior of the subjects 
may be a generalization based upon greater 
familiarity with relatively homogeneous 
children. That is, if the stranger is similar in 
various psychological or physical dimensions 
to the child’s frequent contacts, high inter- 
action will be expected to occur. For non- 
institutionalized subjects, cultural segregation 
correlated with individual differences in skill 
may exist in the school, the neighborhood, the 
family, etc, In the case of the present insti- 
tional study, rapid generalization to a strange 
child may be based, in part, upon the physical 
appearance of the low children, That is, 
physical deformities are more likely to exist in 
an L group than in the H group of children. 

Of the 10 L children in this study, 7 were 
judged as having a physical deformity. None 
of the 10 H children were so judged. Two 
research staff members made independent 
judgments of physical appearance and rated 

4 Ls as mongoloids, 2 Ls as cerebral palsied 
(only one judge), and 1 L as having extreme 
protruding lips and a small, pointed cranium. 
Nevertheless, inspection of the data indicated 
that H children emitted as many or more 
vocalizations and gestures when assembled 
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with the 7 deformed Ls as with the remaining 
3 Ls. Deformed Ls were likewise indistinguish- 
able from the other Ls in the amount of 
behavior they emitted in the presence of Hs. 
Generalization based upon obvious physical 
deformities may, of course, be most pronounced 
in initial sessions. For the present, we have 
satisfied ourselves that the laboratory effect 
was stable and not due to absolute absence of 
contact between heterogeneous subjects. The 
results are important whether interpreted as 
an interpersonal process likely to occur spon- 
taneously or as a process initially imposed by 
external social structuring. 

The results of the present study are com- 
patible with the commonly held notion that 
homogeneity of certain individual psycho- 
logical properties plays an important role in 
interpersonal attraction. However, most 
systematic work relating individual char- 
acteristics to interpersonal choice and inter- 
action has been performed with normal adults. 
Newcomb (1956), for example, has examined 
the relationships between attitudinal agree- 
ment and interpersonal attraction among 
college students and has suggested that these 
relationships are close and important. The 
contrasting hypothesis of complementarity of 
psychological characteristics is exemplified by 
Winch’s (1958) work on mate selection. 
Actually, these two general notions are not 
necessarily in conflict. Unfortunately, the 
research on assembly is neither sufficiently 
large nor consistent to provide a systematic 
account of the ways in which individual 
measures may be combined in order to predict 
interpersonal behavior. In any event, the 
population types, the individual measures, 
and the interactional contexts for most 
theoretical discussions in this area are quite 
different from those of the present study. 
The present study is not, therefore, à “test” 
for any theory, nor are the results anything 
more than compatible with the gross similarity 
hypothesis. Also, extensive psychological 
interpretations concerning the interpersonal 
dynamics which may mediate homogeneity 10 
psychometric scores, while possible, are 
actually almost wholly speculative at the 
present time. To the extent that these inter- 
pretations of dynamics are testable, they 
require considerably more empirical research. 
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INTERACTION AMONG RETARDED CHILDREN 


The direction of these empirical efforts may be 
suggested not only by theoretical interpreta- 
tions, but also by the experimental results 
themselves. 

The present experiment certainly suggests 
additional and more detailed experimental 
analysis, employing the same general social 
psychological methodology. First, the details 
of the interpersonal behavior are not well 
known. For example, the distinction between 
intelligible—i.e., culturally normative—and 
unintelligible vocal behavior must be made. 
The preliminary study (Rosenberg, 1959) 
attempted to make this distinction by in- 
structing raters to judge the “intelligibility” 
of each vocal response. The problems of 
reliable measurement and removal of obvious 
rating artifacts were insoluble. It appears to 
be necessary, or at least desirable, to devote 
Some research efforts to this measurement 
problem per se. The lumping of all vocalization 
into one category does not detract from the 
fact that vocal behavior, as defined for the 
observers in this study, is social in form. 
Whether it is culturally negotiable or not, the 
“low” children maintain a high rate of response 
in the presence of a social object who also 


. maintains a high rate of similar behavior. 


They are somehow able to discriminate quite 
accurately between a peer who is equally low 
from one who is not. 

Second, an expanded range of talent and 
more than two levels of linguistic skill would 
be extremely important in spelling out the na- 
ture of the statistical interaction. 

Third, the previous and present studies both 

limited assemblies to members of the same sex. 
Generalizations are obviously limited to such 
assemblies. The institutional routine does 
decrease the frequency of contacts between the 
Sexes. However, it would be surprising if any 
Polarization effects would hold in the case of 
heterosexual assemblies, particularly among 
older children. A variety of sexual advances, 
tather than mutual isolation, would certainly 
be expected from older children who are 
necessarily quartered according to sex. Hetero- 
Sexual assemblies could change considerably 
the Statistical interaction effects discovered 
within the present design. 

Fourth, the free play situation is only one 
of the many types of situations in which inter- 
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action may be observed. Other, more struc- 
tured tasks may be developed for analysis of 
interaction among these children. 

The present results, evaluated against the 
possible cultural processes of segregation which 
perpetuate themselves, imply a number of 
things about the learning situations which 
may obtain for children. These processes may 
set up learning conditions for children which 
are not necessarily optimal with respect to 
their learning potential. Or these social condi- 
tions may be optimal for some children but 
not for others. It is conceivable that institu- 
tional routines may be geared, in part, to take 
advantage of these social processes. The 
composition of children during play periods, 
eating periods, in wards or cottages, and in 
classrooms, are potentially manipulatable. Con- 
siderably more research is necessary, however, 
before the implications of the social process 
for learning become useful. 

Finally, the present methodology would 
seem to offer some interesting possibilities for 
other types of psychopathological process. 
Pairs composed of adults with demonstrated 
pathology of various kinds could be observed 
over an extensive period of time. Pairs com- 
posed of a normal and a pathological person 
may be instituted in some settings also. The 
subtle processes of psychotherapy may be 
viewed as a dyadic situation in which assembly 
effects may play a prominent role. 


SUMMARY 


The present study was concerned with the 
analysis of interpersonal behavior of pairs of 
children over a series of free play situations 
as a function of the linguistic skills of the 
individual participants. Children between the 
ages of 12-15, residents of an institution 
because of retardation and associated psycho- 
pathology, served as subjects. Subjects were 
selected on the basis of preliminary psycho- 
metric tests of linguistic skills. Two extreme 
groups, “high” and “low” were obtained using 
these test scores. Subgroups of four subjects, 
containing two high and two low subjects were 
then identified. Within each subgroup dyads 
composed of combinations of high and low 
skill levels were formed and scheduled for a 
series of play sessions in a laboratory playroom. 
The design permitted explicit analyses of 
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differences in social behavior among various 
dyads which were attributable to the skill 
levels of each subject per se as well as to the 
combination of skill levels in the dyad. 

Three responses were measured during play 
by observations through a one-way mirror. 
Responses of each subject in the dyad were 
separately recorded and consisted of vocal 
behavior, gestural behavior, and physical 
contact with the other subject. Results with 
the vocal and gestural responses indicated 
that the skill level of the subject per se is not 
significant in predicting his productivity. 
However, the combination of skill levels in 
the dyad is highly significant in the prediction 
of interaction behavior. The vocal and gestural 
productivity of a low subject in play with 
another low is as large as that of a high subject 
with another high subject. In heterogeneous 
assemblies, however, productivity drops almost 
to zero. Results with the physical contact 
measure are not as clear-cut. The combination 
of a low with a low yields the largest frequency 
of response while the other combinations ap- 
pear to be equally small. 

The results were discussed in terms of social 
processes which may be associated with 
behavioral retardation and with the implica- 
tions for learning as a result of these processes. 
Additional research, not only with retardation, 
but with other forms of psychopathology were 
suggested, 
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ACCEPTANCE OF PUNISHMENT AND CHANGE IN BELIEF! 
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tive dissonance, discusses the effects of 

“forced compliance" on dissonance and 
change in belief (pp. 84-122). A person, asked 
to perform an act that he finds disagreeable, 
may, in some instances, face the alternative of 
losing a reward that is offered for performing 
the act. In other instances, the alternative may 
be accepting a punishment that is threatened 
unless the act is performed. Compliance occurs 
if the negative valence of the punishment or the 
positive valence of the reward is greater than 
the negative valence of the act itself. The more 
similar these valences are, the greater the con- 
flict before decision. Following compliance, 
the individual tends to resolve the incompati- 
bility between his behavior and the negative 
valence of the act that he has performed. 
Similarly, following noncompliance, he must 
resolve the dissonance between his behavior 
and his acceptance of punishment or his giving 
up a reward. The dissonance may be reduced 
by cognitive change. For the noncompliers, one 
means of reducing it would be to increase the 
negative valence of the task, while for the com- 
pliers, dissonance would be reduced if the task 
Were seen more favorably, i.e., if the negative 
valence were reduced. 
, The tendency to bring one's private beliefs 
into line with one's verbal statements has re- 
ceived considerable support (Culbertson, 
1957; Janis & King, 1954; King & Janis, 1956; 
Raven, 1959; Sarbin & Jones, 1955; Scott, 
1957; Sims & Patrick, 1936). Further, a great 
deal of support for the theory with respect to 
compliers has also been reported. Festinger 
(1957) has interpreted data from a study by 
Kelman (1953) to support such a hypothesis, 


! This study was supported by the Group Psychology 
Branch, Office of Naval Research, Contract Nonr 
233(54), and is reported more fully in Technical Report 
No. 3 under that contract. A preliminary report of this 
one was presented at the Annual Meeting of the 
iene Psychological Association, Chicago, Septem- 
. Weare indebted to Lon R. Davis for his assistance 
if Constructing the equipment which was utilized in 

us investigation, and to Leonard Vosen and Stephen 
indman who conducted the experiment. 


Fes (1957), in his theory of cogni- 


but with the usual trepidation that accom- 
panies post hoc interpretation of experimental 
data. Cohen, Brehm, and Fleming (1958) re- 
port that after students were asked to make 
statements contrary to their opinions, they 
tended to change their opinions so as to make 
their opinions consonant with their state- 
ments—especially when no adequate justifica- 
tion was given for compliance. Festinger and 
Carlsmith (1959) were able to secure clear-cut 
support for dissonance reduction by compliers 
in an experiment in which subjects, after en- 
gaging in an unusually boring task, were asked 
to tell succeeding “subjects” that the task had 
in fact been quite pleasant. When they were 
paid $1.00 to do this, dissonance after the act 
was expected to be quite great, but with a 
greater reward of $20.00, dissonance would be 
less since the $20.00 would provide justifica- 
tion for saying something that seemed con- 
trary to fact. In a separate questionnaire, 
that presumably was not to be seen by the 
original investigators, subjects were asked to 
indicate how pleasant or enjoyable the task 
was. As predicted, the high dissonance sub- 
jects ($1.00) were especially likely to rate the 
task as more enjoyable, thus bringing their 
private evaluation of the task into line with 
their statements to the presumed “subject.” 
Other support for dissonance reduction in 
compliers may be found in Brehm (1957 un- 
published, 1959, 1960 unpublished) and Brehm 
and Cohen (1959). 

Dissonance reduction among noncompliers 
has received less attention, though here, again, 
Festinger’s interpretation of Kelman’s (1953) 
data is supportive. It is to this derivation from 
Festinger’s (1957) theory that our experiment 
is directed. Specifically, we wished to present 
individuals with a situation in which they 
would be asked to indicate whether they had 
received an ESP image. Some subjects who did 
not receive the image would receive punish- 
ment in the form of an electric shock. Others 
would not receive punishment for failure to see 
an ESP image. Those who received punish- 
ment should experience dissonance between 
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the knowledge that they had not indicated the 
reception of an image and the knowledge that 
they had thus accepted a quite painful electric 
shock (which could have been avoided had they 
reported reception). They should therefore be 
especially likely to show cognitive change so 
as to reduce dissonance. Specifically, we pre- 
dicted that these subjects would be especially 
likely to reduce their belief in the existence of 
ESP; if they were convinced that ESP did not 
exist and that reception was impossible, then 
a failure to report reception would appear 
justifiable even if this led to a painful shock. 


METHOD 
Measure of Belief 


Belief in extrasensory perception was measured on 
the AB scales (Fishbein & Raven, 1959). Suggested by 
Osgood's semantic differential (Osgood, Suci, & 
Tannenbaum, 1957), this instrument consists of 20 
polar adjectives, 5 of which (the B scale) specifically 
measure belief in the existence of an object (impossible- 
possible, false-true, nonexistent-existent, improbable- 
probable, unlikely-likely). Since each polar pair had a 
seven-point scale, a rated concept could give a belief 
score ranging from 5 (complete disbelief in existence of 
the concept) to 35 (complete belief in its existence). 
All students in a section of introductory psychology 
at UCLA were asked to rate ESP and two other con- 
cepts in class. These ESP scores were taken as estimates 
of belief at the time of the experimental session and 
were used to attempt to match groups in mean initial 
belief score. The belief measure was then given both at 
the beginning and at the end of the experimental 
session. 


Subjects 


Data reported in this paper are based on 52 under- 
graduate subjects, 13 males and 13 females in each of 
two experimental conditions. It was decided before the 
experiment that only data from subjects who had 
initial belief scores within a range from 10 to 30 would 
be considered in testing our hypotheses, since we 
wished to allow for some possibility of change in belief 
in both the positive and negative directions. Seventeen 
subjects were excluded from further consideration on 
this basis. Since we wished to test only the behavior of 
those subjects who did not indicate reception of ESP 
in the experimental situation, an additional 12 subjects 
who indicated that they had received an ESP image 
were not used in testing our hypotheses. The eliminated 
subjects did not come differentially from one experi- 
mental condition as compared to another? 


2 In addition to these, 11 other subjects were elimi- 
nated from consideration in our data: 5 for failure to 
complete the semantic differential measure, 4 because 
of apparatus failure, 1 for knowing the experimenter, 
and 1 for being suspicious of the experiment. "These 
subjects were not disproportionately distributed in the 
experimental conditions. 
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Procedure 


Premeasure of belief. When the subject appeared to 
take part in the experiment, he was met by a male 
interviewer. Care was taken that the interviewer did 
not know the experimental condition to which the 
subject would be assigned. Initially, the subject was 
asked to rate the concept “‘extrasensory perception” on 
the AB scales. It was explained to him that “extra- 
sensory perception" involved the "transmission of in- 
formation between people through means other than 
the usual sense organs" and that it was sometimes 
called “mental telepathy.” Following this, the subject 
rated the concept “racial prejudice” on the same scales. 
The purpose of the latter measure was to decrease the 
possibility of subjects’ remembering their precise 
responses to ESP. The subject was then taken into an 
adjacent room, for the major part of the experiment. 

Experimental situation. In the experimental room, 
the subject was greeted by a second male experimenter 
and seated at a table which was perpendicular to the 
experimenter’s desk, and facing the door. On the door, 
directly in front of the subject and 6 feet away, a 
20” X 30” white cardboard was mounted on a 30” X 
40” black background. On the subject’s table were 
two telegraph keys mounted on a plywood board, with 
the words Yes and No printed under the left and right 
keys, respectively. On the subject’s left were two copper 
electrodes attached to rubber wrist bands. On the 
experimenter’s desk, there was a plywood board on 
which was mounted a telegraph key, a transformer 
with a dial to regulate amount of current, a buzzer, 
a 20-pen Esterline-Angus multiple-event recorder, two 
6-volt dry cells, and a collection of relays, transformers, 
timers, and toggle switches. 

After the subject was seated, the electrodes were 
fastened to his left hand, one across the fleshy part and 
the other on the wrist. It was explained that his shock 
level would be measured later. Subjects were given 
the opportunity to withdraw from the experiment if 
they wished to avoid shock. Only one subject withdrew 
for this reason. It was explained that the experiment 
was to be a test of extrasensory perception. The experi- 
menter emphasized that although he did not believe 
nor disbelieve in ESP, it was necessary to test its 
possible existence in as many situations as possible. A 
man claiming to be a “sender” would attempt to trans- 
mit the image of the word contemporary to the subject, 
and a white card with the word on it in red was shown 
to the subject. (The letters had been traced from the 
cover of Contemporary Psychology.) Periodically, 8 
buzzer would sound simultaneously in both rooms, 
during which time the "sender" would concentrate on 
the word on the card, while the subject would concen- 
trate on the white card in front of him and attempt to 
receive the image. 

When the subject felt that he received an image of 
the word, he was to push the telegraph key marked 
Yes, as rapidly after the buzzer had stopped as pos- 


3 We selected the word Contemporary for transmis- 
sion because its complexity rendered it particularly 
difficult to image. In pretesting, a large number of 
subjects reported “receiving” a red circle, a red star, 
and several other simple objects even when there was 
No social pressure or threat of punishment. 
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sible. Similarly, he was to push the key marked No 
when he did not feel that he had received an image. 

Tt was stressed to the subject that he should only 
push the Yes key “if you are certain that the impres- 
sion that you see is actually being sent to you by the 
person in the other room. If you get no impression, or 
if you feel that the impression you see is merely an 
image you are projecting, without the help of the 
sender, then don't push the key....” 

The subject then went through a series of practice 
trials, during which the experimenter said “Yes” or 
“No” while the buzzer was sounding. This was to give 
the subject practice in waiting until the buzzer had 
stopped and then responding as soon as possible. 

The next step involved getting the maximum shock 
level which the subject could endure. By now, the 
amount of perspiration on the electrode connections had 
presumably stabilized, and the subject was given in- 
creasing amounts of shock and encouraged to accept 
more until he felt that he could not endure any addi- 
tional increase, 

The major ESP test was now to begin. The instruc- 
tions were reviewed and the subject told to respond 
“Yes” for “receive” and “No” for “not receive" as 
rapidly as possible after the buzzer had stopped. He 
mee not told exactly how many trials would be given 

um, 

The “sender,” who was presumably in an adjacent 
room, was given a signal to begin his transmissions. 
Each trial took 16 seconds: the buzzer automatically 
sounded for 6 seconds, followed by 10 seconds of silence 
before the next trial. The subject pushed a Yes or No 
key as soon as possible after the cessation of the buzzer. 
An Esterline-Angus multiple-event recorder recorded 
the buzzer signal and also the response, which gave a 
measure of latency of response. All subjects went 
through a series of 12 trials. 

y Punishment variation. There were two major varia- 
tions in the experiment: one with high punishment for 
failure to report ESP images—the Shock condition; 
and one with no punishment—the No Shock condition. 
Since the experimenter was not aware of the premeasure 
of belief and since the interviewer was not to know the 
experimental condition into which the subject would be 
assigned, the belief scores which the subject had given 
in class before appearing at the experiment were used 
to assign them to either the Shock or No Shock condi- 
tion. These assignments were made so as to make the 
premeasures of belief approximately similar. Un- 
fortunately, as we shall see later, the classroom scores 
did not prove to be as good an estimate of the prebelief 
scores as we would have wished. 

The punishment variation was introduced just 
before the trials began. Subjects in the No Shock 
condition were told that there would be no further 
shock and the electrodes were removed. Subjects in the 
Shock condition were told that they would receive a 
shock after every trial for which they pushed the No 
key. They were told: 

One condition under which I am interested in 
Studying ESP is that of shock. I am not sure if 
shock helps, hinders, or changes ESP, but I hope to 
find out. Whenever you push the “No” key you 
will receive an electric shock, which is at the highest 
level that you experienced earlier. When you push 
the “Yes” key you will not receive any shock. 
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However, do not push the “Yes” key just to avoid 
the shock, but only when you are certain that you 
are receiving an impression from the sender. 

In the Shock condition, a 1-second shock, at the 
highest level that the subject had said he could endure, 
was automatically administered immediately after each 
time that the No key was pushed. As mentioned earlier, 
these 12 subjects who pushed the Yes key at any time 
were not considered further in this experiment since 
we were interested only in subjects who did not feel 
that they had received ESP images and who responded 
No even if they did expect to receive punishment for it. 
Seven of these subjects were in the No Shock condition 
and five in the Shock condition. 

Postmeasure of belief. All subjects then went back 
to the room where they had their initial interview and 
were once again asked to rate “‘extrasensory perception" 
on the AB scale. They also answered a number of other 
questions intended to check on the effectiveness of the 
experimental variables and to gain additional informa- 
tion to help interpret the results. 

At the completion of the postmeasure of belief, 
subjects were asked additional open-ended questions to 
determine whether they were suspicious of the experi- 
mental situation, and were allowed to make comments 
and criticisms of the experiment. Subjects were then 
informed that there was no ESP "sender" and the true 
purposes of the experiment were explained. Comments 
and explanations were invited, after which subjects 
were pledged to secrecy and allowed to leave. 


RESULTS 
Effectiveness of Punishment Variation 


In order to ascertain whether or not the 
shock was in fact painful and did produce con- 
flict, several measures were used. 

A questionnaire item asked: “How much 
discomfort or pain did you feel just before or 
just after you pressed the No button?” The 
subject could check on a 13-point scale ranging 
from 1 (no discomfort) to 13 (unbearable). The 
males and females in the Shock condition gave 
ratings of 7.5 and 8.1, respectively. The cor- 
responding scores for the No Shock conditions 
were 1.2 and 2.2. These differences between 
conditions were highly significant (p < .001; 
z = 5.75; Mann-Whitney U test), with prac- 
tically no overlap of scores for Shock and No 
Shock conditions. There were no significant 
differences between males and females. 

A further indication of the effect of shock 
might come from a question that asked: 
“Given a situation such as the one which you 
have just experienced, what percentage of 
students such as yourself do you think might 
say that they kad received an impression when 
in fact they had not, or say they had not re- 
ceived an impression when in fact they kad?” 
Subjects could check a scale ranging from 0 to 
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100%. Subjects who indicated a high per- 
centage were assumed to be likely to have 
themselves considered the possibility of in- 
dicating a Yes response in order to avoid shock. 
The males and females in the Shock condition 
showed mean percentage estimates of 26.4% 
and 26.8%, respectively. For the No Shock 
condition, the corresponding percentages were 
12.7% for males and 15.4% for females. Again 
the differences between conditions were sig- 
nificant (p < .02; z = 2.47; Mann-Whitney 
U test), and the differences between sexes non- 


significant, 
Punishment and Latency of Response 


According to theory, the degree of dis- 
sonance after decision to accept punishment or 
to indicate reception of ESP is a function of 
the conflict before the decision. Conflict could 
be estimated from latency of responses, A 
measure of response latency was recorded by 
the multiple-event recorder—the length of 
time between the cessation of the buzzer sig- 
nals and the Yes or No response by the sub- 
ject. We would expect that the greater the 
latency, the greater the conflict before decision, 
and consequently, the greater the dissonance 
after decision. If our experimental conditions 
were appropriate, we would then expect greater 
latency in the Shock conditions than in the No 
Shock conditions, As can be seen in Table 1 
such differences in latency seemed to occur 
only for females. In the early trials, especially, 
the females in the Shock condition show con- 
siderably greater latency in response than do 
the males in the Shock condition or males and 
females in the No Shock condition. The shock 
does not seem to make much of a difference for 
males, insofar as latency of response is con- 
cerned. We find significant effects of the shock 
variation, sex differences, and a significant 
interaction effect (5 = 05; analysis of vari- 
ance). The mean latency of the females in the 
shock condition (.85 seconds) was also sig- 
nificantly greater than that of subjects in the 
other three conditions combined (.32 seconds; 
$ < 01). 

One might speculate as to why the males 
showed no increase in latency of response in 
the shock condition, particularly since they 
did indicate that they found the shock as pain- 
ful, on the rating scale, as did the females in 


TABLE 1 
Mean CORRECTED LATENCY or RESPONSE 
IN SECONDS 
Shock No Shock 
Female .85 :91 
Male -36 +28 


Note.—Mean of three lowest latencies were subtracted from 
mean of three highest latencies for each subject, thus correcting 
for individual differences, In this table, as in others, there are 13 
males and 13 females in each experimental condition. Effects of 
difference of sex, condition, and interaction significant at less than 
.05 level by analysis of variance. Females in Shock condition are 
significantly different from all other conditions (p < .01). 


the Shock condition. One reasonable possibility 
is that the males saw the threat of shock as a 
test of their masculinity, and perhaps gained 
some intrinsic reward from rapidly responding 
No. Some anecdotal reports at the end of the 
experiment support this view. Steiner (1960) 
and Tuddenham (1959) have found greater 
conformity tendency in females, and an asso- 
ciation of independence with masculinity. In 
terms of the amount of conflict produced by 
the shock, it appears in any case that the males 
in the Shock condition were more similar to 
males and females in the No Shock condition 
than to the females in the Shock condition. 


Pretest Belief Measure 


Before testing whether differences in con- 
flict before decision, and acceptance of punish- 
ment, were related to change in belief after de- 
cision, it was first necessary to establish that 
there were no significant differences in pre- 
test scores. Since classroom test scores were 
used to assign subjects to groups, we might 
have expected no differences in the scores 
taken at the beginning of the experimental ses- 
sion. Unfortunately, our assumption here was 
incorrect (see Table 2). Subjects in the Shock 
condition unaccountably showed higher pre- 
test belief scores than those in the No Shock 
condition. This lack of comparability in the 
pretest complicates the testing of our major 
hypothesis, 


Change in Belief as a Function of Dissonance 


The basic hypothesis predicted that when 
the subject received shock for reporting that he 
did not receive ESP, he would be especially 
likely to change his belief in the direction of 
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TABLE 2 
PRETEST BELIEF SCORES 
Shock No Shock 
Female 25.3 20.7 
Male 24.6 22.7 


Note.—Scores could range from 5 (no belief in existence of 
ESP) to 35 (strong belief in existence of ESP) with a score of 20 
being the midpoint. Difference between conditions was significant 
at less than the .05 level by analysis of variance (F = 4.828; df = 
1/48). Sex differences and interaction were not significant. 


TABLE 3 


Mean REDUCTION IN BELIEF IN ESP 
FROM PRETEST TO POSTTEST 


Shock No Shock 
Female 7.15 3.71 
Male 3.54 3.68 


Note.—Belief scores could vary from 5 (disbelief) to 35 (strong 
belief), on the AB scales, Over-all analysis of variance and inter- 
action was not significant. Assuming that male shock subjects 
were part of a common population with the nonshock subjects, 
with respect to dissonance, an analysis of variance was conducted 
which showed the female shock subjects to be significantly different 
from the others (F = 5.39; df = 1/50; p < .05). 


rejecting the existence of ESP. According to 
our original design, this was to be tested by an 
analysis of variance, comparing Shock and No 
Shock conditions on the amount of change in 
belief from pretest to posttest. However, since 
the male Shock subjects did not indicate con- 
flict as a function of the shock, we should not 
have been surprised that the over-all analysis 
of variance did not indicate a significant differ- 
ence (see Table 3). 

The similarity in latency scores of the three 
groups other than the female Shock condition 
(Table 1) would give us some justification in 
combining these three groups in testing our 
dissonance theory hypothesis. It was clear, of 
course, that such an analysis would have to 
be evaluated in light of the fact that it was 
conducted on the basis of data examined after 
the experiment had been completed. In addi- 
tion, it was important to consider the differ- 
ences in pretest scores. Thus we retested our 
hypothesis, using an analysis of covariance, 
which in effect controlled for the differences in 
Pretest scores. This analysis (see Table 4) did 
Provide support for our major hypothesis. 
Controlling for initial belief in ESP, the female 
Sübjects who continued to report *no ESP," 
ànd who experienced conflict in accepting such 


TABLE 4 


ANALYSIS OF COVARIANCE OF CHANGE CONSIDERING 
Pretest BELIEF SCORES 


Female All 
shock others 
(N = 13) (N = 39) 


Mean pretest belief 25.3 2237 
Mean amount of negative change Mae, 3.9 


Note.—Analysis of variance on pretest scores significant at 
less than .01 level (F = 64.3; df = 1/50); analysis of variance on 
change significant at less than .05 level (F = 5.39; df = 1/50); 
analysis of covariance for change significant at less than .02 level 
(F = 6.46; df = 1/49), 


TABLE 5 
MEAN ArtirupE SCORES 
Pretest Posttest Change 
Female, Shock 21.5 19.8 —1.7 
Male, Shock 23.5 22.8 —0.7 
Female, No shock 22.8 20.8 —2.0 
Male, No shock 23.8 21.7 —24 


Note.—Scores could range from 5 (indicating great dislike 
for ESP) to 35 (great liking for ESP). Analysis of variance on 
pretest scores and change scores yielded no significant differences, 


punishment, were significantly more likely to 
change their belief in direction of rejecting 
ESP. This effectively reduced dissonance, 
justifying their acceptance of the punishment 
by convincing themselves even more firmly 
that ESP did not exist. 


SuMMARY AND DISCUSSION 


In summary, we predicted from Festinger's 
(1957) theory of cognitive dissonance that 
subjects who consistently denied receiving 
ESP images would be especially likely to re- 
duce their belief in ESP if their denial of re- 
ception led to shock. Such a reduction in be- 
lief would reduce dissonance between the 
knowledge that they had reported "no recep- 
tion" and the knowledge that they had thus, 
in effect, accepted punishment. The prediction 
was confirmed for female subjects. Our results 
were also consistent for male subjects, if we 
allow for the evidence that males also seemed 
to find the shock less dissonance producing. 

These findings are especially significant 
since they would seem contrary to what we 
might expect from "common sense" or from a 
reinforcement theory. From the latter, we 
might have expected that the shock would 
serve as a negative reinforcer for the response 
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of pushing the “no reception” key, and that 
this would generalize to a “no ESP” response 
on the belief questionnaire. Specifically, we 
would expect that shocking a “no ESP” re- 
sponse would then lead the person to indicate 
that he believes even more strongly in ESP, 
i.e., making a “yes ESP” response on the ques- 
tionnaire, Yet the opposite was obtained in 
this experiment. Is it then possible that our re- 
sults could be attributed to some generalized 
change in attitude? Perhaps the series of shocks 
increased the negativeness of the entire ex- 
perimental situation, and served as a negative 
reinforcer for any ESP response. Then on a 
questionnaire any response indicating exist- 
ence of ESP or liking for ESP would be 
avoided. Fortunately, our measures allow us 
to ascertain change in attitudes toward ESP 
directly, by using the evaluative dimension 
(A scale) of the AB scales. These scores ranging 
from 5 (indicating great dislike) to 35 (great 
liking for ESP) show no significant change in 
any of the experimental conditions (see Table 
5). This explanation can also, therefore, be re- 
jected. 

Our hypothesis and findings are, however, 
quite consonant with Festinger’s ( 1957) analy- 
sis of data from the study by Kelman (1953), 
and also complement the study by Festinger 
and Carlsmith (1959), Festinger and Carlsmith 
found that if reward was sufficient to bring 
about compliance, the greater the reward, the 
less the dissonance, and the less beliefs would 
change so as to become consistent with be- 
havior. They would predict the same effect if 
punishment rather than reward had been used 
to bring about complicance. We found that 
when punishment was insufficient to bring 
about compliance, the greater the punish- 
ment, the greater the dissonance, and the more 
beliefs would change so as to become consonant 
with this noncompliance. We would predict the 
same effect if reward had been used rather than 
punishment. Clearly a study is now needed to 
investigate the effects of reward or punish- 
ment at all four levels: very low, moderate and 
just insufficient to bring about compliance, 
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moderate and just sufficient to bring about 
compliance, and very high. 
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ease with which CRs are acquired and 

retained is dependent upon the person- 
ality factor of introversion-extraversion. 
Franks (1956) has examined this hypothesis 
by comparing eyeblink conditioning of clini- 
cally diagnosed hysterics with that of “dys- 
thymics” (i.e., patients suffering from anxiety, 
reactive depressions, and obsessions or com- 
pulsions). These two groups are identified as 
“neurotic extraverts" and “neurotic intro- 
verts,” respectively, in Eysenck’s personality 
schema. The hysterics were found to acquire 
the conditioned eyeblink more slowly than the 
dysthymics, who conditioned quickly. No 
difference in conditioning was found between 
anormal group and a neurotic group contain- 
ing hysterics and dysthymics. Similar results 
also were obtained in another study (Franks, 
1957) which examined the conditioned eye- 
blink of paid normal subjects in relation to 
personality tests of extraversion and neu- 
roticism. More extraversive subjects, defined 
in terms of higher test scores on the Maudsley 
Personality Inventory (MPI) acquired the 
CR more slowly and extinguished this response 
more quickly than did less extraversive (i.e., 
more introversive) subjects, but no relation 
between conditioning and meuroticism test 
Scores was observed. 

The conditioned galvanic skin reflex (GSR) 
was recorded simultaneously with conditioned 
eyeblink in Franks’ (1956) study. GSR con- 
ditioning was developed more slowly in hys- 
terics than in dysthymics, but the differences 
between the two groups were slight, and ex- 
tinction could not be examined because GSR 
adaptation occurred too rapidly under Franks’ 
testing procedures. He suggested that the 
difference between the groups in GSR con- 
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ditioning might be reduced because of the low 
intensity airpuff which was employed as the 
UCS. He noted that this weak UCS sometimes 
failed to elicit the unconditioned GSR after 
the first few trials. Vogel (1960b), using a bell 
as the UCS, examined GSR conditioning of 
alcoholics in relation to the personality vari- 
ables of extraversion and neuroticism. Al- 
though acquisition and extinction of the con- 
titioned GSR was found to relate as predicted 
to extraversion scores on the MPI, the rate 
of CR acquisition was observed also to 
correlate with neuroticism scores. Vogel’s 
(1960b) observations appear to contradict 
Eysenck’s theory, which holds extraversion 
and neuroticism to be independent with ex- 
traversion as the variable relevant to condi- 
tioning behavior. 

Since there has been no research specifically 
designed to compare alcoholics and nonalco- 
holics in GSR conditioning, the correlations 
reported by Vogel (1960b) between condition- 
ing and personality may apply only to alco- 
holics. This possibility was investigated in the 
present study. GSR conditioning was con- 
ducted with alcoholics and with nonalcoholic 
“normals” (i.e., subjects not hospitalized for 
alcoholism or other psychiatric disorder). 
Because these two groups could be expected to 
differ in neuroticism, and because the author’s 
previous study found alcoholics’ neuroticism 
scores correlated with speed of acquisition, 
covariance analyses were employed to permit 
an examination of GSR conditioning in rela- 
tion to extraversion, statistically free from 
possible influence by differing neuroticism 
scores. The experimental hypothesis, in accord 
with Eysenck’s theory, was formulated as fol- 
lows: 

1. When differences in neuroticism are con- 
trolled in alcoholic and nonalcoholic groups, 
the conditioned GSR is elicited in fewer trials 
and is more resistant to extinction in intro- 
versive than in extraversive subjects. 

In addition to the finding that, among alco- 
holics, extraversion was associated with slower 
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acquisition of the conditioned GSR, Vogel 
(1960b) also noted that some alcoholics failed 
to display the conditioned response within the 
maximum number of trials permitted in her 
procedure. On the assumption that slower 
conditioning is related solely to the extra- 
version variable, the following hypothesis was 
set forth: 

2. Alcoholic and nonalcoholic individuals 
with extraversive test scores more frequently 
fail to display a conditioned GSR than those 
with introversive scores. 


METHOD 
Subjects 


The sample of alcoholics consisted of 48 males, all 
of whom were admitted to the Alcoholism Research 
Foundation Clinic during the time the study was being 
conducted. All male patients in the clinic were tested, 
provided they could read English sufficiently to com- 
plete a questionnaire. Conditioning data could not be 
obtained for eight subjects who failed to condition 
within the maximum number of trials permitted in the 
conditioning procedure. These subjects were considered 
separately for investigation of the second experimental 
hypothesis, 

The control group contained 41 adult male volun- 
teers who were not hospitalized for alcoholism. The 
majority of these subjects were first and second year 
University of Toronto summer school students. Volun- 
teers were obtained also from the Ontario College of 
Education summer school course and from professional 
and technical personnel who visited the building where 
the experiment was being conducted, One subject in 
this group failed to display conditioning in the maxi- 
mum number of acquisition trials allowed and was 
used only in the testing of the second hypothesis. 


Procedure 


All subjects completed the MPI (Eysenck, 1956), 
and the alcoholics also reported on certain drinking 
behavior. The latter data were obtained for another 
study reported elsewhere (Vogel, 1961). 

The short term of hospitalization and the small 
patient population of the clinic made selection of only 
extremely high and low extraversion (E) score subjects 
highly impractical. For this reason, all available subjects 
were employed in this study. E scores on the MPI 
from both university students and alcoholic patients 
at the Alcoholism Research Foundation Clinic had been 
found previously (Vogel, 19602) not to differ signifi- 
cantly from the mean score of 24.6 reported by Eysenck 
(1956) for normal males, The score of 24.6 was selected, 
therefore, as a measure of central tendency in E scores 
appropriate to samples of both alcoholics and non- 
alcoholics, Subjects were dichotomized into two cate- 
gories—those below and those above 24.6, For 
convenience, these two categories are identified, re- 
spectively, as "introversive" and "extraversive." 
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In the alcoholic sample of 40 cases, the mean E 
Score was 20.98 (S.D — 9.42) and the mean neuroticism 
(N) score was 35.85 (SD — 11.19). Twenty subjects 
had E scores less than 24.6, and the remaining alco- 
holics had E scores greater than 24.6. The nonalcoholic 
group was also composed of 20 subjects in each of these 
two categories. The mean E score for the 40 nonalco- 
holics was 23.77 (SD = 9.42) ; the mean N score was 
20.98 (SD — 11.59). 

The GSR conditioning procedure and measures 
have been fully described elsewhere (Vogel, 19603) 
and may be summarized briefly. Subjects were tested 
individually for conditioning in a semisoundproof 
room. The subject was told that the test was one of 
relaxation or repose and that his task was to spell the 
Syllables which were presented to him by a memory 
drum. 

The CS was a nonsense syllable, Lay, which ap- 
peared 16 times, randomly placed among 35 other low 
association value syllables (Glaze, 1928). The memory 
drum presented a different syllable every 6 seconds. 
The UCS was an unpleasantly loud doorbell buzzer 
which reliably elicited the UCR of abrupt change in 
skin conductance. The first presentation of the CS was 
followed in .5 second by the UCS. A 50% reinforcement 
Schedule was employed so that alternate presentations 
of the CS were reinforced. Skin resistance was measured 
by a Lafayette 601-A GSR amplifier, using finger 
clamp electrodes and an Esterline-Angus pen recorder 
which traced the GSR continuously. Every 6 seconds, 
immediately before presentation of a new syllable by 
the memory drum, the GSR amplifier was balanced to 
the subject's skin resistance at that moment. This 
procedure automatically centered the Esterline-Angus 
pen at 0 on the record chart and permitted an easy 
comparison of GSR responses to each syllable during 
the conditioning procedure. Accurate identification. of 
response to the CS and UCS was permitted by a side 
pen which automatically marked the chart when these 
stimuli were presented. 

The criterion of conditioning was similar to that 
employed by Welch and Kubis (1947a, 1947b), which 
required three consecutive GSR reactions to the CS 
(taj) when unaccompanied by the UCS (bell). A GSR 
reaction was defined as a change in skin resistance to 
the CS which was larger than the subject's largest 
GSR occurring to the intervening buffer syllables. ‘The 
number of reinforcements (ie., the number of times 
the bell was presented) prior to achievement of this 
criterion constituted the count of trials to acquire the 
CR. A larger score thus indicated slower conditioning. 

Vogel (1960b) noted that this conditioning pro- 
cedure could not satisfactorily be continued much 
longer than 20 minutes because of increasing boredom 
or restlessness on the part of the subject. Slight shifts 
in posture, sighs, or comments aside to the experimenter 
all altered skin conductance somewhat, and under 
such conditions a record of conditioning could not 
reliably be obtained. For this reason, a limit of 15 
minutes was placed on acquisition trials. This duration , 
permitted 24 presentations of the CS accompanied by 
UCS and 24 unreinforced presentations. The maximum 
Possible score for a subject was thus 24, and if condi- 
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tioning had not then been achieved, the subject was 
grouped with the others who “failed to condition.” 

Extinction trials commenced immediately after the 
subject displayed the CR. The UCS was no longer 
presented, and the subject continued to spell syllables 
as they were presented by the memory drum. His 
GSR responses were recorded, and the number of 
CRs to the first 10 unreinforced presentations of the 
CS were counted. A CR was defined as a GSR to the 
CS which was larger than the subject’s largest GSR 
to the intervening neutral syllables. This score in- 
dicated the number of CRs during 10 extinction trials. 
The subject thereafter continued spelling syllables 
until “extinction” occurred. “Extinction” in this case 
was defined as three consecutive presentations of the 
unreinforced CS, all of which failed to elicit a CR. In 
many cases, this had occurred by the time the 10 
standard extinction trials were completed. Two scores 
on extinction were thus obtained for each subject: the 
number of CRs displayed in 10 unreinforced trials, and 
the number of CRs prior to extinction. In both cases, 
a larger score was considered to indicate greater CR 
resistance to extinction. 


RESULTS 


To examine GSR conditioning of alcoholic 
and nonalcoholic subjects in relation to ex- 
traversion, 2 X 2 covariance analyses were per- 
formed.? This technique was employed to ad- 
just the conditioning scores for any influence 
attributed to neuroticism (N) score differ- 
ences in these two groups. 

The covariance analysis of acquisition scores 
from alcoholic and nonalcoholic subjects is 
summarized in Table 1. After adjusting scores 
for the effect attributable to neuroticism, inter- 
action effects were nonsignificant. Alcoholic 
and nonalcoholic groups do not appear to 
differ in the number of acquisition trials, but a 
Significant effect for personality is obtained 
(F = 78.58, df = 1/75, p < .01). Introversive 
subjects in alcoholic and nonalcoholic groups 
displayed the conditioned response in an 
average of 6.12 trials, whereas extraversive 
subjects averaged 13.05 trials before the CR 
was displayed. 

Table 2 summarizes the covariance analysis 
of CRs displayed in 10 extinction trials. No 
Significant interaction effect was obtained, and 
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TABLE 1 
COVARIANCE ANALYSIS OF NUMBER OF TRIALS 
TO ACQUIRE A Conpimionep GSR wits N 
Scores oF THE MPI as COVARIANT 


Source Hr diaued aes b 
Group 1 7.9 | 0.65 | >.10 
Personality 1 | 950.8 | 78.58 | <.01 
Group X Personality} 1 20.7 | 1.71] >.10 
interaction 
Within cells 75 | 12.1 
TABLE 2 


COVARIANCE ANALYSIS OF NUMBER OF CONDITIONED 
GSRs OBSERVED DURING TEN EXTINCTION 
TRIALS with N SCALE SCORES OF 
THE MPI as COVARIANT 


Source ar SES E b 
Group 1 0.7 | 0.21 | >.10 
Personality 1 | 148.1 | 43.56 | <.01 
Group X Personality} 1 6.2 | 1.80] >.10 
interaction 
Within cells 75 3.4 


TABLE 3 
CovARIANCE ANALYSIS OF NUMBER OF CONDITIONED 
GSRs OBSERVED Prior TO EXTINCTION 
WITH N SCALE Scores ON THE MPI 
AS COVARIANT 


Source ara Sante e ? 
Group 1 | 8.15] 0.55| 2.10 
Personality 1 |572.33 | 38.33 | <.01 
Group X Personality} 1 6.61 | 0.44 | >.10 
interaction 
Within cells 75 | 14.93 


no significant difference was observed between 
alcoholic and nonalcoholic subjects. The sig- 
nificant effect for personality (F — 43.56, df 
= 1/75, p < .01) indicates that introversive 
alcoholics and nonalcoholics displayed more 
conditioned responses (mean score of 5.82) 
during 10 extinction trials than did extraver- 
sive subjects in these two groups (mean score 
of 3.20). 

The covariance analysis of number of condi- 
tioned GSRs observed prior to extinction is 
summarized in Table 3. The only significant 
effect observed is that attributable to person- 
ality (F = 38.22, df = 1/75, p < .01). In- 
troversive subjects displayed an average of 


420 


8.67 CRs prior to extinction, but extraversive 
subjects only averaged 3.47. The evidence 
from the analyses presented in Tables 2 and 3 
both indicate that the conditioned GSR was 
more resistant to extinction in alcoholic and 
nonalcoholic subjects having introversive test 
scores than in those having extraversive scores, 

An examination of personality test scores ob- 
tained in the group of nine subjects who did 
not display conditioning in the maximum num- 
ber of acquisition trials allowed revealed a 
striking agreement with the experimental 
hypothesis. All subjects who “failed to condi- 
tion” had E scores greater than 24.6 and thus 
would be categorized “extraversive” in this 
study. 


Discussion 


Introversive alcoholics and nonalcoholics 
displayed a conditioned GSR which was more 
quickly elicited and more resistant to extinc- 
tion than that displayed by extraversive sub- 
jects. Since covariance analyses were per- 
formed in this study, the differences in GSR 
conditioning observed between introversive 
and extraversive subjects are statistically in- 
dependent of differing neuroticism scores, This 
finding is in accord with Franks’s (1956, 1957) 
studies of conditioned eyeblink and may be in- 
terpreted to support Eysenck’s theory which 
asserts that conditionability is dependent upon 
extraversion and is unrelated to neuroticism, 
An explanation for the seemingly contradictory 
observation (Vogel, 1960b) of an association 
between neuroticism and GSR conditioning in 
alcoholics is suggested in Eysenck’s recently 
revised MPI manual. He reports that while 
extraversion and neuroticism scales are almost 
entirely independent, a slight negative correla- 
tion is obtained, Particularly if the sample 
contains only subjects with high neuroticism 
scores. Since alcoholics are found to have high 
neuroticism scores, such a correlation could be 
expected in Vogel’s (1960b) sample and may 
explain her observed correlation between 
neuroticism and GSR conditionability, 

The prediction that alcoholic and nonalco- 
holic subjects failing to display a conditioned 
GSR tend to have personality test scores classi- 
fied as extraversive rather than introversive 
was supported by the observation that all nine 
nonconditioned subjects had extraversive 
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scores. Since eight of these were alcoholics, 
this finding could suggest that difficulty in 
GSR conditioning under the procedures of this 
study is more typical of alcoholics than of in- 
dividuals not hospitalized for alcoholism, In 
contrast to a mean extraversion score of 29,05 
obtained by the 20 extraversive alcoholics 
who displayed GSR conditioning, the alco- 
holics who did not condition had a mean score 
of 32.38. This might also suggest that failure 
to display GSR conditioning is associated with 
an extreme degree of extraversion, indicated 
by unusually high scores on the MPI. 

Some of the subjects who displayed the 
conditioning pattern verbalized no awareness 
of the systematic presentation of the bell with 
the CS, in many instances identifying the bell 
as a ringing telephone. Although it is almost 
certain that the academic backgrounds of some 
of the nonalcoholics would make them sophis- 
ticated in terms of conditioning techniques, 
these subjects were not observed to condition 
in consistently fewer trials than those who were 
completely naive to this procedure. Speed of 
acquisition and extinction of the CR appeared 
to be unrelated to the subject’s prior knowledge 
of conditioning principles or to his failure to 
associate the occurrence of the bell with the 
CS. This observation is in accord with results 
from other studies of GSR conditioning (Wall 
& Guthrie, 1959) and of conditioning the 
autonomic finger twitch response (Hefferline, 
Keenan, & Harford, 1959), indicating that 
verbal instructions or prior knowledge of the 
required response did not influence the speed 
with which the CR was established or ex- 
tinguished. 


SUMMARY 


Measures of GSR conditioning were ob- 
tained from 40 inpatients of an alcoholism 
clinic and from 40 nonalcoholic males. These 
subjects also completed the Maudsley Person- 
ality Inventory (MPI) which contains an in- 
troversion-extraversion scale. In accord with 
the experimental hypothesis, the CR was found 
to be more quickly acquired and more re- 
sistant to extinction in introversive than in 
extraversive subjects. Alcoholics and nonalco- 
holics did not differ either in mean extraver- 
sion scores or in mean rate of establishing and 
extinguishing the CR. In accord with the 
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hypothesis that more extraversion is associated 
with poorer conditionability, subjects who 
failed to display conditioning within the maxi- 
mum number of training trials allowed were 
found to have extraversive test scores. It was 
noted that these results are consistent with 
some other eyeblink conditioning studies and 
might be interpreted as lending support to 


Eysenck's personality theory. 
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vast amount of material is generated in 
A therapeutic interviews, and adequate tech- 
niques for analyzing such material are in demand. 
In this paper, a technique of language analysis is 
applied to examining the changes in the speech of 
a schizophrenic patient over approximately one 
year of his psychotherapy. The study is illustrative 
of a method which has developed from the defini- 
tion of meaning as a matrix of verbal associations 
and of similarity of meaning as the extent of com- 
monality of associations (Bousfield, Cohen, & 
Whitmarsh, 1958). Words which appear within a 
given context in free speech or writing are taken to 
provide the meaning of that context, 

In making a contextual analysis, words are cate- 
gorized on the basis of similarity or synonymity of 
reference in order to reduce the vast number of 
different items in our language which tend to ob- 
scure underlying similarities. The actual analysis 
of a context then involves the tabulation of the 
frequencies of categories which appear in close asso- 
ciation with each other, Once a tabulation of the 
categories of associations has been made, it is 
Possible to compare the distribution of associates 
in one language segment with that in some other 
language segment. Details of the technique have 
been set forth elsewhere (Laffal, 1959). There are 
94 categories, into no more than 2 of which any 
word may be categorized. In the course of review- 
ing large amounts of verbal material, a dictionary 
of the more common words that are appropriately 
scored under each category has been developed. 
The availability of this dictionary makes the cate- 
gorizing process fairly reliable, The 94 categories 
are shown in Table 1.2 The actual application of 
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* The categories listed in Table 1 are not to be 
thought of as the best Possible set, or as a final set of 
categories. Each research application of the set of 
categories contributes toward. Sharpening and defining 
the individual categories, and toward clarifying their 
relative usefulness, 
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the categories to a sample of speech is illustrated 
in a selection from the speech of the patient whose 
language is analyzed in this paper. 


Teason hollow male Teason 
Tm thinking of the cave man. It wouldn't teach 


hollow 
male 
him 
Sex hollow 
join male reason join 
of marriage. He didn't know how the bonds of 
sex 
join help help t 
marriage were implemented to the satisfaction of a 


lead hollow 
Sacred male go female female 
deity, so he could go from woman to woman, from 


hollow 
Teason male 
a lesson because he 


reason good 
didn’t know the virtues 


group group group reason .. big 
family to family and society thinks that might double 


female difficult female 
the woman’s problem and that they would have to 


open open 


possess eat Possess disagree structure 
not only find food, but find a different pattern 
living 
of life, 


The problem posed in the present study was to 
examine the speech of a schizophrenic patient in 
the course of a year of psychotherapy in which 
certain gross, readily identified and characterized 
behavioral changes took place, At the beginning of 
the treatment the patient was on a locked ward, 
but had occasional privileges to leave the ward in 
his own custody. He was unpredictable in his 
behavior, showed inappropriate affect, laughing 
and smiling apparently in response to his own 
thoughts, and tended to become incomprehensibly 
abstruse in his conversation. Around 6 months 
after the beginning of treatment, the notes in the 
patient's chart indicate that in the therapy the pa- 
tient was verging closer to significant material 
about the relationship with the therapist, but along 


* The patient, treated by the author, continues to 
be in treatment after 4 years. With occasional relapses 
the patient has made significant social improvement, 
to the point where he is presently attending college 
during the day, while living in the hospital. 
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TABLE 1 
Worp CATEGORIES 

Categories 
agree *disagree* law crime 
animal *nature lead follow 
*begin* end* living* dead 
*big* *little male *female 
"body feature? —— *body function *near far 
calm. *upset* *new* ancient* 
clean dirty open shut 
*come* *go* *play *work* 
commodity *money* possess want 
cure *illness* real *unreal* 
*durable* change reason absurd* 
earth sea *sacred* profane 
easy difficult* see* hear 
eat drink Separate join* 
*essential* trivial *sex homosex 
fast slow* *sharp blurred 
“forward* *back* some all 
free confine speak *write* 
*good* bad* *strong* weak 
help *hurt *structure* ruin 
*hollow* hill true false 
hot cold up *down* 
in *out* young mature* 
individual "group 


Note,—An asterisk preceding a word indicates it was one of 
the 34 categories used in the correlations of the interviews of 
Patient A, Patient B, and Schreber's writing. An asterisk following 
a word indicates it was one of the 34 categories used in the correla- 
tions of the early, middle, and late interviews of Patient A 


with this were reports of increasingly psychotic 
ways of speaking, and also that the patient was 
hallucinating during the treatment hours. There 
were also reports of the patient shouting at and 
bumping purposely into sicker patients. There 
followed a period in which the patient was re- 
stricted to the ward, and was extremely difficult to 
communicate with, refusing to come to the doctor’s 
office and walking out after a few minutes if he did 
come, At the end of the year the patient was 
communicating better in therapy, talking about 
Passes at home, and assumed a privilege card in 
order to work in the library. That this progress was 
nd transitory is evidenced by the fact that shortly 
i terward the patient went to an open ward where 
n was elected president of the patient government. 
n the first year of treatment, the patient made a 
clearcut social improvement. 

It seemed reasonable to suppose that with the 
patient's. change toward social adaptation, his lan- 
D might also show some changes with respect 
= T and distribution of word categories. This 

d be examined both in terms of the overall 
Category changes in interviews taken from various 
is of the treatment, and by examining in 

etail the shifts in associates accompanying specific 
Categories, 


Before actually looking in detail at the way the 
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patient's language changed, it is worthwhile to see 
whether the technique of language analysis which 
has been described is capable of discriminating 
between the styles of speech or writing of different 
individuals. If the technique is capable of doing 
this, and can also reflect changes within the style 
of one individual, its potential usefulness is obvi- 
ously very broad. Verbal material from three dif- 
ferent sources was therefore analyzed in order to 
explore individual differences in style, in addition 
to examining the productions of the individual 
patient at various points in his therapy to see 
whether his own style had changed with his shift 
toward social adaptation. 


MerHop 


In a prior study (Lafíal, 1960) a large amount of 
written material from the autobiography of a psychotic 
patient, had already been categorized using the 94 
categories of the method of contextual associates. A 
number of psychotherapeutic interviews with another 
psychotic patient, Patient B, were also scored into the 
categories; and a sampling from the material of the 
patient whose interviews over a year of therapy were 
to be analyzed, was taken for the purpose of the inter- 
individual comparisons. There were thus available 
separate profiles of the 94 categories from three different 
sources: The written autobiography of the patient 
Schreber, the interview material of Patient A, and the 
interview material of Patient B, Patient A being the 
one whose therapeutic interviews were to be further 
analyzed. 

The method of comparing profiles of categories 
was to correlate them, using Pearson r, with the 
categories substituting for “individuals” and the source 
of the data substituting for “variables.” Thus Patients 
A, B, and S were the variables, and there were 94 
categories or “individuals” to be compared. The actual 
scores were the frequencies of occurrence of the various 
categories in the speech samples of each of the patients, 

Reliability of profiles is determined by randomly 
splitting the speech sample from each patient, taking 
alternate lines, into two subsamples, and correlating 
the profiles of these subsamples. In addition, the availa- 
bility of two profiles for any patient, makes possible 
four cross-patient correlations for any pair of patients, 
The distribution of these four correlations may be 
used as the basis of tests to determine whether S is 
more like A, than he is like B, or whether A and B 
are more like each other than either of them is like S. 

Despite the fact that the speech samples are cate- 
gorizable into 94 different categories, only 34 of the 
categories are used in the correlations, the remaining 
60 categories being discarded by the application of 
certain systematic procedures for the elimination of 
categories. There are two reasons for this elimination 
procedure. The first has to do with the fact that cate- 
gories which occur either with consistently high fre- 
quency or consistently low frequency in all samples of 

do not discriminate between the samples, but 
only lead to artifactually high correlations. For example, 
if the patient, in all of his speech about any topic 
whatever, uses certain categories very frequently, and 
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others rarely, these categories would not discriminate 
between his speech about one topic or another, but 
would tend to obscure any differences which might 
exist in his speech about the different topics. The 
second consideration in the elimination of categories is 
the product of a systematic exploration of the changes 
in correlations between profiles of speech samples with 
the elimination of various categories from the correla- 
tions. On the basis of a consistent finding that two 
speech samples made up by taking alternate lines of 
transcribed text from the corpus of speech of one in- 
dividual, correlate very highly with each other as 
compared to their correlations with samples from other 
individuals, a systematic study was made of the 
changes in these self-correlations versus other-correla- 
tions as categories were eliminated from the correla- 
tions. Thus beginning with no eliminations from the 94 
categories, and progressively eliminating more and 
more of the consistently high frequency categories and 
the consistently low frequency categories, in various 
combinations, that arrangement of eliminations was 
sought which would give the greatest difference between 
the self-correlations from a single corpus, versus the 
other-correlations against some other corpus of ma- 
terial. After the categories were ranked from high to 
low based on explicit criteria for such rankings, various 
combinations of eliminations were tried in order to see 
which would give the highest difference between self- 
correlations and other-correlations. The combinations 
of eliminations tried were: for high categories, 0, 4, 
8, 12, 14, and 16; for low categories, 0, 8, 16, 24, 32, 44, 
and 52. The arrangement that seemed to give the 
greatest consistent difference between self-correlations 
andother-correlations, was that in which 16 high and 44 
low categories were eliminated, 


RESULTS AND DISCUSSION 


Tables 2 and 3 show the reliabilities and the 
intercorrelations of speech samples from thera- 


‘In ranking a set of profiles of categories, the 
highest ranked category was taken as that which, for 
all of the profiles being compared, had the highest low 
number. Thus if six profiles were being compared, 
and there were three categories which had frequencies 
as below, the highest ranked category would be Cate- 
gory Number 1, since its lowest number is 12; the 
second highest ranked category would be Category 
Number 2, since its lowest number is 11. 


CATE- 
ae FREQUENCIES IN PROFILE 
BER A B c D E Pr 
1 12 17 2t 16 13 30 
2 22 19 31 14 1 12 
3 16 21 14 9 13 12 


In ranking the low frequency categories, lowest rank 
was assigned to categories with zero frequencies, then 
with frequencies not exceeding one in any of the vari- 
ables, then with frequencies not exceeding two in any 
of the variables, etc. For each set of profiles com- 
pared it is necessary to apply this procedure independ- 
ently. The 34 categories used in correlating profiles 
are thus somewhat different for different sets of data. 
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RELIABILITIES AND INTERCORRELATIONS OF SpE 
SAMPLES FROM PATIENTS A AND B, AND 
ScHREBER'S AUTOBIOGRAPHY 


(Number of categories correlated — 34) 


Patient A Patient B 
At As Bi Ba 
Patient 
A E -801 
Patient Bı .099 .027 
B B: .159 -127 .833 
Schreber S: —.008 | —.031 | —.472 | —.398 
8: —.057 | —.136 | —.537 | —.510 
TABLE 3 


(Number of categories correlated — 94) 


Patient A Patient B. 

A As B B: 
Patient 
A A: .932 i 
Patient Bı +829 | .812 id 
B B: -845 | .822 | .956 ix 
Schreber S; -690 | .672 | .630 | .634 P. 

S: -653 | .652 | .592 | .580 | .948 


peutic interviews with Patients A and B, and from 
Schreber’s autobiography, with no categori 
eliminated (94 remaining), and with 60 categori | 
eliminated (34 remaining), The 34 remaining cate- 4 
gories are identified by asterisks before the words, 
in Table 1, There are clear distinctions betwe 


associates distinguishes between the styles oi 3 
Speech and writing of separate individuals. The 
technique was also applied to the analysis of thera- 
Peutic interviews obtained over the course of a 
year of treatment of Patient A. Four interviews. 
dated August 27, Se] 


February 11, February 20, a 
of 1957; and five interviews dated October 3, 
October 8, October 10, October 24, and October 31 
of 1957 were taken for the analysis. The selection 
of these particular interviews was determined 

their relatively close grouping in a series of 30 inter- 
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views which had been recorded during the year of 
treatment. The first four interviews were at the 
beginning of treatment of the patient; the second 
four occurred about 6 months after treatment 
began; and the third set of interviews was about a 
year after treatment began. The third set con- 
tained five interviews, since two were relatively 
brief. 

The initial hypothesis tested was that the three 
sets of interviews would show a shift in the profiles 
of contextual associates, reflecting improvement 
in the patient's psychiatric condition. It was pre- 
dicted that the profiles of the last set of interviews 
would be most different from those of the early 
and the middle sets of interviews, reflecting the 
fact that the last interviews were obtained under 
conditions of relative psychological integration. 
Table 4 shows the reliabilities and the intercorrela- 
tions of the profiles of contextual associates taken 
from the early, middle, and late interviews in the 
psychotherapy of Patient A. The 34 categories 
used in the correlations are identified by asterisks 
after the words in Table 1. 

In Table 4 the reliabilities for the early, middle, 
and late interviews range from .704 to .752. Both 
the early and middle interviews differ radically 
from the late interviews, and are more like each 
other than like the late interviews. 

Since the technique of analysis of contextual 
associates discriminates successfully between the 
styles of separate individuals, and also between the 
productions of one individual in manifestly dif- 
ferent psychological states, it was decided to 
examine in detail the contextual associates of spe- 
cific categories in the early, middle, and late inter- 
Views, to determine whether the technique would 
Show shifts in profiles associated with the specific 
categories. For this purpose, eight categories were 
selected for examination. The basis of selection of 
these categories was merely that they occurred 
with sufficient frequency in each of the three sets 
of interviews to make possible the development of 
Separate profiles for each of the three sets. No 
a priori psychological grounds were used in the 
Selection, so that no special significance is attribu- 
ted to the categories selected. The eight categories 
Were: join, disagree, possess, go, hollow, agree, 
help, and reason. Contexts of these categories were 
defined as that stretch of speech from three lines 
above a line in which the specific category ap- 
Peared, to three lines below the line in which the 
Specific category appeared. In each set of inter- 


*The definition of a context as that stretch of 
Speech which, in the typed transcript, begins three 
lines above the line containing a key word, and ends 
three lines after a line containing the key word, was 
developed in a prior study (Laffal, 1960), as a result 
of empirical exploration of stretches of speech of 
different lengths, 
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TABLE 4 
CORRELATIONS OF PROFILES OF CONTEXTUAL 
ASSOCIATES FROM EARLY, 
MIDDLE, AND Late INTERVIEWS 
OF Patient A 
(N = 34) 
Early Middle Late 
Ei E: Mı M: Lı 
Early Es +752 
Mı -323 | .400 
Middle M» | .300| .380| .707 
Late Lı |—.033 |—.078 |—.044 |—.189 
L: .156 |—.012 | .110 |—.175 | .704 


views, as many such contexts of the specific cate- 
gories were taken as would provide enough scores 
for reliability check. The analysis then proceeded 
as above. There were two profiles of scores available 
for each category, for each set of interviews. The 
categories were ranked from high to low depending 
on distribution of frequencies of occurrence in the 
six profiles. The 16 highest categories and 44 lowest 
categories were eliminated before doing correla- 
tions. 

The distributions of correlations for the eight 
categories are not listed, since the pattern followed 
by the correlations basically repeats that of 
Table 4, with reliabilities being relatively high 
compared to other correlations, and the profiles of 
the last interviews being considerably different 
from those of the early and middle interviews. 

The question raised by the fact that the profiles 
of specific categories seemed to follow the general 
trend of the interviews on the whole, was whether 
contexts of specific categories merely reflected the 
general speech pattern, or whether there was some 
specificity about the profiles related to the specific 
categories. In order to examine this, the contexts 
of different categories from the same sets of inter- 
views were correlated. Thus contexts of join and 
disagree in the early, middle, and late interviews 
were correlated, as were contexts of possess and 
go, hollow and agree, and help and reason. In each 
case, the determination of which categories to 
eliminate was independent and followed the pro- 
cedure described. 

A Mann-Whitney rank test (Mosteller & Bush, 
1954) of the reliabilities of self-correlations of cate- 
gories versus the cross-correlations of different 
categories from the same sets of interviews, shows 
that the reliabilities of self-correlations were sig- 
nificantly higher than the cross-correlations, at the 
.05 level of confidence. The mean of self-correla- 
tions was .521 with a range of .220—.832. The mean 
of the cross-correlations was .372 with a range of 
147—557. 

This demonstrates that there is a specificity in 
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the contexts of individual categories which can be 
distinguished from the general drift of the body of 
verbal material out of which the categories are 
drawn. In the present study no further effort was 
made to explore shifts in the contexts of individual 
categories, since no evidence is as yet available as 
to the psychological significance of the individual 
categories. 

The data were examined in one other way, in 
order to throw some light on the nature of the 
change in the patient’s speech in the course of 
psychotherapy. This further examination was also 
the result of an effort to determine whether the 
differences between Schreber’s writing and the 
speech of the two patients, might not be paralleled 
by differences in relative diversity or constriction 
in the use of the 94 categories. No specific hy- 
potheses were involved in this examination al- 
though it was believed (erroneously, as it turned 
out) that wider use of the 94 categories and more 
uniform distribution of the responses among the 
categories ought to be concomitants of better psy- 
chological integration. 

The diversity of categories used was examined 
by computing entropy or average information 
(Shannon & Weaver, 1949) of the 94 categories 
used by Patients A and B, and by Schreber, and in 
the early, middle, and late interviews of Patient A. 
The formula for the computation of entropy is 
H = —)°p; logs p;. In the profile of categories, p; 
is the frequency of occurrence of a particular cate- 
gory, divided by the total number of occurrences 
of scorable categories. 

The entropy scores" in Tables 5 and 6 may shed 
some light on the nature of Patient A's changes 
during psychotherapy. In Table 6, Patient A's 
entropy scores, based on samples drawn from all 
of his interviews, is higher than the entropy scores 
of Patient B, and of Schreber's writing. It is not 
clear why the percentage of words categorized in 
these samples is lower than those categorized 
throughout the interviews, as shown in Table 5. 
This may be an artifact of the particular segments 
of the interviews that were selected for categoriza- 
tion. The average entropy score of all the inter- 
views of Patient A in Table 5 is 5.6112. This is 
slightly lower than the entropy scores shown for 
Patient A in Table 6, but is still higher than the 
entropy scores for Patient B and for Schreber. 

If the higher entropy score for Patient A in 
Table 6 is a valid indicator of broader and more 
dispersed use of categories by Patient A as com- 
pared to Patient B and to Schreber, this must 
reflect the fact that Patient A throughout the in- 
terview samples selected was considerably more 
disorganized than either Patient B or Schreber in 


©The negative sign has been dropped in these 
Scores. 
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TABLE 5 


INFORMATION SCORES OF EARLY, Mipprr, 
AND LATE INTERVIEWS OF PATIENT A 


Information scores of Percentage of total® 


Interviews 


category profiles words categorized 
Early 1 5.7758 .420 
Early 2 5.6257 -401 
Early 3 5.5800 +389 
Early 4 5.6188 .406 
Middle 1 5.6874 .356 
Middle 2 5.7546 375 
Middle 3 5.6767 .388 
Middle 4 5.6642 .403 
Late 1 5.5762 +390 
Late 2 5.5742 +304 
Late 3 5.3250 375 
Late 4 5.7087 +373 
Late 5 5.3778 380 


* Percentage of total words categorized was obtained by divid- 
ing the number of categorizations in an interview, by an estimate 
of the total number of words in the interview, Total number of 
words in the interview was estimated by multiplying the number 
of typed lines in the transcript by 12, the average number of words 
in a line. 


TABLE 6 
INFORMATION SCORES OF A RANDOMLY SELECTED 
Bopy or SPEECH FROM INTERVIEWS OF 
PATENT A, oF PATIENT B, AND 
FROM THE WRITING OF SCHREBER 


Sor of i of total^ 
nat ETE 
Patient A 
Sample 1 5.7105 i 
.216 
Sample 2 5.7194 
Patient B 
Sample 1 5.4925 
.307 
Sample 2 5.5262 
ber 
Sample 1 5.5346 
.272 
Sample 2 5.4679 


* Percentage of total words categorized was obtained by divid- 
ing the total number of categorizations in Samples 1 and 2 of each 
patient, by an estimate of the total number of words used in the 
samples, Total number of words in the sample was estimated by 
multiplying the number of typed lines in the transcripts by 12, 
the average number of words in a line. 


his writing. Patient A in his early and middle 
interviews especially was speaking in a typically 
schizophrenically obscure fashion. Patient B, while 
paranoid in his ideation, spoke with intact lan- 
guage; and Schreber of course, while wildly delu- 
sional, presented his ideas in a well organized and 
logical fashion. The higher entropy scores would 
thus reflect less organization and structure in the 
patient’s use of categorizable words. 

That high entropy scores derived from the profiles 
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of categories do indeed mirror greater disorganiza- 
tion is somewhat supported by close examination 
of Table 5. The Mann-Whitney test applied to the 
ranked entropy scores, shows the entropy scores 
of the late interviews to be significantly smaller at 
the .05 level than the entropy scores of the early 
and middle interviews. In view of the available 
evidence that the last interviews occurred during 
a period of relative integration in Patient A, as 
compared to the early and middle interviews, a 
reasonable conclusion appears to be that psycho- 
logical integration is accompanied by greater 
structuring of the category profile, or by reduction 
of the diversity and dispersion of category choices 
in speech. 

This conclusion suggests, if the present patient 
is typical, that movement in the direction of re- 
covery from schizophrenia is attended by a kind 
of constriction of the language process. Actually 
this fits some of our intuitive notions of recovered 
schizophrenic patients. Very often such patients 
who impress one as imaginative in their psychotic 
states turn out to be rather limited individuals in 
their “emissions. Recovery, in a figurative sense, 
is at ths expense of broadness; the recovered pa- 
tient no longer has available a variety of possible 
verbal associations which were simple for him in 
his state of illness. 

Thus in applying the technique of analysis of 
contextual associates to interview material from 
the beginning, middle, and latter part of a year of 
treatment of a schizophrenic patient, the essential 
findings are that the patient’s language changes as 
his social behavior changes, and that the nature of 
this language change is in the direction of constric- 
tion and organization. It has also been demon- 
Strated that it may be possible to examine in detail 
how the patient’s speech about specific categories, 
and by implication, specific subject matters, 
changes over a period of time. 

The technique of analysis of contextual associ- 
ates is a complex and laborious one. However, the 
material which it undertakes to deal with, verbal 
behavior, is easily among the most complicated 


Y 


427 


behaviors that psychologists have attempted to 
study. Others who find the technique of interest, 
may wish to introduce their own variations into it, 
and there is little doubt that further research, in 
revealing the weaknesses and inadequacies of the 
method, will also point toward ways of stream- 
lining it and reducing its cumbersomeness. 


SUMMARY 


A technique of language analysis, called the 
analysis of contextual associates, is applied to 
transcripts of therapeutic interviews taken from a 
year of treatment of a schizophrenic patient. The 
analysis shows changes in the language of the pa- 
tient from a more psychotic period to a more 
socially integrated period. An examination of the 
average information, or entropy, of the patient’s 
speech in the separate stages of his social reintegra- 
tion reveals that the more socially integrated 
period is accompanied by a reduction in informa- 
tion in his speech. The study highlights some of 
the potential of the method of analysis of con- 
textual associates. d : 
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CRITIQUE AND NOTES 
SHIFTS IN EVALUATIONS OF PARTICIPANTS FOLLOWING INTERGROUP 
COMPETITION! 


WARNER WILSON? anp NORMAN MILLER? 
Northwestern University 


number of hypotheses concerning the effects 
A of interaction and competition between indi- 
viduals and groups can be found in recent litera- 
ture. Interaction between persons tends to increase 
their liking for one another (Bovard, 1951, p. 404; 
Festinger, 1951; Homans, 1950, p. 112; Sherif & 
Sherif, 1956, p. 293); the tendency for interaction 
to result in greater liking tends to be nullified or 
reversed if the interaction is accompanied by un- 
pleasant experiences such as having to associate 
with persons perceived as social inferiors (Festinger 
& Kelley, 1951), failing (Coch & French, 1948; 
French, 1944; Sherif & Sherif, 1956, p. 294), or 
engaging in intragroup competition (Deutsch, 
1953; Grossack, 1954); intergroup competition 
leads to increased ingroup cohesion (Sherif & 
Sherif, 1956, p. 306); intergroup competition leads 
to an unfavorable perception of the outgroup 
(Avigdor, 1952; Sherif & Sherif, 1956; Shrilke, 
1936); and a hostile perception of the outgroup is 
an inevitable concomitant of ingroup cohesion 
(Murdock, 1949, p. 83). While these general theo- 
retical statements are no doubt true of some groups 
under some circumstances, it seems likely that as 
more information is gathered, additions and quali- 
fications will be called for. 

The present paper represents one step toward 
specification of the conditions under which these 
statements apply. Intergroup competition was 
simulated by having two-man teams compete 
against two “stooges” on several tasks. Possible 
differences in competence were eliminated by 
manipulating winning and losing experimentally 
while using equivalent groups of subjects. Winning 
or losing was manipulated by instructing trained 
stooges to consistently win on all tasks against half 
the teams and to consistently lose against the 
other half. The subjects rated their own team 
member as well as the members of the opposing 
team (the stooges), on a list of traits. Ratings were 


1This research was supported in part by United 
States Public Health Project No. M-1544 under the 
direction of Donald T. Campbell. The authors are 
grateful for his encouragement and aid. 

The authors wish to appreciatively acknowledge 
suggestions made by Harold Guetzkow during the 
planning stages of this study. T 

2 Now at the University of Hawaii. 

3 Now at Yale University. 
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made both before and after intergroup competition 1 
The shift in favorability of a team member's before 
and after ratings of his teammate and his oppo 
nents was the dependent variable. T 


METHOD 


Subjects. The 60 subjects were male students al 
Northwestern University. Half were recruited fi 
fraternity houses; the other subjects were students 
introductory psychology. The subjects drawn fr 
introductory psychology received the equivalent of 
correct answer on their final exam by serving in 
experiment. Each two man team of subjects 
randomly assigned to a Win or Lose condition wii 
the restriction that there be 15 teams in each condi 
tion. ? 

Stooges. The stooges used to manipulate winning 
losing were students in advanced undergraduate psy- 
chology classes. They were paid for their time, Persons 
unmistakenly mature in physical appearance were not 
selected. In all, three different pairs of stooges were 
used. 

Procedure, Four persons at a time took part in 
experiment, two of whom were always stooges. Whi 
all four had arrived, they were ushered into the exp 
mental room, shown where to hang their coats, 
seated at a round table 5 feet in diameter. A number 
was placed before each subject to identify him. 
subjects were then given a rating form with 27 
sonality traits: anxious, bossy, capable, complaining, 
conceited, critical, dependent, masculine, greedy, 
gullible, hostile, influential, insincere, obsequi 
orderly, prying, scheming, secretive, intelligent, sting 
strict, stubborn, suspicious, touchy, unappreciati 
unscrupulous, good judge of personality. Subjects wer 
instructed on how to use a nine-point rating scale 
tate themselves and the other three persons on 
the 27 traits. The order in which persons were ral 
was randomly determined. All subjects were pro’ 
with a definition sheet containing synonyms and 
behavioral descriptions of all the traits. This definit 
sheet also differentiated between those traits mi 
commonly confused by college subjects in a pilot study 
Following the rating task, subjects were instructed 
follows: 

Now I want to divide you into two teams. P. 

— and — will form one team, and persons — 

— the other. I have several sets of problems 

two persons can work on at once. Team mem! 

must cooperate for efficient solutions. Both te 

will work on the same problem at the same t 

The team solving the problem first wins. The sp! 
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with which problems of this type are solved cor- 

relates highly with intelligence test performance, 

however, the problems are fairly simple for college 
persons and you should not find them difficult. 

They are representative of a variety of skills and 

we typically find that each team wins about half 

the time. Each time a team wins, that team will 
receive $1.00. As indicated, the problems require 
the participation of both members of each team. 

Therefore, feel free to converse as much as necessary 

with your teammate in coordinating your efforts 

at solution. 

While subjects were apparently assigned to teams 
at random, the two stooges were uniformly assigned 
to the same team. The first task employed a paste- 
board box approximately 18 inches in each dimension 
and webbed with strings producing a complex criss- 
cross pattern. A small cigarette box was placed in the 
bottom of the larger box. Each subject was given a 20 
inch length of stiff straight wire, 1$ inch in diameter, 
and instructed to help his partner lift the cigarette box 
through the strings and out of the larger box without 
touching it with hands or body. The second task re- 
quired the teammates to arrange four flat pieces of 
plastic into a rectangle. The third task was one of the 
so-called “water jar” problems which is essentially a 
simple mathematical puzzle. The fourth task was a 
crossword puzzle. 

The total period of interaction on the four tasks 
lasted 13 minutes. When stooges had been instructed 
to win, they guided their solution speed against the 
Progress of the crucial subjects thereby effectively 
minimizing any differences between the amount of 
time spent by Winning and Losing subjects on each 
of the four tasks. Differences in total solution time 
were controlled by manipulating the time allowed for 
- the last task, the crossword puzzle, so as to bring the 
total for every group to 13 minutes. 

Following competition, the experimenter read these 
additional instructions: 

One of the purposes of this experiment is to deter- 
mine whether your ability to judge personality 
improves after you have had some experience on 
simple tasks with the persons you are rating. Now 
you are to again rate the other three persons and 
yourselves on the 27 traits. This time you should 
be able to make your ratings more accurate than 
they were last time.* 

Treatment of the data. Since all 27 traits had an 
evaluative connotation, the differences between before 
and after ratings on all traits could be combined to 
obtain a composite difference score reflecting an al- 
gebraic sum of the shifts in the favorability of a sub- 
Ject’s ratings.5 The traits capable, masculine, in- 


“All subjects filled out an open-ended questionnaire 
and forced-choice questionnaire on reasons why they 
might have done better on the tasks than they actually 
. did. These data, as well as the self-rating data, are 
, omitted from the present report. Similarly, all ratings 

by the Stooges are ignored. 

5An analysis was done on each trait separately to 
determine the percentage of judgments which increased 
in favorability after interaction. Most shifts were in a 
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fluential, orderly, intelligent, and good judge of per- 
sonality, were assumed to be positive, all others to be 
negative. The ratings of the positive traits were re- 
versed, a rating of one was changed to nine, two to 
eight, etc., to give all upward shifts in ratings a common 
meaning. 

The individual composite difference scores of team 
members were averaged so that each time a team of 
subjects was tested, a single difference score for their 
team and a single difference score for the opponents 
would be obtained. 

The fact that the stooges and crucial subjects were 
recruited from different populations bears considera- 
tion. Shifts in ratings of ingroup and outgroup members 
may not be comparable given initial differences in 
these rated objects. Averaging the initial ratings so as 
to yield one score for their Ingroup and one for the 
Outgroup by each team of subjects, the differences 
between initial Ingroup and Outgroup ratings made by 
Winning and Losing teams yielded ¢ ratios of .54 and 
-38, However, the possibility still remains that in spite 
of initial random assignment to conditions, those sub- 
jects assigned to one subgroup might make initial 
judgments consistently higher or lower than those 
assigned to another. Subjects were therefore divided 
into four groups according to the source from which 
they were recruited and the condition to which they 
were assigned, Win or Lose, and the initial ratings of 
Ingroup members and Outgroup members were exam- 
ined by separate analyses of variance. The two analyses 
of variance yielded F ratios of 1.41 or less, p > .05. 
Thus, groups recruited from different sources, Winners 
and Losers, and ratings of Ingroup and Outgroup mem- 
bers were considered equivalent in terms of initia] 
judgments. 


RrsuLTS 


Table 1 shows the mean magnitude of the four 
composite difference scores by subgroup and indi- 
cates the significance of the change. In all four 
instances, regardless of the team's experimental 
treatment or the object rated by the team, there 
is a significant shift toward a more favorable evalu- 
ation following interaction. 

Table 1 also suggests that some before-after 
shifts are larger than others. The results of a 
2 X 2 trend analysis of the composite difference 
scores of teams are presented in Table 2. Since the 
order in which teammates and opponents were 
rated was randomly varied, the trend analysis does 


positive direction. While some traits showed more 
change than others, the percentage of shifts on most 
of the traits is insignificant. The only significant shifts 
in the negative direction occurred when the winners 
rated the losers on the following traits: capable, de- 
pendent, masculine, greedy, and intelligent. While the 
traits “capable” and “intelligent” seem to be task 
relevant traits, there does not seem to be any adequate 
basis for interpreting all five of these traits as defining 
a separate factor; The major analyses are based instead 
on composite scores which lump all the traits together. 
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TABLE 1 
MEAN SHIFTS IN FAVORABILITY 
Experimental | Opjects rated | Men | , ? 

S,T«O? Teammates | 13.73 | 3.94 | <.01 

Opponents 5.35 | 3.33 | <.01 
STI, a Teammates | 3.57 | 2.15 | <.05 

Opponents 7.62 | 3.56 | <.01 


^S = subject, T = teammate, O = opponent, w = won, | = 
lost. 


TABLE 2 
ANALYSIS OF VARIANCE OF SHIFTS IN 
FAVORABILITY 
Source df MS F 

Between treatments (W-L) 1| 234.03 |2.33 
Between subjects* in same group | 28 | 100.40 
"Total between subjects 29 
Between trials (T-O) 1| 70.41 |1.37 
Interaction: (W-L) X (T-O) 1| 579.71 |8.87* 
Pooled subjects X Trials 28 | 65.36 
Total within subjects 30 


* “Subjects” actually refers to “teams”; each team’s data for 
T and O was pooled to create a single score for each. 
*p «90. Es a 


not evaluate a trial or order effect in the objects 
rated. It merely handles the fact that the ratings 
of teammate and opponent are not independent, 
However, as seen in Table 2, neither main effect is 
significant; only the interaction between the ex- 
perimental treatment (Win or Lose) and object 
rated (Teammates or Opponents) is significant 
(F = 8.87, p < .01).6 As seen by inspection of 
Table 1, the significant interaction reflects the 
opposite effects of the experimental treatment on 
magnitude of shift in the ratings. When the crucial 
team won and other team lost, there were large 
positive shifts in ratings of teammates but small 
positive shifts in ratings of opponents. However, 
when the crucial team lost and the other team won, 
ratings of teammates showed small positive shifts 
While ratings of opponents showed somewhat 
larger positive shifts. While there is a suggestion 
that the experimental treatment had a greater 
effect on ratings of teammates as opposed to op- 


Since Bartlett’s test indicated the composite 
difference scores did not meet the homogeneity of 
variance assumption (x? = 11.37, .01 < P < .02) the 
trend analysis was repeated on transformed Scores. 
The transformation X — 4/X +VX+1 yielded 
homogeneous variances (x2 = 1.20, > .25). The 
trend analysis on the transformed scores paralleled 
the analysis of Table 2, yielding a significant inter- 
cation but no main effects. 
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ponents, this difference does not reach significance 
(t = 1.68, p > .10). 


Discussion 


Though the design confounds the subject’s ex- 
perience with that of his teammate and his oppo- 
nents, it mirrors dyadic competition as it is typi- 
cally found in the natural environment. That one’s 
own team wins when the opponents lose, and that 
whenever one wins one’s teammate does too, are 
the rules rather than exceptions of competitive | 
activity. Within these limits, the results suggest 
that the competence of the object rated (Team- 
mate or Opposifig team), as reflected by the per- 
formance of that team on the experimental tasks, 
is the major factor accounting for the differential 
increase in the favorability ratings made by the 
two experimental groups. 1 

In addition, the results speak on the several 
hypotheses initially considered. It is quite apparent 
from Table 1 that the first hypothesis, predicting 
an increase in liking as a function of interaction, 
receives support, provided one is willing to define 

n "increase in liking" by the dependent measure A 
herein employed. Hypothesis 2, which is essentially f 
a qualification of the first hypothesis, states that 
the tendency for interaction to result in greater 
liking tends to be nullified if the interaction is 
accompanied by unpleasant experiences such as 
losing. While this hypothesis would receive support 
if only the ratings of teammates were inspected, | 
there is no main effect in the direction of lower 
ratings by the subjects who lost. However, it may 
reasonably be claimed that the interaction was not 
sufficiently long nor the unpleasantness of losing 
sufficiently intense to warrant an expectation of . 
such results in the present instance. 

If the relative increase in the favorability of 
ratings of teammate versus opposing team is used 
as an index of increased group cohesion, the third i 
hypothesis is also'in need of qualification. Ratings 
of teammates did not increase more than ratings ii 
of opponents. The related hypothesis: that inter- (T 
group competition leads to an unfavorable percep- Af 
tion of the outgroup (Opponents) also fails to find {1 
support in the present instance. Similarly, pooling 11. 
all subjects, the correlation of .17 between the jJ. 
increase in favorability of teammate and opponent 
ratings offers no support for the notion that in- 
group cohesion presupposes a negative perception 
of the outgroup; indeed, the correlation is in the Y 
wrong direction. 


| 
| 


SUMMARY 

Intergroup competition was simulated by having || 
two-man teams compete against two stooges A if 
several tasks. Winning and losing was manipulated | 
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by having the stooges always win against half the . 


teams and always lose against the other half. The 
difference in favorability of a team’s before and 
after ratings of the other participants on 27 per- 
sonality traits was the dependent variable. 

A significant interaction was found between the 
experimental treatment (Win or Lose) and the 
object rated (Teammate or Opponent). The rele- 
vance of the findings to five widely held hypotheses 
concerning the effects of interaction and competi- 
tion on individuals and groups was considered. 
Only the hypothesis that “liking” is increased by 
interaction received unqualified support. 
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THE EFFECTS OF THORAZINE ON LEARNING AND RETENTION IN 
: SCHIZOPHRENIC PATIENTS! 


NORRIS D. VESTRE? 
University of Minnesota 


ESPITE the enormously expanding use of 
thorazine and related drugs, little effort has 

been directed towards investigating the effects of 
such agents on basic psychological processes like 
learning, retention, judgment, and abstract reason- 
ing. The claim is often made that these functions 
remain intact or are improved in psychiatric pa- 
tients as a result of thorazine treatment (Lehmann 
& Hanrahan, 1954). These claims have, as a rule, 
been based on clinical impressions and have led 
those making the statement to the further claim 
that the drugs facilitate the process of psycho- 
therapy. If the psychiatric literature can be taken 
as an indication, it seems that one of the major 
arguments for the use of the so-called “tranquil- 
izers" is that they facilitate psychotherapy (Ford 
& Jameson, 1955; Kinross-Wright, 1955; Winkel- 
man, 1954). Lesse (1956) has expressed concern 
regarding the paucity of specific research in this 
area. As Eysenck (1957) suggested, our relative 
lack of knowledge in this area may, perhaps, be due 
to the weak rationales for all of the somatic treat- 
ment procedures. Two recent efforts have been 
made, however, to provide a theoretical basis to 
account for the behavioral changes resulting from 
these procedures (Eysenck, 1957; Heistad, 1957). 
A number of investigators have been concerned 
with evaluating the effects of thorazine on intelli- 
gence and psychomotor efficiency, Gilgash (1957), 
Kovitz, Carter, and Addison (1955), and Petrie 
and LeBeau (1956) reported improved Wechsler- 
Bellevue test performance in psychotic patients 
after a period of thorazine treatment in dosages 
from 75 to 400 mg. per day. Gibbs, Wilkens, and 
Lauterbach (1957), using two drug dosage levels 
(75-125 and 150-450 mg. per day), found their 
placebo group to demonstrate greater pre- to post- 
test gains in Wechsler-Bellevue IQ than the two 
drug groups. Tourlentes, Hunsicker, and Hurd 
(1958) found thorazine (200 mg. per day) to have 
no significant effect on verbal or nonverbal intelli- 
gence, verbal fluency, suggestibility, motor speed, 
or perceptual recognition in schizophrenic patients. 


1 This report is based on a dissertation submitted 
to the Graduate School, University of Minnesota, in 
partial fulfillment of the requirements for the degree of 
Doctor of Philosophy. 

The author is grateful to his advisor, William 
Schofield, for his guidance and encouragement. 

* Now at Veterans Administration Hospital, St. 
Cloud, Minnesota. 
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Shatin, Rockmore, and Funk (1956) reported that 
thorazine did not interfere with, and in some cases 
increased, performance on psychomotor tests 
(tracing, dotting, tapping) and on tests of rote 
memory, digit symbol substitution, and word 
fluency. Whitehead and Thune (1958) found thora- 
zine in doses of 200 or 300 mg. to have little effect 
on several functions from simple motor learning to 
problem solving. Porteus (1957) and Porteus and 
Barclay (1957) reported significant decrements in ` 
maze performance, which they attributed to the 
drug. 

Most of these studies have been concerned with 
the effects of thorazine on intelligence test scores ^ 
or psychomotor and psychological efficiency tests. 
In the present investigation, the selection of the 
task was dictated by the intention to find a pro- ^ 
cedure that might be somewhat more pertinent to 
the question of the effect of thorazine on the learn- 
ing or relearning which occurs during psycho- 
therapy. 

This study investigated the effects of one of the 
tranquilizing drugs (thorazine) on attempts to 
alter, by selective reinforcement, preferred re- 
sponses to stimulus words in schizophrenic | 
patients. 


METHOD 

Subjects 

Forty schizophrenics, most of whom were hospi- 
talized less than 6 months, were less than 45 years of 
age, and were average or above in intelligence were 
chosen for the study. Nine subjects were dropped for 
one of a number of reasons and were replaced by other 
subjects who met these criteria. The mean age of this 
group was 35.8 years; the mean length of hospitaliza- 
tion was 4.3 months, and the mean Shipley-Hartford 
IQ was 1064. — 


Procedure 


The experimental procedures involved the ad- 
ministration of a free association test, à specially 
designed learning task, and a test for retention O 
learning. > 

Free association situation. Thirty-six words aa | 
trarily chosen from the Kent-Rosanoff list were aC 
ministered in the usual manner. The subject’s spon- 
taneous association to each stimulus word was recorde 

Learning situation. The learning materials, whi 3 
were a multiple-choice form of the association ke 
nique, were presented on 4" X 6" index cards. m 
stimulus word was typed in capital letters near 
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top of the card and three response alternatives were 
typed in small letters on a single line below the stimulus 
word. The stimulus words were the same 36 which 
were used in the free association situation, The response 
alternatives for each stimulus word included the sub- 
ject’s preferred (free association) response, which, of 
course, varied for each subject, and two low frequency 
(1%) responses selected from the Russell-Jenkins 
1954 tables. The subject’s preferred response appeared 
with equal frequency in each of the three positions— 
left, middle, and right—on the cards. 

During the learning trials the experimenter said, 
“Right,” following responses the subject was to 
acquire; he said, “Wrong,” for any other choice. The 
response selected by the experimenter as the “correct 
response” was randomized for each stimulus word and 
for each subject and was also controlled for position. 
For 12 of the 36 items, the subject’s preferred response 
was called “correct”; for the remaining 24 items, one 
of the two low frequency alternatives was called 
“correct.” After each trial, the experimenter shuffled 
the deck of cards so that the order of presentation was 
changed for each trial. Once a card was responded to 
correctly on three successive trials, it was removed 
from the deck. Learning trials continued until 18 out 
of 24 “new” associations were learned to a criterion of 
three successive correct responses each, 

Retention test. Retention was tested by presenting 
the learning materials to the subject and having him 
try to recall the correct responses. 


Treatment of Experimental Groups 


The entire program was 4 weeks in length for each 
subject. All subjects were placed on placebos for 1 week 
Prior to the free association test in order to diminish 
the effects of previous drugs. Following the association 
test, subjects were randomly assigned to one of four 
Broups. In the following description of these groups, 
the first letter refers to the conditions (thorazine or 
placebo) during the 1-week period between the asso- 
ciation test and the learning trials; the second refers 
to the conditions during the 2-week period between 
the learning trials and the retention test. The four 
Eroups were as follows: 

TT Group. These subjects learned and were tested 
for retention under thorazine conditions. 

TP Group. These subjects learned under thorazine, 
but they were tested for retention under placebo con- 
ditions, 

PP Group. These subjects learned and were tested 
for retention under placebo conditions. 

PT Group. These subjects learned under placebo 
Conditions, but they were tested for retention under 
thorazine conditions. 

At the time of the learning trials and the retention 
test, 20% of the subjects were on 100 mg. per day of 
thorazine, 17% were on 200 mg., 7% were on 300 mg., 

3% were on 400 mg., and 3% were on 800 mg. The 
Placebos were identical in appearance to thorazine 
4 tablets, Although the ward staff was aware of the 
a = each patient was receiving, the experimenter was 

ot, 
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TABLE 1 
MEAN NuMBER oF Items RECALLED FOR 
THORAZINE AND PLACEBO LEARNING 
GROUPS 


PP and PT | TT and TP 

W = 20) | W = 20) 

Items used — j t ? 
M |Vai 


Vari- 
ance | M 


ance 


All 24 “New” Associa- 
tions 
18 Associations learned 


15. 70}14.85 |11.25 |12.30 | 3.82 | .001 


13.15] 9.19 | 9.05 | 7.21 | 4.53 | .001 


RxsuLTS 

The effects of thorazine on learning were evalu- 
ated by comparing the performance of the two 
groups having their learning trials under thorazine 
treatment (TT and TP groups) with the perform- 
ance of the two groups having their learning trials 
under placebo conditions (PP and PT groups). Tho- 
razine, in the dosages used, was found to have no 
significant effect on attempts to alter preferred re- 
sponses to stimulus words. Although the placebo 
learning groups required fewer trials to reach cri- 
terion, the / value obtained in testing the between- 
group difference did not reach an acceptable level 
of significance (p = .18). 

The data for the retention phase were examined 
in several ways. Recall differences between the PP 
and PT groups and between the TT and TP groups 
were studied. Also, the recall of the combined TT 
and PP groups was compared with that of the 
combined TP and PT groups. All / tests of the 
relevant between-group differences were within 
chance expectancies. 

A comparison was made of the effect of thora- 
zine on retention regardless of the learning condi- 
tion, that is, groups TT and PT vs. PP and TP, 
The administration of thorazine during the period 
from the learning trials to the retention test did not 
interfere with the recall of the previously learned 
material. 

When the data were analyzed in such a way’ 
that the two groups learning under thorazine (TT 
and TP) and the two groups learning under 
placebos (PP and PT) were compared with respect 
to retention, some important findings were revealed. 
As indicated in Table 1, the two groups having 
learning trials under placebos recalled, at the time 
of the retention test, significantly more items than 
the thorazine learning groups. As reported in the 
results of the learning trials, the thorazine learn- 
ing groups took slightly, but not significantly, 
more trials to reach the learning criterion. Even 
though this difference was not significant, it is 
possible that the observed differences in retention 
partially reflect these slight differences in learning. 


434 


For this reason, it was thought that the findings 
would be more interpretable if it could be demon- 
strated that subjects, matched on the number of 
trials required to learn to criterion, differed with 
respect to the number of items recalled on the re- 
tention test. Of the 20 subjects in each learning 
group (thorazine or placebo), 13 could be matched 
on the number of trials required to reach criterion. 
These 26 matched subjects, 13 from each learning 
condition, were then compared with respect to re- 
call of the “new” associations learned. This opera- 
tion served to reduce only slightly the significance 
of the differences on retention. The two groups 
having learning trials under placebos still recalled 
significantly (.01) more of the 24 “new” associa- 
tions presented and also significantly (.001) more 
of the 18 (of the 24) “new” associations learned to 
criterion. 


Discussion 


The learning task required that the subject 
reject or unlearn a strong response, defined as his 
preferred response, to a stimulus word, learning a 
new response in its place. Thorazine was found to 
have rather complicated effects on learning and 
retention in this task. 

Inferior performance in learning might have 
been expected for the thorazine group if this drug 
has a general depressant effect on the learning 
process. The drug might, however, also bring about 
a reduction in negative transfer from previously 
learned associations which could counteract the 
depressant effect on learning. In this case, one 
might have expected the thorazine learning group 
to be superior on the learning of the 24 “new” asso- 
ciations and perhaps inferior on the 12 “old” asso- 
ciations, At least, one would expect the drug and 
placebo learning groups to perform differently, 
relative to each other, on these two types of items. 
Such was not the case in the present study. 

The comparison of efficiency of recall under the 
same drug condition as prevailed during learning 
as opposed to recall under a different drug condi- 
tion was, in effect, a test, in the area of human 
verbal learning, of the theoretical propositions of 
Heistad (1957) about the effects of changes in the 
internal environment on response strength. The 
data did not support these hypotheses; the relevant 
between-group differences were statistically insig- 
nificant. 

This study found retention of learning to be 
impaired if the learning occurred under thorazine 
conditions. This finding is of interest because of the 
possible implications for psychiatric treatment pro- 
cedures. It has come to be a general practice to 
engage in psychotherapy with patients who are 
also being treated with chlorpromazine or some, 
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similar chemical agent. If the learning (or relearn- 
ing) which is accomplished during a course of 
psychotherapy by patients who are being treated 
concurrently by drugs is less apt to be retained 
than if these patients were not being treated con- 
currently by chemical means, then one may cer- 
tainly wonder about the efficacy of this general 
practice in producing any durable benefits from 
psychological treatment. 


SuMMARY 


"The major concern of this study was to evaluate 
the effects of thorazine on learning and retention. 
There is no evidence that the drug, in the dosages 
administered, has any significant effect on attempts 
to alter the preferred responses to stimulus words 
of schizophrenic patients. Retention was not en- 
hanced if the retention test was given under the 
same drug condition as prevailed during learning. 
Thorazine, given during the period from the learn- 
ing trials to the test for retention, did not interfere 
with performance on retention. Retention was im- 
paired, however, if the learning was accomplished 
under thorazine conditions. Stated in observa- 
tional terms, learning accomplished under drug 
conditions was not retained as well as learning 
accomplished under placebo conditions. 


REFERENCES 


EvsENCE, H. J. Drugs and personality: I. Theory and 
methodology. J. ment. Sci., 1957, 16, 119-131. _ 

Forp, H., & Jameson, G. K. Chlorpromazine in 
conjunction with other psychiatric therapies: A 
clinical appraisal. Dis. nerv. Syst., 1955, 16, 179- 
185. 

Grass, J. J., WILKENS, B., & Lavrersacs, C. G. A 
controlled clinical psychiatric study of chlor- 
promazine. J. clin. exp. Psychopath., 1957, 18, 
269-283. 

Garcasn, C. A. Effects of thorazine on Wechsler scores 
of adult catatonic schizophrenics. Psychol. Reps 
1957, 3, 561-564. 

Heistap, G. T. A bio-psychological approach to 
somatic treatments in psychiatry. Amer. J. Psy- 
chiat., 1957, 114, 540—545. 

Kiwnoss-WarcHT, V. Chlorpromazine treatment of 
mental disorders. Amer. J. Psychiat., 1955, 141, 
907-912. 

Kovrzz, B., Carter, J. T., & Appison, W. P. A com- 
parison of chlorpromazine and reserpine in chronic 
psychosis. Arch. Neurol. Psychiat., 1955, T4, 467- 
471. ; 

LEHMANN, H, E., & HANRAHAN, G. E. Chlorpromazine. 
Arch. Neurol. Psychiat., 1954, T1, 227-237. T 

Lesse, S. Psychotherapy and ataraxics: Some T 
servations on combined psychotherapy 20 
chlorpromazine therapy. Amer. J. Psychother 
1956, 10, 448—459. : 

PETRE, A., & LeBrav, J. Psychologic changes E 
man after chlorpromazine and certain types ? 


CRITIQUE AND NOTES 435 


brain surgery. J. clin. exp. Psychopath., 1956, 17, 
170-179. 

PonTEUS, S. D. Maze reactions after chlorpromazine. 
J. consult. Psychol., 1957, 21, 15-21. 

PonrEUS, S. D., & Barctay, J. E. A further note on 
chlorpromazine. J. consult. Psychol., 1957, 21, 
297-299. 

Russert, W. A., & Jrwxiws, J. J. The complete 
Minnesota norms for responses to 100 words from 
the Kent-Rosanoff Word Association Test. Of. 
Naval Res. tech. Rep., 1954, No. 11, (Contract No. 
N8) 

Suatiy, L., ROCKMORE, L., & Funk, I. C. Response of 
psychiatric patients to massive doses of thorazine: 


II. Psychological test performance and compara- 
tive drug evaluation. Psychiat. Quarti., 1956, 30, 
402-416. — 

TounrzNrEs, T. T., HuNsickzn, A. L., & Hun», D. E. 
Chlorpromazine and communication processes. 
Arch. Neurol. Psychiat., 1958, 79, 468-473. 

WnurrEgEAD, W. A., & Tuone, L. E. The effects of 
chlorpromazine on learning in chronic psychotics. 
J. consult. Psychol., 1958, 22, 379-383. 

WiNKELMAN, N. W. Chlorpromazine in the treatment 
of neuropsychiatric disorders. J. Amer. Med. Ass., 
1954, 155, 18-21. 


(Received August 16, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 68, No. 2, 436-437 


GROUP SIZE, PRIOR EXPERIENCE, AND CONFORMITY! 


LEON A. ROSENBERG? 
Purdue University 


‘HE present study was a re-examination of 

the relationship between group size and 
conformity. Asch (1952) had reported a curvi- 
linear relationship between group size and con- 
formity; a finding which neither Goldberg (1954) 
nor Kidd (1958) could replicate. These latter two 
studies, however, differed greatly from that of 
Asch. Kidd, unlike Asch, attempted to move his 
subjects’ judgments toward a more correct esti- 
mate. A similar problem existed in Goldberg’s re- 
search where the experimental stimuli were much 
too ambiguous for the study to be comparable to 
Asch’s. Since the subjects were not sure of objec- 
tive reality, how could they have perceived any 
discrepancy between their original estimates and 
the supposed partner estimates? The experimental 
procedure employed by Asch, on the other hand, 
presented the subject with a clearly perceived con- 
flict. The influence source’s estimates (the judg- 
ments made by the subject’s supposed partners) 
were clearly in error. The present study was there- 
fore planned again to investigate the effect of the 
group size variable, using procedures, however, 
that are more definitely analogous to Asch’s pro- 
cedure, 

A second purpose of the present study was to 
investigate the effects on conformity of combining 
increasing group size with two prior experience 
variables. Prior experience refers here to a combina- 
tion of high partner prestige and failure experience, 
These two variables have each been found to be 
related to conformity, and have also been found to 
be additive in their effects (Mausner, 1953, 1954a, 
1954b; Mausner & Bloch, 1957). 


METHOD 


The subjects were 227 males enrolled in an intro- 
ductory psychology course. The subjects were required 
to judge the lengths of a series of lines. The apparatus 
enabled the subject to choose one of 14 possible 
“lengths” and signal his choice by pressing a momen- 


1 Part of a dissertation submitted for the PhD degree. 
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tary contact switch. By the use of lights controlled by 
the experimenter’s master panel, each subject could be 
told that it was his turn to respond and, in addition, 
he could be told that his choice was "right" or “wrong,” 
The construction and operation of the apparatus was 
the same as that described by Crutchfield (1955). The 
essential point is that the experimenter was able to 
signal the same information to all subjects at the same 
time. 

The experimental procedure required subjects to 
judge the lengths of a series of lines twice. The first 
judgment (Ai) was made before the influence attempt 
was introduced. The second judgment (As) was a re- 
estimation after subjects had observed others making 
their choices. Receiving information as to how other | 
members of the group perceived the lengths of the 
lines constituted the influence attempt. The apparatus 
was constructed in such a way that the information that 
each subject received was determined by the experi- 
menter. Subjects were led to believe, however, that 
they were actually being informed of estimates made by. 
the other people in the room. The "false" information 
was designed so as to be easily perceived as being in- 
correct. The influence attempt involved a substantial 
overestimation of the lengths of the lines by the in- 
fluence source (i.e., the “partners’ ” estimates). 

One group served as a control; they made the A: and 
Az judgments without receiving any influence informa- 
tion (no prior experience-Control, or NPEC). Four 
other groups underwent the influence treatment. The 
group sizes were 2, 3, 4, and 5 (NPE-2, -3, -4, and -5). 
Hence, the influence source in these four groups in- 
volved either 1, 2, 3, or 4 partners. An additional five 
groups underwent the same procedures with one excep- 
tion. These groups received the prior experience treat- 
ment before making the A; judgments (PEC, PE-2, -3, 
-4, and -5). On a similar task, they were informed that 
they had done poorly and that the others in the group — 
were much more accurate in their estimates. 


CONFORMITY 


2 3 
GROUP SIZE $ 
Fic. 1. Mean conformity scores for the eight expert- 
mental groups. 
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TABLE 1 
SUMMARY OF THE ANALysIs OF VARIANCE 


Source of Variance df MS F 
PE-NPE 1 | 150.15 9.66** 
GS 3 62.12 3.99** 
linear 1 57.79 3.40* 
quadratic 1 17,31 d. oos. 
cubic 1 16.53 1.06 
(PE-NPE) X GS 3 14.43 .30 
Residual 152 15.54 
*.05 < p < 10. 
** p « .01, 
RrsuLIS 


The conformity score for each subject involved 
computing As-A; for each trial in which an influ- 
ence attempt had been made, The mean conform- 
ity scores for each experimental group are pre- 
sented in Figure 1. Conformity definitely occurred 
in all groups (NPEC and NPE-2 were significantly 
different from each other; £ = 2.04; p < .05). 
There were also definite differences between the 
experimental groups themselves. A summary of 
the analysis of variance is presented in Table 1. 
The analysis indicates that, for all levels of group 
size, the prior experience treatment significantly 
increased the degree of conformity obtained. 
Group size was also found to have a curvilinear 
telationship to conformity (the quadratic compo- 
nent was significant at p < .01). The change from 
a group size of 4 (GS-4) to GS-5 represented a 
decrease in conformity rather than a leveling-off 
effect (¢ = 2.02; df = 152; p < .025). 

For the formula, Conformity = AA, to be 
used, the A; estimates of all the groups had to be 
approximately equal. This prerequisite was met. 
The mean conformity scores of the two control 
groups were found to be equal, a finding that indi- 
cated that the prior experience treatments in them- 
selves had no definite effect on the re-estimations. 

Interview data indicated that 86% of the total 
sample were aware of the discrepancy between 
their perception of the lines and the influence 
Source’s estimates. The data also indicated that 
only 14% of the sample rejected the validity of the 
group situation. There was no relationship between 
this latter attitude and the drop in conformity 
from GS-4 to GS-5. 
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Discussion 


A major finding of this research was the curvi- 
linear relationship between group size and con- 
formity. This constituted a successful replication 
of Asch's (1952) finding. The difference between 
the present results and the data reported by Kidd 
(1958) and Goldberg (1954) is apparently due to a 
difference in method. The curvilinear relationship 
between group size and conformity appears when 
one is careful to duplicate an essential aspect of 
Asch's procedure, namely, the use of partner esti- 
mates that are clearly incorrect. 

The results also indicated that the prior experi- 
ence and group size variables were additive at all 
levels of group size. That is, their combination at 
any particular level produced more conformity 
than that elicited by that level of group size alone. 
The degree of additivity was found to be approxi- 
mately the same for all levels of group size. The 
data, therefore, indicates that a fairly uniform 
relationship exists between these two variables. 


REFERENCES 


Ascn, S. E. Some forms of interpersonal influence. In 
G. E. Swanson, T. M. Newcomb, & E. L. Hartley 
(Eds.), Readings im social psychology. New York: 
Holt, 1952. 

CnurcHrIELD, R. S. Conformity and character. Amer. 
Psychologist, 1955, 10, 191-198, 

GorpBERG, S. C. Three situational determinants of 
conformity to social norms. J. abnorm. soc. Psychol., 
1954, 49, 325-329. 

Kmp, J. S. Social influence phenomena in a task- 
oriented group situation. J. abnorm. soc. Psychol., 
1958, 56, 13-17. 

MAUSNER, B. Studies in social interaction: III. Effect 
of variation in one partner's prestige on the inter- 
action of observer pairs. J. appl. Psychol., 1953, 
37, 391-393. 

MAUSNER, B. The effect of one partner's success in a 
relevant task on the interaction of observer pairs. 
J. abnorm. soc. Psychol., 1954, 49, 557—560. (a) 

MAUSNER, B. The effect of prior reinforcement on the 
interaction of observer pairs. J. abnorm. soc. 
Psychol., 1954, 49, 65-68, (b) 

MAUSNER, B., & Brocn, B. L. A study of the additivity 
of variables affecting social interaction. J. abnorm, 
soc. Psychol., 1957, 64, 250-256. 


(Received August 22, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 63, No. 2, 438-439 


A NOTE ON THE SCORING OF EGO DISJUNCTION 


FRED HEILIZER! 
Veterans Administration Hospital, Northampton, Massachusetts 


recent paper by Trehub (1959) introduced 

the concept of ego disjunction and presented 
an investigation of the relationship between ego 
disjunction and severity of psychopathology. The 
concept of ego disjunction, derived from a double 
approach-avoidance conflict model (Dollard & 
Miller, 1950), assumed that parity or equivalent 
magnitudes of antagonistic need strengths are not 
necessary within a definition of ego disjunction. 
The investigation of the relationship between ego 
disjunction and severity of psychopathology in- 
volved the selection, on rational grounds, of four 
antagonistic pairs of needs (aggression-deference, 
succorance-nurturance, autonomy-abasement, or- 
der-change) from the Edwards (1954) Personal 
Preference Schedule for college students, adoles- 
cents, neurotics, character disorders, and schizo- 
phrenics. The ego disjunction score for each sub- 
ject was computed by summing the T scale scores 
of the two needs which comprise an incompatible 
pair of needs, subtracting 100 (i.e., the sum of the 
two T scale means) from the sum, and summing 
only the positive residuals to produce an overall 
ego disjunction score. 

Two points in this description deserve emphasis: 
The scoring is arbitrary (as indicated by Trehub, 
1959) and neither derives from nor reflects the 
double approach-avoidance model, and the scoring 
procedure produces a confounding of average differ- 
ences and variability differences among groups. 
For example, if the scores of two samples are 
exactly equal, each corresponding to a T scale dis- 
tribution for M = 20, the mean of each group 
equals 50.0, and the average positive residual of 
each group equals 4.0. If, however, we increase 
the variability of one group by multiplying the 
deviation scores by two, the average positive 
residual of this group now equals 8.0 although the 
Taw score mean of 50.0 is retained. 

The purpose of this note is to present a reanal- 
ysis of Trehub’s (1959) data to permit a clearer 
evaluation of his theoretical contribution. The data 
were constituted for each subject by adding the 
scores for the two needs of a need pair, thus provid- 
ing four scores, and summing to yield an overall 
ego disjunction score. An alternative procedure 
would have been to equate the variance of all 
groups on each need pair according to the average 


1I would like to thank Arnold Trehub for his close 
cooperation and considerable assistance in the prep- 
aration of this paper. 


variance for each need pair and then apply the 
method of positive residuals, Such a procedure 
would correspond more closely to Trehub’s original 
procedure, which, in turn, derives from the expecta- 
tion that only mutually incompatible needs with a 
relatively high level of joint strength indicate ego 
disjunction. The full scale T scores were used here, 
however, because the alternative procedure is 
extremely tedious, the dividing point for “rela- 
tively high level of joint strength” is arbitrary, and 
the issue is not at test. Full scale T' score data were 
not available for the adolescent subjects. Thus 
Trehub’s positive residual data and our full scale T 
score data were computed for college students, 
neurotics, character disorders, and schizophrenics. 
The mean ego disjunction score per need pair for 
each of the four groups is presented in Table 1. The 
rank order of the groups is consistent with predic- 
tion according to either method. 


TABLE 1 
Mean Eco Disyuncrion Scores (per Need Pair) FOR 
COLLEGE STUDENTS, NEUROTICS, CHARACTER 
DISORDERS, AND SCHIZOPHRENICS 


Positive Full scale T 

residuals scores 
College students 5.01 100.56 
Neurotics 7.37 103.80 
Character disorders 8.65 105.77 
Schizophrenics 11.09 107.27 


The analyses of variance are presented in Table 
2. A Type I analysis (Lindquist, 1953) was done 
for each set of data, utilizing the four groups as @ 
between-subjects variable and the four need pairs 
as within-subjects variable. The use of full scale T 
scores confirms the statistically significant main 
effect among groups which was reported by Trehub 
(1959), although at a higher probability level. The 
analyses of the need pairs main effect and the 
Groups X Need Pairs interaction do not yield 
Statistically significant differences by either 
method, Thus we can conclude that the prediction 
of-a positive relationship between general ego dis- 
junction and degree of psychopathology is CoD 
firmed when the confounding effect of variability 
differences is removed. There is, again, no evident 
that the groups could be characterized with respec 
to particular areas (need pairs) of ego disjunction: 

A final analysis utilized the Hartley sequentia 
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TABLE 2 


ANALYSES OF VARIANCE OF Eco DISJUNCTION 
Scores 


Full scale T scores | Positive 
Source 

a | us | F | MS | F 
Between subjects 19 
Groups 3 | 672.90 | 5.60°| 513.79 | 9.81*° 
Error 76 | 120.19 52.39 
Within subjects 240 
Need pairs 3 | 26.23) — | 143.39 | 1.46 
Groups" need pairs 9 | 312.71 | 1.48 | 102.87 | 1.05 
Error 208 | 211.35 98.22 


Significant at .01 level. 
** Significant at .001 level. 


range test (Hartley, 1955; Snedecor, 1956) to 
evaluate the differences among the four groups 
according to each scoring method. The analysis of 
the positive residual data confirmed Trehub's 
(1959) analysis (omitting the adolescent group) in 
that all comparisons were statistically significant 
(P = .05, two-tailed) except the comparison of the 
neurotics with the character disorders. The anal- 
ysis of the full scale T score data, however, ren- 
dered only a partial confirmation of the previous 
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analysis in that the college students were reliably 
discriminable from the schizophrenics and the 
Character disorders with no other statistically 
significant comparisons being obtained. Thus we 
can conclude that intergroup comparisons yield 
only a partial categorization of college students as 
distinguished from character disorders and schizo- 
phrenics when the confounding effect of variability 
differences is removed. 
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ARE DIFFERENCES IN SCHIZOPHRENIC SYMPTOMS RELATED 
TO THE MOTHER’S AVOWED ATTITUDES TOWARD CHILD 
REARING? 


WILSON H. 


GUERTIN 


University of Florida 


‘HE existence of a relationship between symp- 

toms and childhood experiences or back- 
ground is a fundamental assumption of most 
theories of psychopathology. Those working with 
schizophrenics look for and report unusual en- 
vironmental or parental features appearing in the 
backgrounds of their patients. This search for a 
relationship between background and schizo- 
phrenia presupposes a uniformity in schizophrenic 
symptomatology which clearly does not exist 
(Rabin & King, 1958, p. 227). Other investigators 
try to cross-validate relationships proposed earlier, 
but sample schizophrenics with different symptom 
pictures and conclude that no such relationships 
exist. 

Despite many negative studies, the large num- 
ber of reported differences between the mothers of 
Schizophrenics and those of normals (Rabin & 
King, 1958, pp. 260-261) must be explained. The 
differences are either evidence for the existence of 
Such a class of relationships or the result of serious 
methodological errors. Differences obtained be- 
tween groups of mothers in most studies cannot be 
attributed conclusively to the psychopathological 
differences between the schizophrenic and normal 
groups since other nonschizophrenic differences 
related to hospitalization and failure in life may be 
operating (Scott, 1958). This is especially true for 
obtained differences between two groups of 
mothers’ responses to retrospective question- 
naires, where the mother of a schizophrenic’s 
guilt, shame, or feeling of failure may lead to 
important response biases and even deliberate 
falsifications. 

Amidst this methodological confusion, a flourish 
of Occam’s razor reduces the issue to the crucial 
general question: Are differences in schizophrenic 
symploms related to the mother’s avowed attitudes 
toward child rearing? This study will proceed 
de novo, neither proposing nor testing specific sub- 
hypotheses in order to concentrate upon the funda- 
mental question. The weakness of comparing 
social failures (schizophrenics) with social successes 
(normals) will be avoided by relating symptom 
differences within a group of hospitalized schizo- 
phrenics to the avowed attitudes of their mothers. 


PROCEDURE 


The subjects were selected on the basis of the avail- 
ability of questionnaire information. They are part of a 
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group of 49 hospitalized male schizophrenics described 
elsewhere in terms of symptom pictures (Guertin, 
1961). In this earlier study, transposed factor analysis 
disclosed five type-factors, and the subject’s loading 
on each of the factors constitutes the independent 
variable in the present investigation. The type-factors | 
were labeled normal, catatonic withdrawal vs. over- 
expressive, paranoid, resistive isolation, and anxious- 
dysphoric. The sample is heavily loaded with chronic 
cases despite the imposed upper age limit of 40, (Further 
description of the sample will be found in the original 
article.) 

The mothers’ objectively scored responses to ques- 
tionnaire items provided the dependent variable. 
Mothers’ attitudes on family life and children were 
evaluated by the Parental Attitude Research Instru- 
ment (PARI) developed by Schaefer and Bell (1958) 
from an earlier scale (Shoben, 1949). Information about 
the subjects and their developmental history and en- 
vironment was provided by the M-B History Record 
(MBHR) (Briggs, 1959) filled out by the mother or, in 
seven cases, by the father or a sibling. Data were 
gathered entirely by mail, ostensibly to assist in the 
planning of activities for the son in terms of his past 
history. The return rate was 85%. 

Rank-difference correlations were calculated between 
each of the five type-factor loadings for subjects and 
the corresponding mothers’ scores on the 23 scales of 
the PARI and scores on the seven clusters of the 
MBHR (Briggs, 1959). The number of omitted answers 
on the MBHR was correlated with the type-factor 
loadings, increasing the total number of coefficients 
to 155. 


RESULTS AND DISCUSSION 


Only 10 of the 23 PARI scales and 4 of the 7 
MBHR clusters showed correlations with any of 
the type-factor loadings significantly different from 
zero at less than the 5% level. They are reported 
in Table 1 along with the correlations for number of 
omissions on the MBHR. 

Since the total number of significance tests made 
was 155, almost eight values could be expected to 
appear by chance alone to be significant. Sixteen, 
or twice as many as were attributable to chance, 
were actually obtained. 

The uneven distribution of significant ($ < 05) 
coefficients according to type-factor column in the 
table cannot be attributed to chance (chi square 
21.49, 4, p < .001). Thus, not only is the number 
of significant coefficients double chance expectancy, 
but they clearly are not distributed by chance. —— 
Some type-factors are more closely related to the - 
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TABLE 1 
CORRELATIONS OF SIGNIFICANT PARI AND 
MBHR SUBSCALES WITH SCHIZOPHRENIC 
TYPE-FACTOR LOADINGS 


Type-factor 
Scale and subscal * 
ie and subscale a ae E ?8 
3 18 8 |28 a 
E [22 8 |g] 3 
2 SF] & jet] 8 


PARI (N = 28) 
Encourage verbalization 42* 05 28 25 07 
Seclusion of mother 


Martyrdom -0 |-16|—20| 00 | —38* 

Marital conflict 39* | 21 | —30 | —o1 | —os 

Irritability —02 02 | —20 | —22 | —38* 

Suppression of aggres- | —25 | —12 | —25 | —40*| —23 
sion 

Equalitarianism 55**"| o0|—i5| 30| soe 

Inconsiderate husband 03 17 | —29 | —25 | —38¢ 


Comradeship & sharing 37* 02 00 35 30 


Dependency mother —10 14 | —19 | —26 | —41* 
MBER (N = 31) 
Psychopathic —36* | —02 | —06 | —19 | 00 
Achievement —28 |-s7* 22| —26 | —04 
Neurotic 12 |—28| 12| —35| 46%% 
Schizoid 39 | 13|—18| —M | 17 
Number of Omissions | —12 | —03 | —07 | —27 | —so*** 
*?«.0. 
"p < .02. 
ENS POL, 


questionnaire variables than are the others. That 
the mothers of some types of schizophrenics be- 
have differently on the questionnaires than do the 
mothers of others is therefore demonstrated. 

While it is established that mothers of different 
types of schizophrenics behave differently on the 
questionnaires at the present time, this is only an 
equivocal answer to the ultimate question of 
whether the patients’ family environments differed. 
Gordon (1957) has shown that questionnaire re- 
Sponses may be quite unreliable indicators of how 
a mother actually behaves with her child, and Bell 
(1958) discussed the many problems of evaluating 
child rearing behavior in the past from retro- 
Spective questionnaires in the present. Nor is the 
MBHR a direct indication of early environment; 
tetrospective distortion and deliberate falsification 
Probably occur. Also, very recent adjustment is 
Weighted rather heavily in the cluster scores of the 
MBHR. While the exact nature of the childhood 
home situation cannot be inferred easily from the 

ARI or MBHR responses, the differences in 
Mothers’ current behavior make it reasonable to 
expect that they also behaved differently toward 
their sons in the past. 

From the pattern of significant coefficients in 
Table 1, it is quite surprising to find that the more 
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florid symptom types, catatonic and paranoid, 
are not systematically related to the questionnaire 
scales. Goldstein and Carr (1956) were disap- 
pointed in seeking differences in mothers’ attitudes 
for these two groups. They did find significantly 
more inability to answer items among the mothers 
of the catatonic, While their findings are not con- 
firmed here by the correlations for omissions, the 
experimental procedures really were not compar- 
able. 

The two symptom types of Table 1 that are 
defined best in terms of questionnaire scales are 
the normal and the anxious-dysphoric. Comparison 
of significant correlations for each type-factor with 
the factors derived by Zuckerman, Ribback, 
Monashkin, and Norton (1958) from the PARI 
shows correspondence with two of their three ob- 
tained factors. The normal type-factor is high for 
Zuckerman's democratic attitudes factor, while the 
anxious-dysphoric type-factor is high negative for 
authoritarian-control. The democratic attitudes 
factor is composed of items added by Shaefer and 
Bell (1958) so that the average mother could agree 
with more of the items, and they were called “Tap- 
port” items. As Zuckerman et al. suggest, the fac- 
tor probably is related to an acquiescent response 
set (p. 170). The meaning behind strong professed 
disagreement with authoritarian-control items by 
the mothers of anxious-dysphoric patients is not 
apparent. Yet it is interesting that these least 
typical schizophrenics (pseudoneurotic) have 
mothers who denounce maternal domination, the 
feature which is most often pointed to as schizo- 
phrenogenic (Ekstein, Bryant, & Friedman, 1958, 
564ff.). 

The largest coefficient for the MBHR is for the 
neurotic cluster. Since current adjustment and 
behavior are heavily weighted for that cluster 
score, the high correlation merely reflects the 
neurotic symptoms of the anxious-dysphoric type- 
factor. The schizoid MBHR cluster is more 
heavily weighted for social isolation than for 
autism or thought disorder, so the normal type- 
factor may be reflected in a high score on the clus- 
ter but in few schizophrenic symptoms. However, 
there is an implication that those schizophrenics 
who make a good present adjustment are charac- 
terized by a history of social isolation. Quite prob- 
ably the chronic undifferentiated schizophrenic in 
good remission scored high on the normal type- 
factor and is responsible for the observed relation- 
ship. 

In conclusion, it appears that there is some 
relationship between schizophrenic symptom 
types and the responses of their mothers to ques- 
tionnaires. It is difficult, however, to conclude 
that their actual home situations differed but it 
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seems likely that important home differences 
existed. 
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STATUS RESTORATION AND THE REDUCTION OF HOSTILITY! 
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UMEROUS objections have been raised against 
the generality of the concept of frustration 
as used in the Yale theory of aggression (Dollard, 
Doob, Miller, Mowrer, & Sears, 1939). Maslow 
(1941) contends that not all deprivations lead to 
aggression but only those which are threatening to 
the security, integrity, or status of the organism. 
Rosenzweig (1944) distinguishes between need 
persitive and ego defensive reactions to frustration. 
Horwitz (1956) postulates that hostility is aroused 
as a consequence of the reduction of a person's 
power or adequacy. Worchel (1960) similarily con- 
ceives of hostility as arising as a result of “ego 
threat" following the failure or punishment of 
instrumental aggression. It is the purpose of the 
present study to test the implication of a “threat” 
theory of hostility that restoration of status or 
esteem reduces hostility. 


METHOD 


Berkowitz (1958) has suggested some important 
considerations in any adequate test of a catharsis 
hypothesis. Since the present study is concerned 
with the influence of various techniques (other 
than communication and catharsis) on the reduc- 
tion of hostility, the relevant precautions proposed 
by Berkowitz have been taken into account: (a) 
some measure of inhibited aggression, (b) careful 
control of the frustrating variable, and (c) separate 
analysis of the results for subjects who are charac- 
teristically hostile. The TAT is used to obtain 
measures of projective hostility and aggression- 
anxiety. The frustrating situation is identical for 
both the experimental and control subjects, and 
the control task is kept as neutral as possible. The 
results for boys and girls are treated separately 
and the self-ideal discrepancy, which has been 
shown to be related to the expression of hostility 
(Rothaus & Worchel, 1960), is included in the 
factorial design. 


Subjects 


From our introductory classes in psychology, 96 
freshman and sophomore subjects were drawn to 
complete a 2 X 2 X 4 factorial design (Sex X Self- 
Ideal Discrepancy X Treatment). The 48 males and 
females were classified as either high (above 49) or low 
(49 and below) in self-ideal discrepancy on the basis of 


.' This research was supported by the United States 
Air Force through the Air Force Office of Scientific 
Research of the Air Research and Development Com- 
mand, under Contract No. AF 49(638)-460. 


their responses to the Self-Activity Inventory (SAI) 
(Worchel, 1957) taken during the first meeting of the 
class. 


Procedure 


The experiment was conducted in small groups of 
16-20 subjects selected on the basis of sex and self- 
ideal discrepancy so that each group had approxi- 
mately the same number of males and females with 
high and low self-ideal discrepancy. To minimize social 
structuring (e.g., friendship and leadership patterns) 
and the effect of initial, differential attitudes towards 
the class instructors, subjects for any one experimental 
session were drawn from all four psychology classes 
with no more than five subjects from any one class. To 
reduce the probability of communication from one 
experimental group to another, sessions were conducted 
on successive hours on two successive days. The sub- 
jects were not informed as to the nature of the experi- 
ment until all groups had been tested. There were four 
different treatments for a complete experiment: success 
(E1), threat removal (Ex), threat removal plus success 
(E;), and control (C). Biases due to time of testing, 
communication effects, etc. were minimized by repli- 
cating the experiment three times with different orders 
of testing determined by an assistant. The experi- 
menter did not know the experimental treatment to be 
introduced until after the interpolation of the frustra- 
tion. 

At the first meeting of the classes, the SAI was 
administered with the comment that it has been the 
practice of the department of psychology to administer 
a battery of psychological tests for recommendations 
and guidance, and that since the class hour was not 
long enough to complete the entire battery, it would be 
necessary for the students to report during a free hour 
for further testing. Schedules of their free hours were 
filled out so that appointments could be arranged. 

When the small experimental groups met at the 
appointed hour, they were told that the examiner was 
a professor in the department of psychology, that the 
scores on their tests would be kept on file, and that 
they could arrange for appointments to review their 
scores. An "intelligence test" consisting of a number 
of subtests was then administered with the comment 
that each test was timed from 30-90 seconds and that 
most students would be able to finish each subtest in 
the prescribed time limits. The first subtest was quite 
easy and most subjects were allowed to finish it without 
any comment from the experimenter. The next four 
subtests were difficult, could not be finished within the 
prescribed time limits, and the experimenter kept 
interrupting with belittling remarks as: “You are too 
slow,” “Speed it up,” “You are taking too long on the 
items,” “Skip around if you can’t do one item,” etc. 
After the fifth subtest, the procedure varied, depending 
upon the nature of the experimental treatment. At this 
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TABLE 1 

Mean SCORES ror DIRECT AGGRESSION (DA), SELF-AGGRESSION (SA), PROJECTIVE-AGGRESSION (PA), 
AND AGGRESSION-ANXIETY (AA) on ALL Groups or SUBJECTS UNDER ALL CONDITIONS j 
— 

Threat removal Success Ga pieta Control 

Male Female Male Female Male Female Male Female 
LoSI | HiSI | LoSI | HiSI | LoSI | HiSI | LoSI | HiSI | LoSI | HiSI | LoSI | HiSI | LoSI | HiSI | LoSI | Hist 
DA 47.7 |39.5 |45.0 |47.0 |46.7 |45.0 |39.5 |40.8 |55.0 |47.2 |44.2 |42.8 |63.3 |57.2 |62.0 |67.7 
SA 45.5 41.2 |39.5 |45.1 |41.3 |33.3 |39.2 |50.8 |39.8 |47.2 48.7 |43.0 |39.2 |44.0 |41.0 49.3. 
PA 2.0 | 2.2 | 1.7 | 2.2 | 1.7 | 2.5 | 2.7 | 2.0 | 2.2 | 2.3 | 2.2 | 2.0 | 2.3 | 1.7 | 2.3 | 1.2: 
AA 2.7 | 2.5 | 2.2 | 3.0 | 3.0 | 2.9 | 2.2 | 2.5 | 2.3 | 3.2 | 2.7 | 2.0 | 2.5 | 2.7 | 2.5 21 

TABLE 2 


point, the experimenter would open an envelope to 
indicate the treatment to be employed. 

One experimental group (success) continued the 
intelligence test which had five more subtests attached 
to the booklet. For these subtests, the experimenter 
made no further comment during the testing, allowed 
most of them to finish, and praised their work at the 
end of each subtest with such remarks as “You are 
doing much better,” “That’s good, most of you have 
finished the test." The second experimental group 
(threat removal) had a neutral questionnaire request- 
ing some biographical data following the fifth and last 
subtest of the booklet. This questionnaire was arranged 
so that it would take approximately the same amount 
of time as the last five subtests, At the end of the ques- 
tionnaire, the experimenter apologized for mistakenly 
giving the wrong time limits on the intelligence test. 
He remarked that he had taken a stop-watch with a 
30 second face instead of a 60 second face, and therefore, 
they had received only one-half of the prescribed time 
for each subtest. The test, therefore, was not valid and 
it would be best to destroy the papers. He asked the 
class to cross out the first five tests, The third experi- 
mental group was given both threat removal and 
success. At the completion of the first five subtests 
(hostility arousal), the experimenter remarked that he 
had given the wrong time limits, etc. These tests would 
not count and, therefore, they could destroy them by 
marking through the answers, but for the next five 
subtests the time limits would be correct. As previously, 
praise was given at the end of each subtest and most 
subjects were allowed to complete the tests. The fourth 
group, the control session, was given the neutral ques- 
tionnaire to complete at the end of the fifth subtest of 
the intelligence test with no further remarks. 

Of course, the procedure does not eliminate entirely 
the possibility of fantasy aggression during the experi- 
mental treatment even though all the subjects were kept 
occupied. The same opportunity for such fantasy, 
however, is also possible during the control condition. 
Therefore, differences, if they occurred, between experi- 
mental and control subjects should not be due to 
differences in fantasy aggression. 


Dependent Variables 


To get a measure of their hostility following the 
experimental treatment, all subjects were adminis- 


ANALYSIS OF VARIANCE OF DIRECT AGGRESSION 
Scores FOR ALL GROUPS 


Source df | SS | MS| F 
Treatment 3 | 5744 | 1915 | 9.82* 
Self-ideal 1 98) 98) = 
Sex 1 s| 58) — 
Treatment X Self-ideal 3 86} 29) — 
‘Treatment X Sex 3 641 | 214 | 1.10 
Sex X Self-ideal 1 372 | 372 | 1.91 
Sex X Self-ideal X Treatment 3 68| 23| — 
Within 80 |15632]| 195 


* Significant beyond the .01 level. 


tered a Test of Social Sensitivity and Insight. Instruc- 
tions emphasized the importance of accuracy in judg- 
ing others even under conditions of brief contact, and - 
in honestly assessing one's own feelings and attitudes 
towards oneself. The subjects were requested to pan 
a check mark in one of the five blanks at the right 2 

each statement which best described their opinion from 
"strongly agree" to "strongly disagree." Two Scores 
were obtained from this rating scale: hostility toward 
the experimenter (DA) and self-aggression (SA). Hoe 
ity toward the experimenter included agreement T 

those items which devaluated and criticized the experi- 
menter or disagreeing with items that described the ex- 
perimenter in favorable terms. Self-aggression AA 
scored on items where the subject blamed himself for 
poor performance. i ; 

In order to obtain a measure of aggression id 
six slides of the TAT were used. The subjects were to i 
that the test measures imagination and the usual 
tions for the TAT were read. Then each slide was i 
jected for 30 seconds following which the subjects hos 
allowed 3-4 minutes to write a story. Two scores m 
obtained from the protocols: personal or project 
aggression (PA) and impersonal aggression or aggresio 
anxiety (AA). Personal or projective aggression 1S E 
act of aggression from one person to another person 
object. The criteria for an act of aggression were 
used by Clark (1955). An impersonal response (ene 
sion anxiety) includes any act of aggression towar! a 
person from a source which is vague and Cleary q 
from another person—in a sense, an act of God or 
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For example, a person is injured by an animal, or 
through an accident or illness. Two judges scored each 
protocol for the presence of a personal and an im- 
personal aggression. The interjudge correlation for 
PA was .86 and for AA .73. 


RESULTS AND DISCUSSION 


The means for each group across the three 
replications did not differ significantly and there- 
fore, all subjects under each of the four experi- 
mental conditions were combined. The mean 
aggression scores are shown in Table 1. Only the F 
ratio for the treatment variable on DA was signifi- 
cant beyond the .01 level (Table 2). None of the F 
ratios on any of the other dependent variables 
attained significance. 

On DA, the means for threat removal, success, 
and threat removal plus success were 44.8, 43.0, 
and 47.3, respectively. They did not differ from 
each other significantly but they were all signifi- 
cantly lower than the control group (62.5). The 
fact that there were no differences on SA, PA, and 
AA suggests that the results on direct aggression 
do not appear to be affected by inhibitory mecha- 
nisms. Therefore, the three techniques for remov- 
ing threat and restoring status seem to be quite 
effective in reducing hostility. 

The results, in general, confirm the importance 
of power restoration although the subjects in the 
present experiment did not take any action to 
change the experimenter as was true in the experi- 
ment by Horwitz, Goldman, and Lee (1954). The 
design does not permit any direct comparison 
between the techniques of power restoration 
utilized by Horwitz et al. and those in the present 
investigation but our data suggest that reduction 
of hostility by communication which alters the 
frustrating agent or by communication of hostility 
alone to the frustrating agent (Thibaut & Coules, 
1952) may be due to the restoration of esteem in 
these situations. 

It is possible, also, that the effective factor in 
reducing hostility by catharsis may be the experi- 
ence of power or status that follows when one 
“lets off steam” (McClelland & Apicella, 1945; 
Sears, Maccoby, & Levin, 1957). In a study of 
anxiety reduction, McKeachie, Pollie, and Speis- 
man (1955) found that subjects who were encour- 
aged to write comments about their answers to an 
examination made higher scores than students who 
had conventional answer sheets. Subjects, how- 
ever, who were instructed to write explanations 
made slightly higher scores than those who were 
encouraged to write feelings. The author suggested 
among other things that allowing students to com- 
ment may change their perception of the test from 
pa of punishment to one of facilitating their suc- 

ess, 


SUMMARY 


The present study reports positive results on a 
test of the implications of a “threat” theory of 
hostility, namely, that hostility is reduced by 
status restoration, Essentially, the experiment 
deals with techniques designed to restore the 
status or the integrity of the subject, who has been 
subjected to the hostile arousing conditions, with- 
out permitting expression of aggression (catharsis 
or communication). 
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THE EFFECT OF OVERT AGGRESSION ON PHYSIOLOGICAL 
AROUSAL LEVEL! 


JACK E. HOKANSON anp SANFORD SHETLER 
Florida State University 


question of whether the expression of 
aggression leads to a reduction in aggressive 
drive has been a point of conflict in the literature 
ever since first proposed by Dollard, Doob, Miller, 
Mowrer, and Sears (1939). The central problem in 
the study of this “catharsis hypothesis” has been 
the inability of investigators (Berkowitz, 1958) to 
“demonstrate clearly that the decrease in (post- 
aggression) hostile behavior is due to drive reduc- 
tion and not to response inhibition” (p. 274). 

A more general (and presumably more testable) 
question that has recently come under investiga- 
tion concerns the reduction of general arousal level 
after the expression of hostility. Worchel (1957) 
offers indirect evidence that the expression of ag- 
gression may reduce performance-interfering emo- 
tions (anger and anxiety). Hokanson (1961) found 
a suggestive negative correlation between vigor of 
aggressive response following frustration and post- 
aggression elevation in systolic blood pressure. 
The present experiment attempts to test directly 
the hypothesis that the expression of aggression, 
after a frustration, produces a greater reduction in 
physiological arousal (systolic blood pressure) than 
having no opportunity to aggress. 

In addition, the present research uses both a 
high and a low status frustrator in an attempt to 
assess the effects of this variable on “tension” 
reduction. The hypothesis is advanced that post- 
aggression systolic level should be more reduced 
with a low status frustrator than with one of high 
status. 


METHOD 


Fifty-six undergraduate students? taking an intro- 
ductory course in psychology were arranged in a 2 X 
2:X 2 factorial design. These three variables were: 
high and low frustration; high and low status of the 
frustrator (experimenter); opportunity vs. no oppor- 
tunity to aggress against the frustrator following the 
frustration manipulation. 

‘The experiment was introduced to the subject as one 
involving blood pressure response to working on routine 
intellectual tasks. To aid in establishing this deception, 


l'This project was supported by the Research 
Council of Florida State University. 

The authors are also indebted to Michael Burgess 
for his assistance in carrying out this study. 

2 The majority of subjects were female; however, 
owing to limited availability of subjects some males 
were also used. These latter subjects were evenly dis- 
tributed throughout the eight cells in the experiment. 


after an initial 8-minute adaptation period, the subject 
was administered the Picture Completion subtest of 
the WAIS, following which blood pressure was meas- 
ured. After another 2-minute rest period the frustration 
manipulation took place. The subject was asked to 
count backwards from 99 to 1 by two’s as quickly as 
possible. Subjects in the high frustration condition 
were then repeatedly interrupted and harassed con- 
cerning their slow performance; asked to begin again 
four times; and finally stopped with the statement that 
their data could not be used. Earlier use of this tech- 
nique (Hokanson, 1959) indicates that subjective 
feelings of anger are markedly increased. Subjects in 
the low frustration condition counted backwards from 
99 to 1 by two’s as quickly as they could ending with 
the experimenter remarking “Good.” 

Immediately following the frustration manipula- 
tions half of the subjects were given an opportunity 
to aggress physically against the frustrator by ad- 
ministering electric shocks? to him. This situation was 
structured as follows: Subject was told that the next 
task involved an interpersonal guessing situation in 
which the subject was to think of a number between 1 
and 10, following which the experimenter was to guess 
the number. If the experimenter’s guess was wrong, 
the subject was to signal this error by administering 
the shock. In this way the experimenter was presum- 
ably studying the effect of pain on his subsequent 
guesses. If the experimenter was correct, the subject 
would indicate this verbally. Among all opportunity 
subjects, the experimenter was thus "shocked" at 
least 7 times out of a total of 10 trials. Subjects in the 
no opportunity condition went through the same 
procedure except that the signaling of the experi- 
menter's errors was done by simply flashing a light 
instead of shocking the experimenter. 

The status manipulation was introduced at the 
beginning of the experimental session. The "high 
status" experimenter, a distinguished appearing male 
of 48 years, introduced himself as a visiting professor 
from a large eastern university who was continuing. 
research on blood pressure at Florida State University. 
The low status experimenter, an average-appearing 
male student assistant of 20 years, introduced himself 
as an undergraduate psychology major who was carry- 
ing out an experiment for one of his professors. Exten- 
sive rehearsal prior to the project insured that each 
experimenter used comparable procedures throughout 
the experimental session. 

Systolic blood pressure was recorded before and 
after each phase of the experiment, thereby affording 
an index of change as a result of each manipulation. 
"miner v6 N E M GRUPO US 


? Although the experimenter was wired to an elab- 
orate “shock” apparatus, he did not actually receive 
a shock, but merely behaved as if he had. 
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RESULTS 


Table 1 presents the analysis of variance of 
systolic changes resulting from the frustration 
manipulation. The significant main effect for 
frustration indicates that subjects in the high 
frustration condition manifested considerably 
greater systolic increases (11.4 mm. of Hg) than 
subjects in the low frustration treatment (3.9 mm. 
of Hg). 

An analysis of the changes in systolic blood pres- 
sure after the expression of aggression, with respect 
to prefrustration systolic level, is presented in 

- Table 2 along with the results of a Duncan multiple 
range test among the eight treatments. This 
analysis is aimed at pointing up any differences 
among conditions in the extent to which systolic 
pressure returned to its pre-arousal level. The sig- 
nificant Status X Opportunity interaction suggests 
that regardless of frustration treatment, subjects 
with the low status experimenter have a trend 
toward higher elevation in blood pressure at the 
end of the experiment under no opportunity to 
aggress condition (6.0 mm. of Hg) than with oppor- 
tunity (1.6 mm. of Hg); whereas under high status 
the trend is reversed, with the "opportunity" 
subjects having a somewhat greater elevation (4.1 
vs. 2.7 mm. of Hg). Comparison of the four cell 
means by Duncan multiple range test reveals no 
significant differences among treatments. These 
trends however, are elaborated further in the 
second order interaction. 

The Status X Frustration X Opportunity 
interaction is best seen by reference to the cell 
means and the results of the multiple range test 
presented at the bottom of the table. The findings 
Suggest that a major part of the variance in this 
interaction can be accounted for by the marked 


TABLE 1 
CHANGE IN SvsrOLIC BLOOD PRESSURE AS A 
RESULT OF FRUSTRATION 


df SS MS F 
Status 1 5.8 5.8 
Frustration 1 | 787.5 | 787.5 | 33.2** 
tatus X Frustration 1| 37.8| 37.8 
Within 52 1231.1]| 23.7 
Total 55 |2062.2 


Cell means of systolic changes during frustration 


Low High 
frustration frustration 
Low status 3.43 12.57 
High status 4.43 10.29 
"p <o. 
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T TABLE 2 


ELEVATION oF SYSTOLIC BLOOD PRESSURE 
AFTER THE EXPRESSION OF AGGRESSION 


(re prefrustration level) 
df SS MS F 

Status 1 1.8 1.8 
Frustration 1| 44.6] 44.6 
Opportunity 1} -31.5)| 31.5 
Status X Frustration 1| 77.8) 77.8 
Status X Opportunity 1| 120.1 | 120.1 | 5.18* 
Frustration X Oppor- 1| 20.7| 20.7 

tunity 
Status X Frustration X | 1| 97.7 | 97.7 | 4.21* 

Opportunity 
Within 48 1113.1 | 23.2 
Total 55 [1507.3 

Cell means of systolic changes during the 
expression of aggression 
Low status High status 
Low High Low High 
frustra- | frustra- | frustra- | frustra- 
tion tion tion tion 
portunity 1.43, | 1.71. | 3.71. | 4.57. 
o opportunity | 2.00, | 10.00, | 3.71, | 1.71, 


® Cell means with different subscripts are significantly different 
from one another at the .05 level by Duncan multiple range test. 
*p< 05. 


elevation in the low status-high frustration-no 
opportunity condition. The systolic elevation (re 
prefrustration level) of 10.0 mm. of Hg in this cell 
is reliably greater (p = .05) than the elevations in 
the remaining seven cells—with no other inter- 
condition differences being significant. 


Discussion 


We have here evidence in support of the view 
that the blocking of aggressive behavior towards a 
frustrator following an anger provocation results in 
an elevated physiological arousal level—but only 
with a frustrator who is perceived as being of ap- 
proximately equal or lower status. Following the 
course of systolic pressure during the experiment 
for subjects with the low status frustrator: there is 
a mean increase of 12.6 mm. of Hg as a result of the 
frustration manipulation; a return of blood pres- 
sure to within 1.7 mm. of Hg (on the average) of 
prefrustration level for subjects given an oppor- 
tunity to aggress against the experimenter; but a 
negligible drop (2.6 mm.) for subjects given no 
opportunity to express aggression. Both the oppor- 
tunity and no opportunity subjects were faced with 
essentially the same intellectual and physical 
problem during the shock the experimenter portion 
of the procedure (guessing numbers technique) 
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with the exception that the opportunity subjects 
signaled an error by physically hurting the experi- 
menter. Apparently then, the knowledge that the 
experimenter was undergoing discomfort at the 
hands of the subject was the critical factor in the 
reduction of blood pressure. With respect to the 
no opportunity subjects arousal level was increased 
during frustration, and remained high at the con- 
clusion of the experiment in the absence of any 
“relief” type of interaction with the frustrator. 

The question arises then as to why there was 
not a similar persistence of elevated arousal in 
the high status-high frustration-no opportunity 
condition. The most parsimonious explanation 
seems to be one involving expectancy constructs. 
Quite possibly, subjects have learned that there is 
very little likelihood of their being able to aggress 
directly against a person of relatively high status; 
and therefore, even though frustrated, response 
tendencies toward overt aggression would probably 
be only minimally aroused. More likely would be 
the evocation of withdrawal responses. Towards 
the close of the experimental session then, tension 
reduction may have taken place at the prospect of 
getting away from the frustrating, high status 
experimenter. 

Closer inspection of the cell means at the bottom 
of Table 2 suggests further that subjects frus- 
trated by the high status experimenter and given 
an opportunity to aggress towards him had a trend 
toward greater post-aggression systolic elevation 
than subjects in the no opportunity condition, 
Quite possibly, being placed in a situation that 
calls for aggression towards a person of relatively 
high status may be associated with a certain 
amount of arousal. 

From the above discussion a tentative generali- 
zation may be drawn: following a frustration (and 
its concomitant elevation of physiological proc- 
esses), “tension” reduction may take place when 
the subject makes a response which he perceives to 
be appropriate to the situation—ie., overt aggres- 
sion towards a frustrator of equal or lower status 
or withdrawal (or some other nonaggressive be- 
havior) with a high status frustrator. Should these 
“appropriate” responses be blocked, we have in 
the present experiment partial evidence at least 
that physiological arousal will remain high. 


CRITIQUE AND NOTES 


SuMMARY 


Fifty-six undergraduate male and female sub- 
jects were exposed to the following conditions ina 
three-way factorial arrangement: high or low 
frustration by a high or low status experimenter 
with a subsequent opportunity or no opportunity 
to aggress physically (via electric shocks) towards 
the frustrator. Systolic blood pressure was meas: 
ured before and after the frustration manipulation 
and the expression of aggression. 

The main findings in this experiment were: (a) 
Írustration led to significantly greater systolic in- 
creases than the no frustration control condition: 
with both the high and the low status experi- 
menter; (b) subjects frustrated by the low status 
experimenter and given an opportunity to aggress 
against him manifested a return of blood pressure d 
to prefrustration levels which was not significantly - 
different from that of nonfrustrated subjects; 
whereas, (c) subjects frustrated by the low status 
experimenter and given no opportunity to aggress 
against him manifested significantly greater 
systolic elevations at the conclusion of the experi- 
ment than either frustraled-opportunity to aggress 
subjects or nonfrustrated subjects; (d) subjects 
frustrated by the high status experimenter mani- 
fested a return of blood pressure to prefrustration 
levels which was not significantly different from 
that of nonfrustrated subjects in both the oppor- 
tunity and no opportunity to aggress conditions. 

The results were discussed as offering support 
Íor the hypothesis that under certain conditions 
overt aggression has “tension” reducing qualities; 
but also, that other types of behavior (i.e., with- 
drawal) may reduce arousal if they are appro- 
priate to the situation. 
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TRENDS IN COGNITIVE ABILITY IN THE OLDER AGE RANGES 


A. E. MAXWELL 
Institute of Psychiatry, Maudsley Hospital, and University of London 


UE to different notions about how best to 
rotate psychological factors into meaningful 
positions, factor analysts often come to somewhat 
different conclusions even when they analyse the 
same data. For this reason, it is not to be expected 
that there will be universal agreement about the 
factors of Doppelt and Wallace's (1955) data for 
the Wechsler Adult Intelligence Scale for older 
people presented in this paper. However, this need 
not be thought of as wholly unsatisfactory if it is 
made clear at the outset what the aim of the 
present analysis is. 

Clinically, it is often customary to use only 10 of 
the subtests of the WAIS battery, and in the 
interpretation of factors derived from the inter- 
correlations of these tests, it often is the clinical 
psychologist's desire that the first factor be put 
through Vocabulary. The cognitive content of this 
test, namely, verbal comprehension, is readily ap- 
praised, and this helps in interpreting the factor. 
But since the Performance tests have considerable 
loadings on a factor anchored in this way, it is well 
to think of it not simply as a verbal but rather as a 
verbal-intellectual factor. When an analysis of the 
battery yields only two significant factors (Thomp- 
son, 1951), it now turns out that Block Design and 
Object Assembly have appreciable positive load- 
ings and Vocabulary an appreciative negative 
loading, on the second factor when orthogonality 
is retained. The latter can then be thought of as a 
Space performance or visualization factor. These 
are the two factors which we had primarily in mind 
when we came to analyse the intercorrelations of 
the WAIS tests for Doppelt and Wallace’s (1955) 
normative data for older people, and the purpose 
of the analysis was to see if there were any clear-cut 
trends in the contributions of the several tests to 
these factors after the age of 60. 

Notions of rotating the factors to “simple 
Structure” were not entertained. It is the writer’s 
(Maxwell, 1959) contention that, in our present 
State of ignorance about the standard errors of 
factor loadings, the simple structure concept is too 
easily abused, This then, is a comparative study in 
factor analysis. Strictly speaking, such a study 
Should begin with a test of whether the variance- 


‘ ! Thanks are due to Don Kendrick for valuable 
discussion during the preparation of this paper; to 
W. L. B. Nixon, who did the factor analyses and in- 
Verted the correlation matrices on the London Uni- 
versity Mercury Computer, and to N. Hemsley who 
assisted with the other calculations. 


covariance matrices for the four age groups (60-64, 
65-69, 70-74, and over 75 years) themselves differ, 
for only in so far as they do would one expect to 
find differences in factor patterns. The test would 
be required to be done on the data for raw scores 
as the transformation to scaled scores would tend 
to obliterate differences in variance between the 
groups. However, as all the data required to re- 
construct the variance-covariance matrices are not 
available in Doppelt and Wallace’s (1955) paper, 
we must pass straight to factor analyses of the 
standardized covariance or correlation matrices. 


Factor Analyses 


The original intention was to use Lawley’s 
maximum likelihood method of analysis (Thomp- 
son, 1951) for all four correlation matrices, so that 
a statistically valid test of significance of residuals 
would be available. However, difficulties were 
encountered with this method where the data for 
the 60-64 and the 65-69 age groups were con- 
cerned; the communalities and the factor loadings 
for Block Design approached unity, and the 
method naturally broke down. For these matrices 
the first four principal components (Table 1) are 
given instead. For the 70-74 group, four significant 
maximum likelihood factors were found, and for 
the Over 75 group three significant factors, They 
too appear in Table 1. 

The factors were now rotated so as to maximize 
the loading for Vocabulary on the first, and the 
loadings for Block Design and Object Assembly 
on the second factor. The remaining significant 
factor for the 70-74 age group, which had now been 
anchored as a result of these rotations, was found 
to have high loadings for Picture Completion, 
Picture Arrangement, and Similarities. As this is 
possibly the “second reasoning factor... appar- 
ently distinct from that measured by Arithmetic,” 
noted by Davis (1956), it was decided to rotate the 
third and fourth factors for the other three age 
groups so as to bring it into prominence for them 
too. The results are given in Table 2. The third 
factor may be labeled education of conceptual 
relations in the light of Davis’s description of it, 
but it will be discussed further below. 

Loadings for the fourth factor are also given, but 
apart from those for the 70-74 age group, where 
this factor is statistically significant, the factor 
need not be taken too seriously. Though the maxi- 
mum likelihood method of analysis broke down in 
the case of the two younger age groups, the re- 
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450 CRITIQUE AND NOTES 


TABLE 1 
UNROTATED Factor Loapincs 
Factors Factors Factors Factors 

ae (Age 60-64) (Age 65-69) (Age 70-75) (Age over 75) 

I H | II IV H b HI | IV I I HI | Iv I II | m 

Comprehension 76| 39| 02| 16} 71 |—43 |} 03|—16| 52|—18 |-57 | 11| 77 |-16 | 18 
Arithmetic 79| 15} 12| 36] 77 |—09 |—30 |-19 | 62 |—40 |—20 |—08 | 78 |—14 |-28 
Similarities 76| 35 |—21 |—30 | 73 |-04 | 29 |—41 | 67 |-24 |-25 | 39 | 72 | 17] 40 
Digit Span 63 |—31 | 67 |—11 | 61 |—48 |-35 | 19 | 50 |—47 |—24 |-22 | 65 |-21 |-35 
Vocal 82| 34| 10| 05|86|—28]| 01|—05| 72|—52 |-19 | 00| 85 |—39 | 10 
Digit Symbol 81 |—21 |-05 | 26|80| 12 |—23| 32| 71| 01|—34]|—24| 71 | 15 |-23 
Picture Completion 75 | 08| 08|—33]| 74} 10| 41] 44/90] 05] 12 |-01 | 57| 30| Ot 
Block Design 83 |—26 |-10 | 06} 76| 45 |—22] 07|46| 19|—82| 03] 66| 43 |—34 
Picture Arrangement | 75 |—22 |-14 |—33 | 80 | 06| 44| 00| 68] 11|—21| 23] 47] 53| 41 
Object Assembly 72|—36 |-39 | 11 |64| 60 |—18 |—23 | 37 |-54 |-06 |-37 | 58 | 56 |—11 


Note—Decimal points omitted. 


TABLE 2 
ROTATED FACTOR Loapincs FOR THE Four AGE GROUPS 
Factor I Factor II Factor III Factor IV 
Test 

60+ | 65+ | 70+ | 754+ | 604+ | 65+ | 70-- | 754+ | 604- | 65+ | 704- | 75+ | 60+ | 65+ | 70+ 
Comprehension 83 | 79 | 77 |—04 |-10 | 01 |-02 |-14 | 01| 07| 24| 09 |—16 |-08 
Arithmetic 78 | 74 | 14 | 76 | 11| 32|14| 34|—42 |-22 | 06|—04| 01|—19 | 16 
Similarities 83 | 68 | 66 | 58 | 11| 18|27| 00| 35| 36| 43| 61| Ot |—41 |-15 
Digit Span 44 | 76 | 72| 68 | 13|—07 | 10 | 31 |—33 |-39 |-14 |-17 |-79 | 19 | 17 
Vocabulary 89 | 90 | 89 | 93 |—02 | 10|10|—08|—10 | 04| 12| 07|—07 |-05 | 10 
Digit Symbol — 65 | 67 | 53 | 59| 48] 49|48| 46|—33 |-10| 09| 17|—06| 32] 40 
Picture Completion 72 | 63 | 58 |40| 16| 27/19] 33] 19| 51| 60| 39|—33| 44] St 
Block Design 65 | 50 | 34 | 42 | 56| 76|90| 69|—15| 00| 02| 27|—15| 07 |-03 
Picture Arrangement | 59 | 70 |41 | 21 | 52| 25| 42] 14| 22| 53| 47| 78|—28| 00] 10 
Object Assembly 50 | 33 | 40 | 29 | 74| 83 |52| 58|—08| 06 |—27| 48| 08|—23| 25 


Note:—Decimal points omitted. 


TABLE 3 
WEIGHTS FOR FINDING Factor Scores 
Factor I Factor II Factor III 
Test 

Go+ | 6+ | Ter | 5+ | oot | es+ | zov | z+ | cot | ost | ror | 135+ 
Me” 
Comprehension +32) .28| .24| .14 |—.34 |—.30 |—.11 |-.13 |—.20 |-.04 |—.04 | 07 
\rithmetic 20| .16| .16| .16|—.18| .09|—:05 | .19 |-.59 |-.36 |—.13 |—.20 
Similarities -30| .13| .15| .07 |—.15 |-.08 |-.01 |-.20 | .68| .36| .31| -3 
Digit Span —.06 | .29| .17| .12|—.13 —.21|—.06 | .15 |—.31 |-.52 |—.23 |—-20 
Vocabulary :30 1.25 | .46 | 160 |—:35 |-.17 |— "19 |— 145 |—:05 |-- 04 |-:22 |--13 
Digit Symbol -01| .08| .03| .06| .29| .24| ‘og | ‘19 |—51 |—:23 |—.23 |--03 
Picture Completion | .16| <08 |-.08| ‘01 |—‘09 |-02 |-.03 | ‘07 | 43 | .54| .88| -0 
Block Design —.01|—.05|-.11 |-.01 | .38| .49| :93| .50|—.27 —.14 —.01 | -90 
Picture Arrangement | .00| .11 |—.02 |—.05 | 36 —.06| :06 |—.07| .48| .56| .16| -48 
Object Assembly —.08|—.13| .05|—.05| .68| .39| .08| .29| .29|-.06 —.28 | -18 
SD of factor scores — | 1.00 | 1.00 | .94| .96| 1:00 | 1:00 | <95 | .87| 1.04 | 1.00 | .88| -88 


siduals after one factor had been extracted were be taken to be “specific” in nature and result from 
only just significant for these groups, and though the use of unities in the diagonal cells of the / 
the principal component factors, which were used matrix. 

instead, show a few loadings of considerable To facilitate a comparison of the four sets of 
magnitude on the fourth factor, these may safely results, it was deemed essential to obtain regression. 


CRITIQUE AND NOTES 


TABLE 4 
VARIANCES AND COVARIANCES OF FACTOR 
Scores 
(60-64) (65-69) 
I Ir ml I IL m 
I .9989 | .0003 |—.0126 | 1.0002 |—.0003 |—.0007 
II .0002 | .9987 |—.1144 |—.0003 | 1.0076 |—.0005 
m —.0126 |—.1144 | 1.0412 |—.0008 |—.0005 | 1.0013 
(70-74) (Over 75) 
x IL IH I bu IH 
I .8876 | .0519 | .0554 | .9233 | .0351 | .0391 
I .0520 | .8948 |—.0092 | .0351 | .7613 | .0626 
m .0554 |—.0092 | .7801 | .0390 | .0625 | .7751 


weights by means of which factor scores are ob- 
tained. These show the relative importance of the 
tests for predicting the several factors. They 
appear in Table 3. In Table 4, the variances and 
covariances of the factor scores are given. Examina- 
tion of the latter table shows that the weights yield 
factors which were virtually orthogonal since the 
off-diagonal entries in the matrices are almost 
zero, The standard deviations of the factor scores 
when standardized test scores are used are given 
by the square roots of the entries in the main 
diagonals, and to facilitate comparison, these are 
entered below the weights in Table 3. Where these 
standard deviations fall below unity, the corre- 
sponding weights should be scaled up appropri- 
ately, but this can be done mentally when com- 
paring the results for one age group with those for 
another. 


Discussion AND CONCLUSIONS 


When looking for trends with age in performance 
on the WAIS battery, it should be remembered 
that the basic data are the means and standard 
deviations (preferably for raw scores, which are 
uncontaminated by scaling) given by Doppelt and 
Wallace (1955). The information supplied by the 
factor loadings, and more especially by the factor 
weights (Table 3), is complementary to the basic 
data and is important primarily for showing the 
toles which the tests play relative to each other— 
that is, how they compete with each other in the 
composition of the factors as people advance in age. 

The loadings for all groups on the first factor are 
relatively high and positive, the highest being for 
Vocabulary, Comprehension, and Similarities. But 
when we come to examine the weights, interesting 
trends become obvious. Neglecting Digit Span and 
the five Performance tests, for which the weights 
are negligible, it is seen that those for Vocabulary 
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steadily increase with age, whereas those for 
Similarities, Comprehension, and (to a lesser 
extent) Arithmetic steadily decrease, Vocabulary 
is a relatively pure measure of verbal comprehen- 
sion, but Comprehension also measures (Davis, 
1956) general reasoning and fluency (that is, 
responsiveness and uninhibitedness). Similarities 
measures, in addition, visualization (the ability to 
manipulate actual and symbolic objects and to see 
relationships), and Arithmetic measures reasoning 
and numerical knowledge. It is logical then to 
conclude that with the onset of old age, verbal 
comprehension plays an ever increasing role in any 
determination of general intelligence, while general 
reasoning, visualization, and fluency play a 
gradually decreasing role. 

The second factor, to which we have attached 
the broad general title space performance, has its 
highest loadings on Block Design, Object As- 
sembly, and Digit Symbol. When we examine the 
factor weights, it is seen that those for Object 
Assembly and, to a lesser degree, Digit Symbol and 
Picture Arrangement, all of which are positive, 
tend to decrease with age, just as the weights for 
Comprehension, which are negative and anti- 
thetical to the former, decrease correspondingly. 
Since the main contrast here is between reasoning, 
in the case of Comprehension, and perceptual 
speed, in the case of the Performance tests, the 
decay of these two types of ability presumably 
explains why the weights for the tests tend to 
decrease and converge to zero as old age sets in. 
Block Design, on the other hand, which contrasts 
with Object Assembly in as far as it is a measure of 
mechanical knowledge, weathers well with age. 
The weights for Arithmetic for the second factor, 
which tend to increase, support this interpretation. 
This suggests that the retention of certain in- 
formation—such as, for example, the multiplica- 
tion table—is little impaired in old age. 

The third factor, which we have tentatively 
called education of conceptual relations, has its 
highest positive loadings for Similarities, Picture 
Arrangement, and Picture Completion, while it 
has considerable negative loadings for Digit Span 
and Arithmetic. Here the weights for Similarities, 
which are positive, and for Arithmetic and Block 
Design, which are negative, fall away steadily 
with age. This appears to be due to the decline in 
perceptual speed and numerical facility. The 
weights for Picture Completion, which are posi- 
tive, increase briskly until just over 70 and then 
suffer an abrupt decline. For Picture Arrangement, 
the weights are more steady. The latter deduc- 
tions, when taken in conjunction with our earlier 
remarks about this factor, suggest that the ability 
to form conceptual relations is fairly well main- 
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tained in old age, but that powers of inductive 
reasoning play a decreasing role. 

In summary, it seems fair to say that good per- 
formance on the WAIS battery of tests as old age 
sets in depends to an ever increasing extent on 
verbal comprehension, the command of language 
which a person attains and enjoys during youth 
and middle life. The contribution to performance 
made by inductive and deductive reasoning, per- 
ceptual speed, fluency, and perhaps to a lesser 
extent visualization, gradually declines, 


CRITIQUE AND NOTES 


REFERENCES 


Davıs, P. C. A factor analysis of the Wechsler-Bi 
scale. Educ. psychol. Measmt., 1956, 16, 127. 
Dorrzrr, J. E., & Wattace, W. L. Standardizatio 
of the Wechsler Adult Intelligence Scale for old 
persons. J. abnorm. soc. Psychol., 1955, 51, 312-33 
MAXWELL, A. E. Statistical methods in factor analysis 
"4 Psychol. Bull., 1959, 56, 228-235. 
Txomson, G. H. The factorial analysis of human abili 
(Sth ed.) London: Univer. London Press, 1951. 


(Received October 12, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 63, No. 2, 453-456 


SEX DIFFERENCES, SEX COMPOSITION, AND 
GROUP PROBLEM SOLVING! 


L, RICHARD HOFFMAN anp NORMAN R. F. MAIER 
University of Michigan 


ROBLEM solving research has consistently shown 

men to be superior problem solvers to women 
(e.g, Maier, 1933). Berry (1958) and Sweeney 
(1953) have shown this superiority even when 
certain correlated aptitude variables were con- 
trolled for. A number of investigators (Berry, 1958; 
Carey, 1958; Milton, 1957, 1958; Sweeney, 1953) 
have suggested that women's poor performance is 
due in part to a negative attitude towards problem 
solving associated with their “sex role identifica- 
tion." Carey (1958) attempted to improve these 
attitudes by having her subjects discuss them in 
groups, and accomplished an increase in the 
women's problem solving effectiveness. 

The present study used Maier's horse trading 
problem to determine whether problem solving in 
groups might be effective in improving women's 
problem solving performance. Maier and Solem 
(1952) showed that the proportion of correct 
answers to this problem increased following group 
discussion. This research was designed to answer. 
the questions: Do women benefit as much as men 
from solving the problem in a group? Do subjects 
benefit as much when they are in groups of only 
one sex as in groups of both sexes? and Is the 
effect the same when “majority right" conditions 
are controlled for? 


METHOD 


Subjects. Three hundred and eighty-eight students, 
primarily freshmen and sophomores, were recruited 
from various courses at the university. Half the sub- 
Jects were males. Ninety-seven four-person groups 
were established: 32 groups of each single sex composi- 
tion (all male and all female) and 33 groups of mixed 
Sex composition (two males and two females), Subjects 
were assigned at random to groups within each sex 
Composition category, except that friends and close 
acquaintances were prevented from being in the same 
group. 

Procedure. The standard horse trading problem? 
(Maier & Solem, 1952) was typed on a small sheet 
with the possible answers listed (He made $30, He 


‘This investigation was supported by a USPHS 
Tesearch grant (M-2704) from the National Institute 
of Mental Health, United States Public Health Service. 

Half the subjects were asked “How much did he 
make on the two transactions?” instead of the standard 
Question, *How much did he make in the horse busi- 
Ress??? Distributions of responses to the two questions 
Were identical so the results for both questions have 
been combined for the present analysis. 


made $20 [the correct answer], He made $10, He broke 
even, He lost $10). This sheet was placed face down in 
front of each subject. At a signal the subjects had one 
minute to solve the problem individually by checking 
one of the answers supplied on the sheets, These 
answer sheets were collected and a new set passed out 
for recording the subject’s solution following discussion 
with the other members of his (or her) group. Group 
discussion was permitted for 8 minutes. At the end, 
subjects were told to “mark the answer you now 
believe to be correct, regardless of what the rest of 
your group believes." This instruction was intended to 
discourage conformity. 


ReEsuLts 


As in past studies, a significantly higher propor- 
tion of males (53.1%) than females (25.8%) 
solved the problem correctly individually, Simi- 
larly, the proportion of males (84.8%) having the 
correct answer after group discussion was signifi- 
cantly higher than the proportion of females 
(60.1%). The proportion of males (75.6%) chang- 
ing from an incorrect answer to the correct answer 
after group discussion was significantly higher than 
the proportion of females so changing (51.4%). 
Finally, a slightly, but not significantly, higher 
proportion of females (14:3%) than males (6.9%) 
switched from initially correct responses to an 
incorrect answer following discussion. These 
results are summarized in Table 1. 

Although these results show a clear superiority 
of males over females in the ability to solve the 
horse trading problem, some surprising differences 
emerged when the data were analyzed according 
to the sex composition of the groups. Table 2 
compares the proportions of males and females in 
single sex and mixed sex groups achieving correct 
solutions before and after discussion, as well as 
the proportions of those who were initially incor- 
rect who changed to correct solutions after discus- 
sion.? It can be seen that prior to the discussion, 
males had a significantly higher proportion of 
correct answers than did females in both single-sex 
and mixed-sex groups. It should also be noted, 
however, that a somewhat higher proportion of 


3 The percentages of subjects switching from correct 
to incorrect solutions is so small that we have not 
reported it in subsequent tables. The trend in those 
data supports the main effects discussed in that females 
in all female groups showed the greatest tendency 
(18.5%) to switch from correct to incorrect responses 
following discussion. 
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TABLE 1 T 
PERCENTAGES OF CORRECT SOLUTIONS TO 
THE HORSE TRADING PROBLEM BY SEX 
After discussion, 
Correct Total switched to 
subjects | before | e x 
iScussion discussion Cortes: incorree t 
Males 53.1% | 84.8% | 75.6% | 6.9% 
(194) | (191) (90) (101) 
** ** **x 
Females 25.8% | 60.195 | 51.4% | 14.3% 
(194) (193 (144) (49) 


* Numbers in parentheses are bases for percentages. Four sub- 
jects failed to record answers following the discussion: two males 
and one female with the correct answer initially and one male with 
an incorrect answer. 

** Difference in percentages between males and females sig- 
nificant at the .01 level. 


females in mixed sex groups than in all female 
groups were initially correct (! = 1.65, p < .10). 

After group discussion, in single sex groups the 
proportions of males achieving the correct answer 
(85.6%) and switching from incorrect to correct 
solutions (78.3%) were significantly greater than 
the comparable proportions for females (49.6% 
and 41.0%, respectively). In mixed sex groups, on 
the other hand, the proportions of males and 
females who correctly answered the problem after 
discussion (83.3% and 80.3%, respectively) did 
not differ significantly, nor did the proportions of 
males and females (70.0% and 75.0%, respec- 
tively) who switched from incorrect to correct 
solutions. Moreover, these proportions were almost 
the same as the comparable proportions for males 
in the all male groups. In other words, while males 
profited as much from group discussion in mixed 
Sex groups as in all male groups, females profited 
much more in mixed sex groups than in all female 
groups. In fact, the females in mixed sex groups 
benefited as much as males in either type of group. 

Is this effect a function of women's greater 
chance of being exposed to the correct answer in 
mixed sex groups than in all female groups, and 
thus conformity to the majority answer? Or does it 
result from women’s conformity to the right 
answer proposed by male subjects? 

At first glance, the results provide a condi- 
tionally affirmative answer. to the first question. 
Due to the higher proportion of correct first solu- 
tions obtained both from males in general and from 
women in the mixed sex groups than from females 
in all female groups, a greater proportion of women 
was exposed to the correct answer in mixed sex 
groups than in all female groups. Over half (55.692) 
of the women in mixed sex groups as against only 
12.0% of the women in all female groups (p < .01) 


CRITIQUE AND NorEs 


TABLE 2 

PERCENTAGES OF SOLUTIONS BEFORE AND 
AFTER Group Discussion BY SEX 
AND SEX COMPOSITION 


j fud T carrech 
e op rele aio 
Single sex 
Males 52.3 85.6 
(128) (125) 
** ** 
Females 21.9 49.6 
(128) (127) 
Mixed sex 
Males 54.5 83.3 
(66) (66) 
Females 33.3 80.3 
(66) (66) 


^Numbers in parentheses are bases for percentages. Fi 
subjects in single sex groups failed to respond following 
cussion: two males and one female with the correct answer. 
one male with an incorrect answer. 
* Difference in percentage between males and females | 
nificant at the ,05 level. ty 
** Difference in percentage between males and females si 
nificant at the .01 level. 


were in groups where two or more subjects ini 
held the correct answer. Furthermore, in 
mixed sex groups with two or more subjects in 
tially correct, approximately the same proportion 
of males as females reported correct answers f 
lowing discussion (90.5% and 83.3%, respectivi 
and changed from incorrect to correct an 
(81.8% and 79.2%). These results are compa: 
to those obtained from males in all male groups! 
shown in the top half of Table 3. 
The remaining results in Table 3 show, howeve 
that the superiority of the women in mixed § 
groups over all female groups cannot be exp a 
entirely in terms of conformity to the majo! 
answer. The percentage of women switching 
an incorrect to the correct answer was not sig 
cantly different in mixed sex groups where two ! 
more subjects initially had the right ans 
(79.2%) from what it was in mixed sex g 
where one or no subjects were initially coi 
(70.0%). By contrast, in groups where one or 
subjects had the correct answer initially, the p! 
portion of women switching to the correct ans; 
was significantly higher in mixed sex group 
(70.0%) than in all female groups (39.8%), ev 
though only a small proportion of both males 
females was correct before the discussion in 
single sex and mixed sex groups. Thus regardless 
the initial distributions of correct answers in tl 
groups, the percentages of women in mixed 
groups changing to the correct answer were : 
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TABLE 3 
PERCENTAGES OF SOLUTIONS BEFORE AND AFTER 
Group Discussion BY SEX, SEX COMPOSITION, 
AND INITIAL DISTRIBUTION OF SOLUTIONS 


Total After 
ues correct | discussion, 
answer switched 
Sen after | to correct 
SCUSSIOD | discussion | answer 
Two or more members 
initially correct: 
Single sex groups 
Males 67.0 88.2 78.6 
(88)* (85) (28) 
Females 50.0 66.7 50.0 
(24) (24) (12) 
Mixed sex groups 
Males 73.8 90.5 81.8 
(42) (42) (11) 
. 
Females 42.9 83.3 79.2 
(42) (42) (24) 
Oneor no member initially 
correct: 
Single sex groups 
Males 20.0 80.0 78.1 
(40) (40) (32) 
E .. 
Females 15.4 45.6 39.8 
(104) (103) (88) 
Mixed sex groups 
Males 20.8 70.8 63.2 
(24) (24) (19) 
Females 16.7 75.0 70.0 
(24) (24) (20) 


“Numbers in parentheses are bases for percentages. Of the 
four subjects who failed to respond the second time, referred to in 
Table 2, Footnote a, the three males were in groups with two or 
more subjects correct and the female was in a group with one 
member correct. 

** Difference in percentage between males and females signifi- 
cant at the .01 level. 


Proximately the same as the percentages of men 
so changing in both all male and mixed sex groups. 
Females in all female groups, on the other hand, 
were generally much less successful in solving the 
problem. 

The data are insufficient to give a definitive 
answer to whether the women in mixed sex groups 
conformed to males’ having the correct answer, but 
what evidence is available casts doubt on the 
adequacy of this explanation also. There were 
nine mixed sex groups with one right and three 
Wrong answers to the first administration of the 
Problem: five groups in which a male was correct, 
and four in which the female was correct. There 
Was no greater tendency for women to be more 
likely to switch to the correct answer following dis- 
Cussion where a male had the correct answer than 
where a female had the correct answer. Also, males 
Were about as likely to change to the correct 
answer where a female was initially correct as 
where a male was, a finding that provides further 
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evidence that the women in mixed sex groups 
engaged actively in problem solving rather than 
looking to the men for the correct answer. 


Discussion 


The over-all results confirm earlier findings 
which showed the superior ability of males over 
females to solve the horse trading problem. The 
fact that women’s performance in mixed sex groups 
was equal to men’s and superior to women’s in all 
female groups confuses the picture, however. 

The random assignment of subjects to mixed 
and single sex groups, as well as the consistency of 
the results when one or no members were initially 
correct, rules out the likelihood that differences in 
personal attributes (e.g., aptitudes, attitudes) of 
women in the two types of groups account for 
their performance differences. Something in the 
mixed sex situation which differs from the all 
female situation must account for the difference. 

The possibility that women in mixed sex groups 
conformed to the correct answers developed by the 
male members cannot be ruled out. The probability 
that males more often than females discovered the 
correct answer in the mixed sex groups is strongly 
suggested by the significantly higher proportion of 
males as compared to females in single sex groups 
who had the correct answer after the group 
discussion. 

The results may also be explained, however, by 
hypothesizing that women in mixed sex groups 
were more motivated to work on the problem than 
were the women in all female groups. The men 
may have challenged the women to higher per- 
formance, have encouraged and guided the women 
to think through the solution process, or merely 
have provided a legitimation of the problem solv- 
ing activity. Such an explanation is consistent with 
a possible interpretation of Carey’s (1958) results, 
She interpreted her women’s improved problem 
solving performance to their improved attitude 
toward problem solving following discussion. 
However, she did not find evidence of attitudinal 
improvement. It is possible, then, that the dis- 
cussions of attitudes served to encourage the 
women to work harder in attempting to solve the 
second round of problems, and, since many women 
had the aptitude necessary to solve the problems, 
their average success increased. 

This interpretation may also explain why Maier 
(1933) found that women profited more than men 
in improved problem solving performance from a 
lecture on problem solving. The lecture, in giving 
them an approach to successful problem solving, 
may also have encouraged them to use their 
abilities, while the women in the control groups, 
without such encouragement, approached the 
problems with their usual negative attitudes. 
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Thus despite women’s generally negative atti- 
tudes toward problem solving, circumstances may 
be manipulated to motivate them momentarily to 
actively and successfully solve problems. While 
Milton (1957) and Carey (1958) have suggested 
that women’s sex role identification produces 
negative attitudes toward problem solving, the 
present results suggest that under certain condi- 
tions these negative attitudes may be overcome 
and women can adopt a positive approach to 
solving problems. Whether the negative attitudes 
are held in abeyance (to be reinyoked upon leaving 
the experimental situation) or other facets of the 
women’s personal identification are evoked in the 
presence of men remains an unanswered question. 
It is possible that women’s sex role identification 
may be used to motivate their problem solving 
activity. Milton’s (1958) more recent study showed 
that problems phrased in a “feminine content 
form" were solved approximately equally well by 
males and females and his interpretation of the 
results is consistent with the present one. In just 
what other ways sex role identification may be 
turned to productive problem solving activity in 
women would seem to be a subject for further 
research, 


Summary 


One hundred ninety-four men and an equal 
number of women solved Maier’s horse trading 
problem, first as individuals and then in groups of 
four. There were 32 all male, 32 all female, and 
33 mixed sex groups (two males and two females). 

The following results were consistent with 
earlier findings: 

1. 53.1% of the males as against 25.8% of the 
females solved the horse trading problem cor- 
rectly individually (5 < .01). 

2. 84.8% of the males as against 60.1% of the 
females had the correct answer following group 
discussion (6 < .01). 

3. 75.6% of the males as against 51.4% of the 
females switched from an incorrect to the correct 
answer following group discussion (b < .01). 

Confusing the sex difference literature is the 
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finding that the percentage of women in mixed se 
groups switching from an incorrect to a corr 
answer following group discussion was: (a) 
proximately equal to the percentage of men 
switching and (b) significantly greater than 
percentage of women so switching in all fema 
groups. 

Although an explanation in terms of women’ 
conformance to correct answers found by the ma 
members of the group cannot be ruled out com 
pletely, an alternative explanation appears stronger 
and consistent with the results of previous work by 
Maier (1933), Carey (1958), and Milton (1958), 
The suggested hypothesis is that the women in 
mixed sex groups were more motivated to solve 
the problem than were the women in all female 
groups. i 
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INTROVERSION-EXTRAVERSION DIFFERENCES IN 
JUDGMENTS OF TIME 


R. LYNN 
University of Exeter, England. 


N attempt has been made by Claridge (1960) 

to apply Eysenck's (1957) theory of intro- 
version-extraversion to the problem of time errors. 
The time error is a constant error that occurs in 
psychophysical judgments, and consists of syste- 
matic overestimation (negative time errors) or 
underestimation (positive time errors) of the 
second of two identical stimuli. The hypothesis 
advanced by Claridge is as follows: the first 
stimulus produces both excitatory and inhibitory 
effects, the inhibitory effects reducing its perceived 
intensity or duration. This hypothesis accounts for 
the preponderance of negative time errors that 
most investigators have reported. It also follows 
Írom this hypothesis, taken in conjunction with 
Eysenck's postulate that extraverts generate reac- 
tive inhibition more quickly and dissipate it more 
slowly than introverts, that extraverts should 
show larger negative time errors than introverts. 
Claridge verified this hypothesis using judgments 


error becomes cumulative with this method. The 
present paper reports the results of an investiga- 
tion designed to examine introversion-extraversion 
differences in estimations of time using the 
Llewellyn-Thomas technique, 


METHOD 


The subjects were two groups of 20 introverted and 
20 extraverted male university students. Introversion- 
extraversion was assessed by the Maudsley Personality 
Inventory (Eysenck, 1959a). The introverts scored 
between 6-18 on this scale and the extraverts between 
29-44. 

The procedure follows that of Llewellyn-Thomas 
(1959). The apparatus consisted of a light that could 
be switched on with two keys, one of which was held 
by the experimenter and the other by the subject. The 
experimenter told the subject that he would switch 
on the light for a brief interval and following this 
the subject should switch on the light and attempt to 
keep it on for the same interval of time, The subject 


TABLE 1 


MEAN TIME ESTIMATIONS OF INTROVERT AND EXTRAVERT GROUPS USING THE LLEWELLYN-THOMAS 
POSITIVE FEEDBACK TECHNIQUE 


Introverts 
M 


SD 
Extraverts 
M 


SD 
Value of ¢ 


* Significant at the .05 level. 


of intensity of sound and duration of time. A 
tendency for hysterics (extraverted neurotics) to 
show | greater negative time errors than dys- 
thymics (introverted neurotics) in judgments of 
time intervals has also been reported by Eysenck 
(1959). 
, Recently Llewellyn-Thomas (1959) has pub- 
lished à new procedure for obtaining time judg- 
Ments designed to maximize individual differences. 
his procedure involves the use of a positive feed- 
back technique in which the subject is required to 
make a judgment of a standard and is then pre- 
Sented with his judgment as his new standard and 
5o on over a number of trials. Any tendency to 


was told that there would be a number of trials which 
might differ in length and he was asked not to count or 
use any other aids to time estimation. The first stand- 
ard was 15 seconds, and there were nine further trials 
in which the subject was successively given his last 
judgment as his new standard. The interval between 


trials was approximately 5 seconds. 
RESULTS 


The results are given in Table 1. It will be noted 
fhat there are no differences between introverts 
and extraverts on the first five trials, but that dif- 
ferences in the direction predicted by Eysenck’s 
theory emerge from Trial 6 and become statisti- 
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cally significant at the .05 level on the last three 
trials. The results do not replicate the findings of 
-Claridge and Eysenck exactly, since they found 
significant introversion-extraversion differences on 
a single trial. It is not easy to account for this dis- 
crepancy, since the procedures used appear to have 
been the same; Nevertheless, the hypothesis that 
introversion-extraversion differences in time judg- 
ments do exist receives some support, from the 
present experiment. Further, Eysenck’s theory 
that these introversion-extraversion differences 
reflect differences in the generation of reactive 
inhibition entails the prediction that the differ- 
ences would become greater as the trials proceed, 
since reactive inhibition would not dissipate fully 
in the intertrial interval; and it is evident from 
inspection of the table that this also occurred. 


» 
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CORTICAL’ CONDUCTIVITY AND VOCABULARY 


: recent series of papers (Klein & Krech, 1952; 

Krech & Calvin, 1953; Livson & Krech, 
1955, 1956) has developed and elaborated the 
‘construct of “cortical conductivity" (CC). In 
brief, CC is said to be a parameter of cortical inte- 
‘gration and, as such, is postulated to be related to 
individual differences in cognitive functioning. 
The kinesthetic aftereffect test (KAE), associated 
ith the work of Kóhler and Wallach (1944), has 


* 


wi 
been employed as an indirect measure of CC, 
- High CC is viewed as being positively related to 
- efficient cognitive behavior and inversely related to 
the magnitude of the KAE (Livson & Krech, 
1955). 

The present study provides a direct test of the 
Livson and Krech (1955) hypothesis that vocabu- 
lary scores are negatively related to KAE. In 
addition, since scores on the visual aftereffect test 
- (VAE) and the KAE are not significantly inter- 
. correlated (Spitz & Lipman, 1960), the relationship 
_ between the VAE and the Vocabulary test will 
_ also be examined. 


METHOD 


The VAE and KAE apparatus and procedure has 

been detailed elsewhere (Spitz & Lipman, 1960). The 

- reliability of these tasks is .73 and .74, respectively. 
The Vocabulary test (Davis & Davis, 1953) was group — 
Administered and consisted of 60 items in which the ~ 

Subject had to select the correct answer from among 

- five alternatives. Although governed by a 15 minute 


1 
3 


D 


time limit, all subjects completed the task. 


: Subjects and Procedure 


. . In the first phase of the study, 111 female and 39 

E male college sophornore volunteers were administered 
the VAE followed by the KAE. The Vocubulary test. 

Was administered early the following semester during 

- “regular auditorium meeting of the sophomore class. 

3 Of the subjects on whom KAE and VAE measures 
Were obtained, 97 females and 27 males also received 
the Vocabulary test. ^ 


RESULTS AND DISCUSSION 


~ Since Rechtschaffen and Bookbinder (1960) 
Ve reported significantly larger aftereffects for 
j M 


1 Now at Special Studies Unit, National Institute of 
; ental Health, Bethesda, Maryland. 
ba The authors would like to express their apprecia- 
. Mon to Marshall P. Smith, Chairman of the Psychology 
ti erus of Trenton State College, for his coopera- 
EG. oughout this study and in particular for his 
Est on, administration, and scoring of the vocabulary 
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males than females, these data were examined 
separately ‘by sex. The normative data on the 
Vocabulary, KAE, and VAE tests are presented in 
Table 1. Pearson 7's between the Vocabulary and 
aftereffects measures are presented in Table 2. 

These results clearly do not support the Livson 
and Krech hypothesis. It is possible, of course, that 
a more direct measure of CC, i.e., some physio- 
logical measure (Krech, Rosenzweig, Bennett, & 
Krueckel, 1954) would yield different findings. At 
present, however, the Livson and Krech (1956) 
proposal that Speai "s “g” is “linked” with the 
construct of CC seems highly tenuous. 


TABLE 1 


MEAN AND SD FOR THE VOCABULARY, 
VAE, AND KAE Tests 


: KAE VAE 
n Vocabulary | Gamm.) | (in mm.) 
a | sp| u | sp| x | sp 
Females 97 37.40| 6.70 | 2.00 | 3.05 | 4.55 | 3.37 
Males 27 35.37| 8.19 | 3.25 | 2.73 | 5.85 | 3.71 


TABLE 2 


PEARSON CORRELATIONS BETWEEN VOCABULARY 
"AND THE KAE anD VAE Tests 


4 


Females Males 
4 (97; Q7) 
KAE VAE |. KAE VAE 
Vocabulary +.16 | +#.08| —:12| +.06 


Note.—None of the correlations are significant at 
the .05 level of confidence. 
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/ īru Volume 63, my 6-year term as 
Editor of this journal ends, and the 
editorial reins pass to Daniel Katz 

of the University of Michigan. The wise and 

competent direction of the journal being thus 

d assured for the immediate future, I report in 

this valedictory editorial some facts and im- 
pressions about its recent past, and record 
my indebtedness to the many people who have 

contributed to bringing the journal out. Such a 

report is in order, since the Editor is respon- 

siblé for a “public utility,” not a “private 
enterprise”; he holds a stewardship on behalf 
of the members of the American Psychological 

‘Association, of the journal’s authors and 

readers, and less tangibly but most important, 

on behalf of its sector of psychological science. 

To some readers, “abnormal and social” 
psychology has seemed to label a rather odd 
and ambiguous sector. When Morton Prince 
founded the Journal of Abnormal Psychology 

55 years ago, he provided a vehicle for serious 

^ psychological contributions that fell outside 

the reigning orthodoxies then concerned with 
pursuit of the “generalized normal adult 
human mind." *Abnormal" psychology was 
dynamic psychology; it was committed to an 
interest in the rich if disorderly phenomena 
of private lives, as these come most dramati- 

y cally into view from the vantage point of the 

4, Consulting room. The addition of social psy- 
chology (in 1921) seemed natural enough. 
This too was a new, untidy, unorthodox field, 
ìn which the pioneer investigators, as in the 
Case of abnormal, had to set greater store on 
the human importance of the problems than 
on neatness and rigor of method and theory. 
Joining the two fields was appropriate for an 

| Additional reason that now seems more a 

. matter of historical accident: the earlier syste- 

| ae formulations in social psychology (see 

d 1908 texts by McDougall and Ross) 

1 ew their explanatory principles from the 

i Same tradition of French psychopathology of 


£ 


Which Prince was a leading American ex- 

Ponent. From present perspectives, the social 

Psychology of imitation, suggestion, and 

" crowd phenomena was an oddly abnormal so- 
cial Psychology. 

Prince’s main love remained abnormal psy- 
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chology, however, and when he gave the jour- 
nal to the American Psychological Association 
in 1926, the deed of gift stipulated that “the 
subject matter of abnormal psychology shall 
be preserved as the dominant feature thereof.” 
Interpreting “abnormal psychology” broadly 
to include the dynamic psychology of per- 
sonality processes—an extension in keeping 
with Prince’s intent-as applied to the evolving 
character of the field—subsequent editors 
have remained faithful to his charge. Social 
psychology has accounted over the years for 
less than half of the journal’s contents, and 
editorial preference has been given to those 
aspects of social psychology in which focus 
on interpersonal or intrapersonal dispositions 
and processes provides linkage to the study of 
personality. 

But times have changed, and one may well 
ask whether the traditional marriage of ab- 
normal and social psychology between these 
green covers has the justification today that 
recommended it to Prince. On the sides of 
both abnormal and social psychology, scien- 
tific progress has transformed the fields for 
which the journal is responsible. Systematic 
investigation and explicit theory have made 
deep inroads on the less formal observation 
and leisurely speculation more characteristic 
of the journal a generation and more ago. 
By the same token, orthodoxies, especially of 
method, have developed on both sides of the 
journal’s sector. Once a respectable journal 
of unrespectable psychology, the journal as 
it presently reflects its field may appear to 
nostalgic eyes too respectable through and 
through. And with scientific respectability 
and orthodoxy has gone spggialization. Ab- 
normal psychology and social psychology 
have lost their amateur standing. Clinical 
psychologists and social psychologists have 
developed professional identities that limit 
their interest in consorting with one another 
in a common publication. 

The most insistent pressures on the mar- 
riage, practically, arise from the sheer quan- 
titative increase in research that has accom- 
panied these developments. Thus, in 1948, 
Gordon W. Allport received 149 manuscripts, 
of which he printed 62. I began reviewing 
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manuscripts in 1955, receiving 379 and accept- 
ing 146. By 1960 (when the papers published in 
the 1961 volumes were submitted), there were 
502 manuscripts to be reviewed, of which I 
finally accepted 205 for publication, having 
reviewed 2,825 and published 990 over the 
period of 6 years. The results of such a growth 
rate virtually impose specialization and divi- 
sion of labor, not only in regard to editing 
(the task would have been impossible without 
Shoben's collaboration as Associate Editor 
in the handling of the more clinical studies), 
but also with respect to production and con- 
sumption of research. Authors and readers 
must scan ever narrower segments of litera- 
ture if they are to keep abreast of it at all. 

With matters like these in mind, I raised 
in the middle of my editorial term the ques- 
tion of whether some kind of formal separation 
between abnormal and social psychology 
Should not be worked out within the frame- 
work of a single journal (outright divorce 
would be excluded by the terms of Morton 
Prince's deed of gift). Such a separation may 
well become a necessity in the not distant 
future, but on closer thought I am con- 
vinced that it can and should be avoided for 
the present. Apart from the traditions of the 
journal, which there are practical as well as 
Sentimental reasons for taking seriously, 
several considerations pointed in the direc- 
tion of maintaining the journal with its 
present broad spectrum of content. Foremost 
in my view is the desirability of resisting, 
in the "softer" areas of psychology with which 
JASP is concerned, the trend toward special- 
ization along standard and narrowly compart- 
mentalized lines. 

For the scientific study of personality in its 
Structure, development, and Change cannot 
properly be isolated from the study of social 
relations and processes. The composition of 
this journal, as of the Division of Personality 
and Social Psychology of the American Psy- 
chological Association, gives institutional 
recognition to this fact. Grant that today’s 
psychologists can at best master a smaller 
fraction of the current output that is poten- 
tially relevant to their scientific concerns than 
was formerly possible: but let us avoid, where 
we can, erecting artificial boundaries that 
channel institutionally this unavoidable selec- 
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nal’s part in publishing significant research on 


tivity. The lines of theoretical development 
in this part of our science are not clear enough, 
nor is the trajectory of cumulative growth in 
knowledge sufficiently well established, to 
make it a safe bet to freeze on particular 
patterns of specialization. At this choice point 
of editorial transition, then, JASP remains, 
wittingly, a journal of broad spectrum as it i 
passes to the editorial care of Katz. 

I have taken much satisfaction in the jour- - 


this wide front over the past 6 years. None- 
theless, there seems to me to be cause for con- 
cern about the health of the fields of psychol- 
ogy that it mirrors. The trend toward busy 
scientific respectability- has had its costs. 
When I invited thé Associate Editor's retro- | 
spective views, Shoben formulated the prob- 
lem with respect to the more clinical aspect 
of the journal in words that I can only echo: 


Tt seems to me that a remarkably high proportion of 
the research reported is clean, stringently conceived, 
and effectively executed, reflective of rigorous and pains- | 
taking thought and experimentation, and remarkably 
trivial! There are occasions when I have the unpleasant | 
fantasy that psychology has become so enamored of 
method that techniques become our independent vari- 
ables and our substantive problems only the dependent 
ones. In any case, much of what has been reported in 
the JASP over the last six years indicates a domination 
by methods that have evolved and become elaborated 
along traditional lines. While there have been flashes of 
novelty, there have certainly been no striking break- 
throughs, and I am a little appalled by the paucity of 
bold and vigorous theoretic thought that seeks 
Cogency and relevance along with rigor and methodo- 
logical sophistication. On the clinical side, this state of 
affairs implies two “disciplines” which really have too 
little to do with one another: One deals with tradition- 
ally researchable problems; the other with reflections on 
the lore of the consulting room. 


Just as the greatly increased volume of sub- 
mitted papers is a result of the ample financial 
Support that has become available for psy- 
chological research, so the fixation on tech- 
niques that Shoben laments can be charged 
at least in part to affluence, bigness, and 
“projectitis.” The culture of project research 
grants and competitive pressures from the 
academic and professional marketplace hardly 
encourages thoughtful inventiveness, for all 
the real advantages of the present era that 
we would be unwilling to forego; a spate of neat 
and tidy—yet conventional, fragmentary, 


, = 


EDITORIAL 


and not fully cumulative—reports tends to be 
engendered. 

And as one views from a slight distance this 
flow of publication streaming from project 
grant to bibliography, one regrets a relative 
weakness of intrinsic direction, a deficiency 
in the guidance of scientific development by 
the growing edge of theory in the light of his- 
torical perspective. Rather, styles and fashions 
have tended to follow upon each other’s heels 
in a way that reflects the “euphoria of gadgets" 
of which C. P. Snow says we should beware. 
The tachistoscope, the F Scale, the Taylor 
MA scale, the Semantic Differential: how 
much of the literature of the recent past re- 
volves around a few instruments and tech- 
niques that, like Mt. Everest, had the great 
advantage of being there. Yet our psycho- 
logical gadgets lack the class" of those that 
furnish the laboratory of the modern physi- 
cist. Our fascination with them, the extent 
to which they seem to dominate our research, 
is hardly attributable to their intrinsic at- 
tractiveness; rather, it is symptomatic of the 
relative lack of clear alternative sources of 
direction and of the prevailing rather narrow 
time perspective on continuities in our dis- 
Cipline. Parallel observations could be made 
about the succession of stylish problems for 
research, in which shallow historical perspec- 
tive has often isolated research done under a 
Currently fashionable labél from related earlier 
work done under different verbal auspices. 

If these are indeed current liabilities of the 
generally prosperous fields of “abnormal and 
Social” psychology, the journal must largely 
reflect them in spite of good will to buck the 
Current where it can. During my editorial 
term ambitious theoretical discussion has 
been welcomed here—but too seldom received; 
Continual pressure has been brought to bear 
on authors to present their work in the theo- 
retical and historical context that makes its 
Significance apparent. When authors of minor 
fragments were known to be engaged in sub- 
Stantial research programs, they were en- 
Couraged to report their work in larger seg- 
ments that would warrant discussion in 
broader context. The Editor tried, without 
iM much success, to convey to authors that 
ne premium necessarily set upon conciseness 

Teporting implies no predilection for brevity 
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as such. Meaty longer articles were eagerly 
received, A good many articles—but, in retro- 
spect, probably not many enough—were re- 
jected as not making a sufficiently important 
contribution: an uncomfortable judgment for a 
knowingly fallible Editor. And an increasing 
number of fragmentary reports that never- 
theless seemed to belong in the literature were 
published as notes rather than as major ar- 
ticles. That these policies had at least some 
measurable effect is indicated by the fact that 
the mean length of major articles and case 
reports (notes being excluded) was 5.6 pages 
in 1955 but 7.3 in 1960, with the increase re- 
flecting a steady upward trend. But the under- 
lying problem is not to be solved by 1.7 pages. 
The foregoing observations are offered not 
as catharsis for editorial frustrations (though 
they probably serve as such) nor in the pre- 
sumptuous spirit I fear they may suggest; I 
hope rather to conserve and communicate a few 
of the perspectives that an Editor is bound to 
achieve as a by-product of hard work from 
his privileged vantage point, and also to make 
explicit the rationale for some of the policies 
of my outgoing regime. One final complaint: 
about the bad writing that stands as a barrier 
between so much good research and its poten- 
tial reader. The fault that is at once most 
pervasive and most readily corrigible (by 
author, nor by Editor!) is that of writing that 
can only be characterized as intentionally 
bad; writing that skillfully follows a bad model 
in the mistaken notion that it is properly 
scientific (again the “euphoria of gadgets” l). 
The studiously observed and laboriously 
contrived passive construction; the precari- 
ously stacked multinoun phrase; the cryptic 
and functionless abbreviation; the “academic” 
stigmata of utilize, it was fell that, more spe- 
cifically, etc.—but I need say no more. 
There remains the pleasant task of acknowl- 
edging the contribution of others to editing 
the journal. The Editor is the focus of a large 
collaborative effort, to which many people 
commit their time and skill. I have already 
noted my indebtedness to the Associate Edi- 
tor, E. J. Shoben, Jr., to whom I delegated pri- 
mary responsibility for processing the more 
clinical articles. The Consulting Editors whose 
names appear in the front pages have de- 
voted endless hours to reviewing manuscripts, 
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usually making suggestions for their consider- 
able improvement. Most of them have per- 
formed this anonymous labor of love for a full 
6 years, and there are others who served loy- 
ally as Consulting Editors during earlier 
years of my editorial term: Leon Festinger, 
Herbert H. Hyman, Gardner Lindzey, Quinn 
McNemar, Renato Tagiuri, and the late 
Harold E. Jones. I am deeply grateful to all 
of them. 

The heavy load of reviewing manuscripts 
has been carried by these regularly listed Con- 
sulting Editors. But Shoben and I have called 
on a large number of additional persons for 
advice on particular papers. Listing them here 
not only records our gratitude for their help, 
but further illustrates the extent to which 
editing the journal has become a collective 
enterprise. The following persons have given 
us advice on one or more manuscripts: 


tion’s publications: initially, Lorraine Bouthi- 
let, and during the last 5 years, Arthur C. 
Hoffman. To my competent secretaries at 
New York University and at Berkeley, Susi 
Stein and Sara F. Caldwell, I am indebted for 
handling the journal’s extensive correspond- 
ence, maintaining its complex files, and, on 
occasion, sustaining my editorial morale. 

It is probably inevitable that the Editor 
will often have the feeling of flying blind: 
processing a monstrous mail and shipping 
stacks of manuscripts to the APA publication 
office whence they eventually emerge in print, 
with only an occasional critical howl or appre- 
ciative comment. The “feedback” that does 
come to him, from authors and readers, 
whether positive or negative, is therefore 
particularly welcome, and I should like to 
record my special gratitude to those authors 
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The arduous task of preparing copy for the 
printer and checking proof has been ably 
performed in the Central Office of the Amer- 
cian Psychological Association under the direc- 
tion of the Managing Editor of the associa- 


who have not only submitted gracefully to 
what must have often appeared to them as 
editorial mayhem, but have taken the trouble 
to let me know that in retrospect they thought 
their intellectual offspring may have benefited 


William F. Soskin 
Donald P. Spence 
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from the harsh treatment. Write to the new 
Editor when the spirit moves you! 

The pressures, the frustrations and grati- 
fications of the editorial role predictably leave 
me ambivalent as I turn over the direction of 
the journal to Daniel Katz. But there is no 
ambivalence in my attachment to the vener- 
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able journal, and it is therefore very satisfying 
to know that the new Editor is so admirably 
qualified to guide it toward a distinguished 
contribution to psychology in the coming 
years. May his editorial regime prosper, and 
may abnormal and social psychology flourish! 

M. BREWSTER SMITH 
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RELATION OF EARLY SOCIALIZATION EXPERIEN CES TO 
y AGGRESSION IN MIDDLE CHILDHOOD: 


ROBERT R. 


. SEARS? 


Stanford University 


'HIS report presents one part of a follow- 

up study of the 5-year-old children 

whose early socialization experiences 
were described in Patterns; of Child Rearing 
(Sears, Maccoby, & Levin, 1957). It attempts 
to trace the effects of some of those experiences 
on the children’s aggressive behavior 6 years 
later, 

During the winter of 1951-52, 379 public 
school kindergarten children in Newton and 
Watertown, Massachusetts, were selected as 
subjects for a study of the child rearing ante- 
cedents of identification, The design of that 
research required the measurement of certain 
aspects of both the children’s behavior and the 
mothers’ child rearing practices and attitudes. 
The measures of identification which seemed 
appropriate, at that time, related chiefly to the 
extent of children’s adoption of adult role 
behavior and to the stage of development of 
guilt feelings and other Signs of conscience, ` 
Measures of such behavior were obtained from 
three sessions of doll play, two permissive and 
one structured (Levin & Sears, 1956; Sears, 
1953), and from mother interviews (Sears et al., 
1957). These interviews were mainly directed 
to the measurement of the antecedent (ma- 
ternal) behavior, however ; they were designed 
to secure information about aspects of the 


nesota on March 7, 1961, 

2 The research was designed in collaboration with 
Eleanor E. Maccoby, John W. M. Whiting, and Wesley 
Allinsmith. Eleanor Maccoby and Wesley Allinsmith 
supervised the collection of the final data with the 
assistance of Roger V, Burton, and Eleanor Maccoby 
has collaborated in its analysis. I wish to express ap- 
preciation also to John C. Wright, who, while at Stan- 
ford, assisted in the Preparation of the aggression tests, 
The earlier stages of this r j at Harvard, were 
supported largely by a USPHS grant (M 461). The 
follow-up study was supported by an additional grant 
to Harvard (M 1468) and a mental health program 
grant to Stanford University by the Ford Foundation. 
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mother's practices and attitudes that had theo- 
retical relevance to the development of identifi- 
cation. The dimensions measured related to 
early caretaking, feeding, toileting, the han- 
dling of dependent, aggressive, and sexual be- 
havior, and methods of discipline. These inter- 
views were based on a standard series of 
questions and probes, and permitted entirely 
free answers. They were secured by trained 
interviewers in the mothers! own homes, were 
recorded and transcribed, and then coded on 
188 scales. It is the mothers’ scores on these 
scales that provide the antecedent measures 
in the present longitudinal study of the rela- 
tion of early child rearing experiences to 
children’s manifestations of aggression at age 
12. 


Follow-Up Study 


While the original study was directed to the 
Problem of identification, the theoretical 
Position on which the research was based 
Tequired measurement of maternal behaviors 
that were equally relevant to the development 
of various other aspects of children’s per- 
sonalities, Since the extent and kind of in- 
fluence of early socialization experiences on 
later childhood and adult behavior have been 
studied mainly by clinical techniques, this 
body of data seemed of considerable potential 
value both for testing hypotheses concerning 
developmental relationships and for searching 
out new ones. What was needed was a set of 
appropriate measures of the children, at age 
12, which could be compared with these earlier 
obtained reports of their socialization ex- 
periences, To these ends, therefore, a follow-up 
study was planned that would provide meas- 
ures not only of such presumably identifica- 
tion-induced behavior as adult role choice, 
resistance to temptation, and sex typing 
(masculinity-femininity), but of aggression 
and dependency also, 


METHOD 


Procedure. The follow-up measures of the children 
were obtained in the spring of 1958, when the children 
were in the sixth grade and were mainly between 11-6 
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and 12-6 years of age; for communicative convenience 
they will be called **12-year-olds" (cf. Table 1). This 
year was chosen because the schools operated on a 6-3-3 
division, and the sixth grade was the last occasion on 
which the children would be together in a single-teacher 
grade, permitting ratings of school behavior by a 
teacher who had full time supervision of a given child. 

The measures obtained were of four kinds. First, 
there was a series of self-administering scales, con- 
structed or adapted for the present purpose, for the 
measurement of masculinity-femininity, self-concept, 
nurturance, adult versus child role choice, and five 
aspects of aggressive attitude. Second, standardized 
teacher interviews were recorded, transcribed, and 
coded for the measurement of other displays of a few 
of these same variables. Third, a story completion 
measure of guilt was obtained. And fourth, a realistic 
test situation for the measurement of resistance to 
temptation was administered (Grinder, 1960). The 
Present report describes only those findings relating to 
aggression which were obtained from the self-adminis- 
tered scales. Although an attempt was made to include 
measures of the children’s aggression in the teacher 
interviews, the teachers were unable to report on this 
aspect of behavior with sufficient clarity to provide use- 
ful data. This may have resulted, in part, from the 
institutional- nonpermissiveness for child aggression 
toward either teacher or peers at the sixth grade level. 
The major difficulty in securing teacher interview 
measures stemmed from the lack of discriminable 
differences, 

Subjects. Of the original 379 children studied (Sears 
et al., 1957), approximately 200 had moved out of the » 
school districts in which they had originally lived; to * 
arrange for securing their cooperation and that of the 
schools and teachers in their new locations promised a 
very heavy expense. Therefore, the new measures were 
Secured from only those children who were enrolled in 
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the sixth grade classrooms of the schools, in the original 
towns, in which seven or more of the sample were cur- 
rently enrolled. The consequence of this selective pro- 
cedure was that completed aggression attitude scales 
were obtained from 160 of the original 379 subjects, and 
also from 377 other sixth graders who had not been in 
the original study. In the following discussion, the 160 
“originals” who had been subjects (Sears et al., 1957) 
are referred to as the follow-up sample, and the others as 
the 1958 classmates. The teachers were interviewed only 
about members of the sample group—and of course only 
for that group are the child rearing measures from 
mother interviews ayailable—so that data analyses in- 
volving internal comparisons within the self-adminis- 
tered measures will be presented for the combined 
groups, follow-up sample and classmates, but analyses 
involving antecedent-consequent comparisons from the 
1951-52 mother interviews must be limited to the 
follow-up sample group. 


Results 


Since the drop from 379 to 160 cases is a 
large one, some importance must be attached 
to questions concerning the sample composi- 
tion. In Table 1 are summarized certain 
demographic properties for three groups: the 
original group, the follow-up sample, and the 
1958 classmates in the sixth grade, Reports 
of ordinal position and father’s occupation had 
been secured fromthe mothers of the original 
group, in 1951-52, and were obtained from 
the children themselves in 1958. Hence double 
figures can be presented for the follow-up 
sample, those obtained at kindergarten age, 


TABLE 1 
Some DEMOGRAPHIC CHARACTERISTICS OF THE ORIGINAL (1951-52) AND THE Forrow-uP (1958) POPULATIONS 
(In percentages) 
i |= 958. 1958 
Original pets VATES sample Classmates 
Characteristic 1952 measures 1952 measures 1958 measures 1958 measures 
i i Girls B Girls 
av P9 a tan P5, | a, | a e. q^ 84) uv e iola = dt) 
Ordinal Position; ja 
ud 16.3 | 10.7 | 21.0 | 60 | 18.7 | 3.6 | 67 | 46 
En 29.2 | 28.3 | 26.3 | 25.0 | 30.7 | 28.6 | 31.8 | 36.5 
yale 23.8 | 32.2 | 21.0 | 35.7 | 24.0 | 40.4 | 21.8 | 30.3 
ingest 30.7 | 28.8 | 31.7 | 333 | 26.6 | 27.4 | 39.7 | 28.6 
er’s occupati, d 
Upper middle ^ came 65.2 | 56.9 | 60.0 | 58.5 | 63.4 | 56.6 | 53.5 | 57.3 
wer middle 8.6 | 10.3 | 14.7 | 110 | 14.1 | 9.6 | 15.1 | 40.5 
due iue 26:2 | 32:8 | 25.3 | 30.5 | 22.5 | 33.8 | 31.4 | 32.2 
at 1958 testing: 
$ Dh 118 | 1110] i111| 11-10 
SD (months) 4.6 | 3.0 | 7.0 | 4.8 


n. Father's occupational status was measured on the Warner (1952) scale and on the Hollingshead (1958) scale. Both are seven-point 
** Upper middle includes Categories 1-3, Lower middle is Category 4, and Upper lower includes Categories 5-7. 
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for comparison with the total original group, 
and those obtained in the sixth grade, for 
comparison with the 1958 classmates groups. 
In comparison with the total original group, 
the follow-up sample contained a slightly 
greater proportion of the original “only” 
boys, a slightly lesser proportion of the “only” 
girls, and slightly greater proportion of the 
“youngest” girls. None of these differences 
seems of major import, particularly in light 
of the fact that a few more children were born 
to these families during the intervening years. 

It is interesting to compare the relatively 
unselected group of sixth grade classmates with 
the sample. As of 1958, the sample was biased 
in favor of more “only” and fewer “youngest” 
boys, and more “middle” and fewer “oldest” 
girls. This bias reflects the effort made in the 
original study to select a stratified sample 
providing a more nearly equal composition in 
terms of sex, ordinal position, and socio- 
economic status (cf. Sears et al., 1957, pp. 
24-27, 512). 

With respect to social status, comparisons 
have been made on the basis of father's oc- 
cupation. The Warner Classification (Warner, 
Meeker, & Eells, 1949) was used in the original 
study, the Hollingshead (Hollingshead & 
Redlich, 1958) for the follow-up. The two are 
sufficiently similar for the present comparison? 
which shows that the follow-up sample is 
closely representative of the total original 
group. However, comparison of the 1958 
sample and classmates boys shows that the 
sample population contained a moderately 
(10-11%) greater representation of upper 
middle class children than the classmates, 
Obviously the original group did, too. There 
is no difference between sample and classmates 
girls, and both are closely representative of the 
original 1952 group. Mean ages of the two 
1958 groups of boys differed by 3 months, the 
sample being younger (p < -001)*; the girls? 
groups were identical. Both the sample groups 


3 The Warner index of SES, which was used in the 
original study (Maccoby & Gibbs, 1954), could not be 
used for the follow-up because it included income as one 
variable. Since we lacked the resources for further 
inquiry from the mothers, we contented ourselves with 
father's occupation, which the children could report. 
Occupation is weighted two and income, one, on the 
Warner revised index of SES. 

* Unless otherwise indicated, all ? values are based 
on a two-tailed test. 
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were narrower in range, however, as would 
necessarily be the case; they had all been in 
kindergarten in 1952 and were all in the sixth 
grade in 1958, Children who became retarded 
or accelerated during the 6-year interval were 
not reached by our follow-up procedure. The 
unselected sixth grade classmates, on the 
other hand, contained both retarded and ac- 
celerated children, and hence the range of ages 
was greater. 

So far as these various demographic meas- 
ures are concerned, then, the follow-up sample 
is a surprisingly close representation of the 
total original group from which it was drawn. 
Since none of these three populations can be 
considered accurately representative of any 
wider population, the comparability among 
them (except for the heavier representation 
among upper middle class boys) permits us to 
turn with not too much worry to within- 
group comparisons, 

Here we find further reassurance of the rela- 
tively unbiased character of the follow-up 
sample. Comparison was made of the means 
and distributions of 22 mother interview 
scales for the 160 cases in the follow-up sample 
and the 219 cases not followed-up. Six of these 
scales described the child's behavior (amount 
of weaning upset, severity of feeding problems, 
bed-wetting, amount of aggressive behavior in 
the home, amount of current dependency at 
kindergarten age, amount of conscience); none 
of these showed any difference. The other 16 
Scales related to important maternal be- 
haviors. On only two of them was there a 
significant difference between the mean ratings; 
toilet training was a little less severe (p < .02) 
and the use of deprivation of privileges was 
less frequent (p < 03) in the follow-up 
Sample. There were no differences in early 
feeding, the handling of Sex, aggression Or 
dependency, the use of other disciplinary 
methods, or in family atmosphere, authority, 
and warmth. There was no difference, either, 
in the amount of aggression the children had 
shown on the first session of doll play in 1952. 

In sum, the follow-up sample was a pretty 
accurate representation, both demographically 
and with respect to measures of child behavior 
and child rearing, of the original group whose 
socialization experiences were reported in 
Patterns of Child Rearing (Sears et al., 1957)- 
Further, with the exception of a 10% over- 
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weighting of upper middle class boys, and 
three variations of 10-12% each in ordinal 
position weighting, the sample was very 
similar demographically to the large group of 
their sixth grade classmates. 


MEASUREMENT OF AGGRESSION 


How does one measure aggression in 12- 
year-olds? Clearly, the answer rests on what 
theory is adopted as to the genesis and dy- 
namics of such behavior. There have been 
two main views about the matter that have 
influenced measurement and interpretation in 
both the clinical and experimental settings. 
One of these is instinctual and drive oriented; 
the other rests on a general theory of action 
and learning. Neither has been spelled out in 
sufficient detail to permit a purely theoretical 
derivation of appropriate methods of measure- 
ment. However, researchers who lean toward 
the instinctual view have tended to emphasize 
the use of projective methods, while those who 
find action theory more congenial have tended 
to use social behavioral measures. A brief 
comment on each will make clear the reasoning 
that led to the choice of instruments in the 
Present study. 

The notion that there is a certain quantum 
of internal instigation to aggression has been 
inherent in much of psychological theory for 
Some time. It was expressed most explicitly 
first by Freud (1922) in his conception of the 
death instinct. This nativistic position gave 
little attention to the origins of individual 
differences in amount or quality of aggressive 
behavior except by implication; childhood 
experience and environmental influences could 
excite or inhibit the expression of the in- 
Stinctual impulse, and could displace it out- 
ward, From the measurement standpoint, such 
a theory seems to imply that there are various 
alternative behavioral indices of the underlying 
motive. The chief aim in choosing one rather 
than another is to obtain the freest (least 
inhibited) expression. Thus, a favorite device 
for the measure of many types of primary or 
Secondary “drive induced” behavior is some 
Projective technique such as the TAT, story 
completion, or doll play. These methods are 
assumed to aid in circumventing social in- 
hibitions that would otherwise suppress the 
full expression of the drive. They represent an 
effort to sample the “true” uninhibited 
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quantum. However, while there is no question 
that projective responses do reveal the exist- 
ence of instigation to aggressive, sexual, 
achievement, and other socially controlled 
actions, there is also good evidence that such 
projections are themselves strongly influenced 
by momentary conditions of excitation and 
inhibition (Clark, 1955; R. R. Sears, 1951). 

The alternative theory makes no assumption 
about a quantum of internal instigation that 
must be uninhibitedly tapped. On the contrary, 
it presumes that whole classes of aggressive 
acts are learned as responses to many dif- 
ferent stimulus situations, By stimulus 
generalization, these acts come to be evoked 
by great ranges of stimuli, both internal and 
external, including ones that are symbolic or 
have imaginal qualities only. By response 
generalization, acts can shift from one action 
category to another (e.g., from physical to 
verbal) or from one psychological function to 
another (e.g., from overt action to imagination 
or fantasy). Indeed, the lability of expression 
is so great that even entirely novel stimula- 
tional settings and manipulanda seem able to 
elicit and provide a means of expression of 
aggressive acts. Each such setting has its own 
instigative value, deriving from direct rein- 
forcements in the past, as well as from primary 
or secondary stimulus generalization. This 
effective instigative value is a cumulation of 
both excitatory and inhibitory values, of 
course.* 

This emphasis on stimulus situations need 
not exclude internal instigators, including 
those having essentially the qualities of drives. 
Indeed, in one respect, Frustration and Ag- 
gression (Dollard, Doob, Miller, Mowrer, & 
Sears, 1939), which first formally presented the 
action and learning theory of aggression, 
was not at all dissimilar to the earlier in- 
stinctual view; it simply relocated the source 
of impulsion from instincts to a frustration 
drive created by the cumulated frustrations of 
motivated behaviors of all kinds. The internal 
source was changed from a native reservoir of 


5 McDougall’s (1908) instinct of pugnacity rested 
somewhere between these two views. It did not assume 
a pool or reservoir of internal instigation (drive), but 
neither did it rely explicitly on learning principles to 
account for individual differences in directions and 
qualities of aggressive acts; these were instinctually 


organized action patterns. 
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energy to an experience-induced drive state. 
More importantly, however, this view in- 
troduced individual differences into the source 
of internal instigation, and placed the internal 
instigator in the same category of operationally 
definable antecedents as the external ones, 
The relevance of these theoretical matters to 
the problem of measurement lies in the as- 
sumptions that must be made about what is 
being measured. From a drive theory, one 
seeks to tap that way of behavioral expression 
of the drive that suffers least interference from 
either internal controls (e.g., anxiety, con- 
science) or external threateners (e.g., authority 
or moral symbols). The acceptability of any 
index can be evaluated either by a priori 
analysis of the measurement situation with 
respect to freedom from inhibition, or by ap- 
peal to some criterion against which the index 
can be validated. Projective techniques com- 
monly have received acceptance on a priori 
grounds—mainly, it is to be feared, as a result 
of uncritical analysis of the inhibiting poten- 
tialities of the test situation—or their validity 
has been tested against other behavioral 
measures which are themselves accepted on a 
priori grounds, 
With an action and learning theory, the 
Measurement problem is somewhat different. 
Measurement in this case means discovering 
the strength of specifically defined aggressive 
actions in Specifically defined stimulating con- 
ditions. It is assumed that the measured 
“strength” or “amount” is a product of all 
past direct reinforcements and inhibitions 
plus all relevant stimulus and response gen- 
eralizations of both excitation and inhibition, 
A measure is evaluated by three criteria; agree- 
ment by theorists that the measure is relevant 
to their interests and is not grossly mis- 
labeled, provision of a consistent response 
value under standard conditions (i.e., adequate 
reliability), and lawful relationships to in- 
dependently measured antecedent learning 
conditions or coincident stimulating condi- 
tions, Most measures of aggression, both 
projective and otherwise, meet the first 
criterion—if the theorists collected for judging 
are not too heterogeneous a lot. Reliability is 
more difficult. Projective methods, Particularly 
those that are susceptible to the influence of 
permissiveness in their administration, do not 
usually provide high coefficients on a test-re- 
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test basis, though odd-even reliability may be 
satisfactory. But the most important criterion. 
is the third. If stable and theoretically mean- 
ingful relationships can be found between a 
given measure of aggression and some past 
experience or concurrent situation, the measure? 
itself must have adequate reliability even” 
though this cannot be demonstrated by formal | 
methods of evaluation. Single acts that cannot | 
be repeated (e.g, a lynching), and acts the | 
potentiality for which suffers modification | 
simply as a result of their own performance 
(eg., doll play aggression), provide special 
difficulty in this respect. Quite a number of | 
them have nevertheless provided useful 
antecedent-consequent relationships. l 

To return, now, to the question of how to | 
measure aggressiveness in sixth grade children, 
There are many types of aggressive actions 
and many instigating situations that evoke 
them. The majority of these acts are im- | 
possible to measure directly; overt aggression 
toward mother in the absence of the father, | 
self-aggression when alone, spontaneous fan- 
tasy aggression toward symbolic authority | 
figures, etc., do not lend themselves readily to 
measurement, reliable or otherwise, even 
though they might be excellent choices from 
the standpoint of the first and third criteria, 
What does seem practical, however, is the use | 
of standardized forms of self-report of atti- 
tudes, and the securing of reports of overt | 
behavior from outside observers, For the 
latter, there are three main sources that would 
be useful to have—parents, teacher, and 
peers—because of their systematically different. 
stimulus significance to a child. The procedural 
limitations of the present study led us to seek 
only the teacher observations, and they proved 
useless with respect to aggression. For self- - 
Teports of attitudes, scales were constructed to | 
Measure five aspects of aggression, and they | 
proved useful. I 


Aggression Scales 


These scales were composed of items in the | 
form of declarative sentences with which the | 
subject could express his agreement or dis- 
agreement on a five-point scale ranging from |. 
“strongly agree” through “not sure" to $€ 
“strongly disagree.” The content of each item | 
was designed to express one type of aggression 
(or its rejection), The presumption was that 
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the reading of a statement would arouse an 
imaginal representation of the act or feeling 
described, and that marking the scale would 
be, by response generalization, an affirmation 
or rejection of that representation, A score of 
5 was given to the “strongly agree" or 
"strongly disagree" end of the five-point scale 
that represented acceptance of the type of 
aggressive action or feeling portrayed by that 
item, The next position received a 4, and so on. 
A total score for each of the five scales was 
obtained by cumulating these individual item 
scores for all the items belonging to a given 
aggression scale. The five scales were as 
follows: 

Aggression. anxiety. This scale measured 
feelings of fear, discomfort, and dislike of ag- 
gression. The aggressive stimulus to the arousal 
of such feelings was placed, in some items, in 
the person himself (“It upsets me to think 
some thoughtless word or crack of mine might 
hurt someone’s feelings”) and sometimes out- 
side the self (“It makes me uncomfortable to 
See two of my friends fighting”). These external 
Stimulus conditions ranged from the personal 
or social, as in the last example, to relatively 
impersonal or symbolic external events (“If 
Someone gets hurt in an auto accident, I 
usually try to get a good view of what hap- 
Pened”), The signs of the anxiety in response 
to aggressive stimulation included references 
to avoidance behavior, somatic symptoms, 
guilty feelings, expectations of injury or 
trouble, and the use of descriptive words in- 
dicative of strong negative feelings (‘vicious 
crimes"), 

Projected aggression. This scale measured 
the tendency to attribute aggression to sources 
Outside the self. The items included a range 
of objects of hostility, including the self, The 
genis of aggression ranged from specific 
(“Big dogs are likely to be dangerous”) to 
Very general (“Sometimes I feel that there are 
So many bad people in the world that even 
my own town is about as dangerous to live 
m as à real jungle"). The agents to whom 
aggression was attributed included people, an- 
NM and natural forces. About half the 
items involved aggression toward the self from 
other Sources, and the other half reflected ag- 
8tession toward other objects. Of those which 
reflected aggression toward the self, some in- 
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volved derogation, and some represented overt 
aggression. 

Self-aggression. All items referred to injury 
or punishment to the self. Two described 
impulses toward suicide, two related to self- 
punishment from other sources than the self, 
and one referred to accidents to the self. 

Prosocial aggression. The dimensions repre- 
sented by this scale and the next one were first 
described by Johnson (1951) in her study of 
doll play aggression in 5- and 8-year-old 
children. Prosocial aggression is aggression 
used in a socially approved way for purposes 
that are acceptable to the moral standards of 
the group. The items referred to law enforce- 
ment, strictness of control, punishment for 
tule breaking, and insistence on appropriate 
rules about aggression (“‘When a person has 
broken an important rule, he should definitely 
be punished for it"). A high score on this 
scale is a positively aggressive score, but the 
form of aggression is in favor of socially ac- 
ceptable controls and disciplines. In its ex- 
treme form, prosocial aggression probably 
represents “moral righteousness.” However, a 
low score could be the product of either one 
of two characteristics in the person: a low 
level of aggressive reactivity, or a tendency to 
be aggressive in unacceptable ways, with re- 
sistance to authority or the rules of the group. 
Four of the eight items that composed the 
scale would be influenced by this rejection of - 
authority, and four would not be. In the 
absence of other information, then, a low 
score on this scale cannot be clearly inter- 
preted. A high score does clearly mean a high 
aggressiveness of this socially accepted sort. 

Antisocial aggression. This dimension con- 
trasts with the previous one in the sense that 
the aggressions referred to here are ones that 
are normally unacceptable socially in the 
formal pattern of our culture. Some items in- 
cluded a reference to the positive desirability 
of powerful aggressive motivation, others to 
the acceptability of ordinarily disapproved 
forms of aggression, and still others to the 
“naturalness” of aggression as a form of social 
interaction (Sometimes an actual fight is the 
only way to settle an argument"). In terms of 
our contemporary middle class American 
culture, antisocial aggression represents a much 
lower level of socialization than does prosocial 
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aggression. A high score may be interpreted 
as indicating a strongly aggressive person 
without social control. A low score may de- 
scribe a person without much aggressive reac- 
tivity, or one whose aggression takes other 
forms, such as the prosocial. 


Scale Construction and Item Validation 


Seventy-three items were prepared for 
initial testing. These were divided unevenly 
among six scales, the number ranging from 
6 for self-aggression to 18 each for aggression 
anxiety and projected aggression. Twenty 
buffer items were constructed also; these 
related to such nonaggressive matters as 
School activities, career choices, eating, etc. 

"There were two item validation series, both 
performed with sixth grade classes in two 
school districts in the San Francisco Bay area. 
These classes were about equally composed of 
boys and girls, and had a wide socioeconomic 
range. For the first testing, the items from 
the first three scales (plus the ultimately re- 
jected "attenuation" items) were combined 
with the buffer items, in an alternation order, 
and presented to two classes containing 58 
children; the last two scales were handled in 
the same way, using two other classes of the 
same size, 

A priori scoring of the items provided a 

. total score on each scale for each child tested 
with that scale. Odd-even reliabilities were low 
but promising. A rough item validation was 
performed by dividing the groups in half on 
the basis of total score for each scale and count- 
ing the frequency with which each item was 
scored in the appropriate direction by the 
upper and lower half of the group. Several 
items proved to be nondiscriminating at the 
-10 level and were discarded in preparing the 
test forms for the second validation series. 

For the second series a substantial change 
was made in the form of the test blank. The 
retained items for the aggression scales were 
combined with the 20 buffer items, which 
were introduced among them in such a way as 


* An effort was made to construct a scale for “at- 
tenuated aggression,” but this conceptual dimension 
proved to have inadequate construct validity and the 
scale had poor internal consistency. The effort was dis- 
continued, therefore, though the items of the hoped-for 
scale were retained in the test blank in order to secure 
data which could be analyzed for other purposes. 
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to prevent too severe piling up of negative 
emotions by the repeated reference to aggres- 
sion. The items for the various aggression 
scales were distributed among one another in 
order to avoid establishing response sets on any 
one dimension, Attention was also given to 
randomizing positively toned and negatively 
toned items for which the high aggression 
answer would be “disagree” and “agree,” 
respectively. 

This test blank, containing a total of 80 
items (including the 12 ill-fated attenuated 
aggression items), was presented to two other 
classes of sixth grade children with a total V 
of 57. Odd-even reliabilities were substantially 
improved. Item validation was again per- 
formed, using for each scale, separately, the 
upper and lower half of the group on total 
score. Items that failed to be discriminative 
at the .05 level were discarded for the final 
form of the test, Reliability coefficients for 
the final form are reported in Table 2. 


Collection of Data 


For the measures taken in Newton and 
Watertown, the test blanks were lithographed 
in the same format as had been used for the 
second validation series.” Various test in- 
struments for the follow-up study were given 
to whole classes at a time on separate days. 
The aggression scales were the third such 
presentation. The first two had also been self- 
administering scales; they had been presented 
by research assistants who were introduced 
by the teachers as being from Harvard, Com- 
plete confidentiality of the test blanks had 
been assured, with emphasis that the teacher 
would not see them. For the aggression scales, 
the presentation was made by the teacher 
but with similar emphasis on confidentiality. 
The children placed their own blanks in an 
envelope which was then sealed and taken to 
the principal’s office. 

7A keyed copy of the test blank, together with à 
table showing proportion of each sex in the 1958 Class- 
mates group who scored at each of the five points on 
each of the 60 items, and a complete table of inter-item 
correlations calculated separately by sex has been de- 
posited with the American Documentation Institute. 
Order Document No. 6870 from ADI Auxiliary 
Publications Project, Photoduplication Service, Library 
of Congress; Washington 25, D. C., remitting in ad- 
vance $2.25 for microfilm or $5.00 for photocopies. 


Make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


: 
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TABLE 2 


Opp-EvEN RELIABILITIES OF THE FIVE AGGRESSION 
Scares: 1958 CLASSMATES 


(Sexes combined) 


Pearson ¢ 
Num- | Num- 
Scale ber of | ber of 
items | cases | Uncor- | Cor- 
rected | rected 
Aggression anxiety 12 | 363 | .44 |.61 
Projected aggression 14 | 361 | .44 | .61 
Self-aggression 5 |348 |.08 | .15 
Prosocial aggression 8 | 364 | .46 | .63 
Antisocial aggression 9 | 364 |.49 | .64 


Reliability 

The 1958 Classmates (i.e., the sixth graders 
who participated in this testing but were mot 
members of the follow-up sample) have been 
used as a group on which to determine the 
odd-even reliabilities of the five scales. In 
Table 2 are shown the numbers of items re- 
maining in each scale after the second valida- 
tion series, the number of children on whose 
papers reliability scores could be calculated, 
and both the uncorrected and corrected 
(Spearman-Brown) coefficients of reliability. 

The corrected reliability coefficients for 
aggression anxiety, and projected, prosocial, 
and antisocial aggression are all .6 or better. 
That for self-aggression is significant at just 
the = .01 level, which is scarcely promising 
for a reliability coefficient! Furthermore, the 
10 correlations among the five items, for the 
174 classmates boys, range from —.12 to 
+.31, with a median of +.02; for the 170 
girls, the range is —.14 to +.28, with a 
median of 4-.12. Nevertheless, as will be seen 
later, the boys’ scores on this scale appear to 
have certain theoretically predictable rela- 
tionships with child rearing antecedents, and 
it may be that, as in the case of certain apti- 
tude tests, the total score has greater stability 
than is implied by the intercorrelations among 
its components. Obviously, later results on 
Selt-aggression must be interpreted with con- 
Siderable caution. 


RESULTS AND DISCUSSION 


Intercorrelations among Scales 

, In Table 3 are shown, separately by sex, the 
Intercorrelations among the five scales for the 
total 1958 group, both follow-up and class- 
mates combined. In Table 4 are shown, for 
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TABLE 3 


INTERCORRELATIONS OF AGGRESSION SCALES: TOTAL 
"TEsrED POPULATION 


(Girls: N = 264; Boys: N = 261) 


Aggression anxiety 
Projected aggression 
Self-aggression 
Prosocial aggression 
Antisocial aggression 


Note.—Girls' figures above the diagonal, boys’ below. 


the follow-up sample, the intercorrelations 
among not only the aggression scales but also 
four measures of child behavior obtained from 
the mother interviews 7 years earlier. The 
intercorrelations among the five aggression 
scales are similar for the follow-up sample and 
the total population except in one instance; 
boys' projected and antisocial aggression are 
correlated .23 in the follow-up sample but 
only .04 in the total population (which in- 
cludes the sample). The theoretical implica- 
tions of these matrices will be discussed in 
connection with results of the antecedent- 
consequent comparisons. 


Sex Differences 

In Table 5 are shown, for the combined 
groups, the means and SDs of the scores on 
the five aggression scales for each sex, to-' 
gether with the significance of the sex differ- 
ences. The girls are significantly higher on 
prosocial aggression and aggression anxiety, 
while the boys are substantialy higher on 
antisocial aggression. There are no sex dif- 
ferences on the other two scales. 

As was mentioned earlier, a test of mas- 
culinity-femininity was also administered to 
the children. It was constructed from the 38 
items of the Fe scale in Gough's CPI (1957). 
That scale was revised and revalidated for 
the present study to make it more suitable for 
this younger age group. The scale as used had 
quite satisfactory odd-even reliability, and 
intrasex validity of the 33 individual items 
retained from the original form was uni- 
formly acceptable, as was the intersex validity 
of all but four items. 

In the present connection, it is interesting 
to note that the correlations within each sex 
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TABLE 4 
TNTERCORRELATIONS OF AGGRESSION SCALES AND FÓUR AGE 5 MEASURES OF CHILD BEHAVIOR: FOLLOW-UP SAMPLE 
(Girls: N = 84; Boys: N = 76) 


Dependency: age 5 
Conscience: age 5 
Bed-wetting: age 5 


Girls 
Boys Scale 1 2 

Aggression anxiety 1 
Projected aggression 2 
Self-aggression 3 
Prosocial aggression 4 
Antisocial aggression 5 
Aggression: age 5 ? 

8 

9 


Note.—Girls' figures above the diagonal, boys’ below. 


between femininity and both prosocial ag- 
gression and aggression anxiety are signifi- 
cantly positive, and those between femininity 
and antisocial aggression are significantly 
negative. Thus, the directions of differences 
in means between sexes are similarly reflected in 
directions of correlations within each sex. 

These findings suggest that high prosocial 
aggression is a feminine quality, and high 
antisocial aggression is a masculine one, 
Coupled with the feminine reluctance to ex- 
press this latter type of attitude is a higher 
level of aggression anxiety. 

Comment. These sex differences in kinds of 
aggressive feelings are an objectification of a 
rather common cultural stereotype. Girls are 
more fearful, more passive, more conforming 
to social norms and proprieties; boys are more 
violent, rougher, and more destructive. It is 
impossible to tell to what extent our measuring 
instrument reflects an expression of inner 
feelings and to what extent it provides op- 
portunity for the acquiescent expression of 
socially appropriate attitudes. Doubtless both 
factors influence the responses children make 
in such a situation, On the one hand, as our 
theory of measurement suggests, the question- 
naire items probably evoke imagery and 
feelings that are stable and relatively long- 
enduring; but on the other, one must recognize 
that the very setting in which the scales are 
presented contains stimuli appropriate for 
eliciting stereotyped reactions. The children 
are seated among their peers of both Sexes; 
they are in the school room; they are faced 
and instructed by adults. Whatever attitudinal 
expressions have become attached to these 


TABLE 5 


Sex DIFFERENCES ON THE FIVE AGGRESSION SCALES: 
Tora TESTED POPULATION 
(Girls: N = 264; Boys: N = 261) 


Girls Boys pe 
Scale | fas | CR 

M SD| M SD |ences 
Aggression anxiety 6.11 | 1.49 | 4.82 | 1.63 | 1.29 | 9.46 
Frojected aggression | 4.51 | 1.87 | 4.53 | 1.90 | .02| .123 
Self-aggression 4.13 | 1.65 | 4.27 | 1.61 | .14 | .986 
Prosocial aggression 5.46 | 1.48 | 4.81 | 1.74 | .65 | 4.61 
Antisocial aggression | 4.75 | 1.66 | 6.09 | 1.82 | 1.34 | 8.80 


Note,—Figures given here are based on a scale of 1-9 that 
covered the full range of obtained scores on each test. 


stimulus conditions, whether “stereotypes” or 
not, are elicited in some degree. 

This distinction between feelings and 
stereotypic attitudes may or may not be im- 
portant for an understanding of the structure 
of aggression, But since our present main 
interest is in the effects of early experience on 
these behaviors, the issue can be held in 
abeyance while we discover whether there are, 
in fact, predictable (or detectable) relations 
between these measures and the hypothesized 
antecedents. If so, then the question that 
needs to be raised—as with any measure—is 
as to how widely these particular measures 
may be generalized to other instances of ag- 
gressive behavior. The remainder of the present 
study, however, is devoted to the antecedent- 
consequent problem. : 


Permissiveness and Punitiveness 


The effect of maternal punitiveness on 
Preschool children’s aggressive behavior has 
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been shown in two previous studies, In the 
earlier (Sears, Whiting, Nowlis, & Sears, 
1953), the mothers of 40 3- and 4-year-olds 
were interviewed with respect to their control 
of aggression. Ratings of the severity of their 
customary punishments were made from 
transcribed interviews. The amount of ag- 
gressive behavior shown by the children in a 
preschool was measured both by teacher 
ratings and by direct observation. For boys, 
the relationship between amount of aggression 
shown and the severity of maternal punitive- 
ness was clearly positive. For girls, however, 
the relationship was curvilinear, both the 
most and least severely punished girls being 
less aggressive than those suffering an inter- 
mediate amount of punishment. 

An hypothesis was suggested that punish- 
ment serves as a form of frustration and hence 
increases the total instigation to aggression, 
but that when punishment becomes suffi- 
ciently severe, it inhibits the specific actions 
punished; in such cases, the increased ag- 
gressive instigation would be manifested only 
in forms of aggressive activity different enough 
from those punished not to suffer from in- 
hibition by means of stimulus or response 
generalization. To account for the sex dif- 
ference in the obtained data, it was further 
hypothesized that although the interviews 
with boys’ and girls’ mothers did not reveal 
any average difference in severity of punish- 
ment, the girls, by identification, felt more 
punished than did the presumably less highly 
identified boys, Support for the initial hypothe- 
Sis was gained from doll play, in which girls 
as well as boys showed a direct relationship 
between frequency of aggressive acts and the 
Severity of their mothers’ punishment. The 
data did not permit a test of the identification 
hypothesis, 

<The original study of the present subjects 


(Sears et al., 1957), made when they were of 


Kindergarten age, likewise found a positive 
telationship between child aggression and 
maternal punishment. In this instance, there 
Was no independent measure of the children’s 
aggression; it was rated from the same in- 
térview as the measure of severity of punish- 
Ment for aggression expressed toward the 
Parents, and referred to aggression in the home 
only. However, with these data, there was no 


evidence of a curvilinear relationship in the 
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girls, and we were forced to conclude either 
that the previously noted curvilinear rela- 
tionship was a sampling error, or that the 
inhibitory influence of severe punishment was 
evident only in the school and not in the home. 
Theoretical reasons for the latter interpreta- 
tion are not immediately obvious. 

A second maternal variable shown to be 
positively related to aggression in the home, 
in this same study, was permissiveness for 
expression of aggression toward the parents. 
Permissiveness was defined as an attitude of 
acceptance and expectation that a child will 
want to be aggressive. Although permissive- 
ness and punishment were negatively cor- 
related with one another, both were positively 
correlated with child's aggression in the home, 

Aggression. at age 5. These findings from the 
total original population are accurately rep- 
resented in the follow-up sample. In Table 6 
are shown the correlations, separately by sex 
of child, between 9 available child measures, 
including the five aggression scales taken at 
age 12 and 4 of the measures taken at age 5 
(dependency, aggression, conscience, bed- 
wetting), and 12 of the measures of maternal 
attitude or behavior obtained from the inter- 
views. Attention is called to the r's between 
the age 5 measure of aggression and the 
measures of permissiveness and punishment 
for aggression toward the parents. In the 
original data on 379 cases, aggression at age 5 
(III, 29) correlated with permissiveness for 
expression of aggression toward parents (i, 
30) (boys) .20 and (girls) .24, and with punish- 
ment (III, 31) (boys) -14 and (girls) 18. In 
the follow-up sample, the figures are similar.* 


V " 1958) factor analysis of the origi 
is of Milton's actor m 
rk for an analysis of the antecedents 


performed by the children at 
age 5. The permissiveness measure was constructed 


summing f permissiveness scales: aggression 
2. ys care of house and 


the 
rrela! 1 
a are given in Tables 6, 8, and 10, but it 


must be remembered that they are not independent of 
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Child rearing variable 
> 
ue 2 El 8 83 E 
fa g 86 | xl 2 E 
B B z - g a * = E 
HHERIEIEDRERMBIETEMESEREE 
B HI T i8 m ge| = |$] = 
a| ES 5 |333| $3 i zz 
EU AT ES [E38] ES BR ER EET s s E 
IH, |XVII, Xv, III, XVIIL| II, Ilt, III, III, III, III, 
Child behavior measures 30 5 5 31 6 47 36 37 53 15 


Aggression anxiet; 
Projected aggression 
Self-aggression 
Prosocial aggression 
Antisocial aggression 
Aggression: age 5 
Dependency: age 5 
Conscience: age 5 
Bed-wetting: age 5 


Aggression anxiety 
Projected aggression 
Self-aggression 
Prosocial aggression 
Antisocial aggression 
Aggression: age 5 
Dependency: age 5 
Conscience: age 5 
Bed-wetting: age 5 


Note.—Below each named scale is given the 
1957), Significance levels for girls: when r = 21, 
when r = 29, p = 01, 


tecedent-consequent 
all are in the correct 


code number from Appendix B of Patterns 
» 2 = 05; when r = 28, $ = .01. Significa: 


0j 


original Patterns 


obtained on girls in the Iowa study (Sears et 1 
al., 1953)—is that 


response. As will b, 
influence is clearly detectable by age 12. 


of Child Rearing (Sears, Maccoby, & Levin, j 
nce levels for boys: when r = .23, p = .08; 


study comparable to that 


of inhibitor of the punished 
seen below, this inhibitory 


sion anxiety and self-aggression, and they will not be 


discussed elsewhere. 


es- 


Anlisocial aggression at age 12. The fivel 
aggression scales provide a more differentiated 
measure of the children’s later aggression than 
did the single interview rating at age 5. Un- 
fortunately, there is no measure directly 
comparable to that rating, ile. a measure of 
aggression in the home. Probably the one 
scale most comparable to it, however, is that 
for antisocial aggression. The early rating 
attempted to measure the amount of aggres- 
sion the child showed in the home, mainly that Ay 
directed at the parents, Since most of the | 
mothers disapproved of such behavior in some 
degree, it must be considered a major early 
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form of unsocialized—or antisocial—aggres- 
sion. The items in the antisocial scale did not 
refer to the same kinds of aggressive acts, nor 
to the same objects of aggression, but they 
were meant to be representative of unsocialized 
and unacceptable forms of aggression at age 
12. None of them included a reference to 
parents or the home, so the similarity is but 
partial. And, of course, there is the radical 
difference between mother’s report and self- 
report. 

One would expect that antisocial aggression 
would be negatively correlated with those 
scales most representative of more highly 
socialized and acceptable attitudes toward 
aggression. Among the present dimensions, 
these would be prosocial aggression and ag- 
gression anxiety. In Table 3, which shows the 
intercorrelations obtained from the entire 
tested population, this expectation can be 
seen to be fulfilled; for both sexes the 7's are 
negative, although that between antisocial 
and prosocial in girls is significant at only the 
02 level. If we can assume that the total tested 
Population gives the more reliable measure, 
this low relation is apparently the product of a 
sampling accident within the smaller follow-up 
sample, which contributes to the total tested 
Population; in that sample, the relationship 
ìn girls disappears altogether, although the 
other three coefficients are approximately the 
same as for the larger population (cf. Table 4). 

A second expectation might be that the 
Tating at age 5 would be positively related to 
the antisocial score at age 12. This rests on the 
Presumption of a continuity of both the child’s 
habitual reactions and the mother's methods 
of child rearing. Evidently this reasoning is not 
Sound, for in Table 4 it may be seen that for 
the boys the r is —.03 and for the girls, —.08; 
1n other words, effectively zero for both. The 
Implications of this discontinuity from age 5 
to age 12 will be considered in a later section. 

The relation of permissiveness and punitive- 
Ness to all the aggression scales can be seen in 
Table 6. For antisocial, punishment for aggres- 
Slon to parents is negatively related in both 
Sexes, though neither of the coefficients reaches 
the 05 level of significance. This is the opposite 
rection from the relations found between 

ese same measures and the rating of ag- 
gression at age 5, If it can be presumed that 
these two child measures are tapping some- 
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what similar instigation-action sequences, 
then we may suppose that, during the interven- 
ing years, the inhibitory effect of the punish- 
ment had an opportunity to be influential; no 
longer was the punishment for aggression 
simply a goad to further action—it reduced the 
tendency to express the punished forms of 
behavior. 

The zero correlations between age 5 ratings 
and antisocial, however, show that the shift 
was not a simple unidirectional matter. Some 
children suffered inhibition—and reduced ag- 
gression—while others did not. Likewise some 
who had been low at age 5 remained low, some 
became higher at age 12. The reasons for this 
may involve some constitutional quality in 
the children, in part, or what is more likely, 
some unmeasured change in the mother's 
child rearing behavior. As will be shown below 
in the section on Change from Age 5 to Age 
12, however, there were suggestive differences 
in the original rearing patterns that are con- 
sistently associated with certain of these 
variations. It may be said at once that the 
severity of punishment itself played only a 
limited role. 

The contrast, at the two ages, in the relation 
of severity of punishment for aggression 
toward parents is presented graphically in 
Figure 1. The abscissa shows three degrees of 
punishment (III, 31) as measured from the 
mother interviews taken when the children 
were age 5. The age 5 measure of aggression 
was from the same interviews; the age 12 
measure is the antisocial aggression scale, As 
would be expected from the correlations re- 
ported in Table 6, the age 5 relationship (¢ 
between low and high groups) in girls is signifi- 
cant at <.01, while in the boys it is nonsigni- 
ficant. At age 12, the decrease for boys is 
significant at the .08 level, while the girls show 
no variations at all. The difference in trend 
between the two ages is significant for both 
sexes; the difference between the two T's for 
each is at the .05 level. 

A relation of permissiveness to antisocial 
aggression is suggested only in the boys, how- 
ever (+.18). It may be supposed that either 
the mother’s attitude of expecting and tolerat- 
ing aggression carried over its influence from 
the early years, or that mothers tended to be 
somewhat consistent in their levels of permis- 
siveness as the boys matured. There is no 
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HIGH 
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evident relationship between permissiveness 
and antisocial aggression in girls. The dis- 
tribution of girls’ scores on their mothers’ 
Permissiveness for aggression toward parents 
ls severely skewed, however, and the y is 
probably an inappropriate measure of this re- 
lationship, In fact, the seven most permissively 
` treated girls were all above the median on the 
„antisocial aggression scale. 

These various relationships may be sum- 
marized as follows: for both Sexes, high punish- 
ment for aggression in the first 5 years of life 
Was associated with a high level of reported 
aggression in the home, but at age 12 it was 
associated with a low level of antisocial ag- 
gression. In boys, at both ages, high permis- 
siveness was associated with high aggression; 
in girls, this effect was clearly evident only at 
the earlier age, with a mere suspicion of it at 
the later. 

In our earlier report (Sears et al., 1957, p. 
401), it was shown that mothers of girls were 
already less permissive of aggression, when the 
children were but 5 years old, than were the 
mothers of boys, One cannot but wonder 
whether the stricter attitude of control toward 

girls may not have increased even more during 
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the middle childhood years, and produced an 
essentially homogeneous level of permissive- 
ness—or at least an even more severely 
truncated distribution. If this were the case, 
the lack of correlation between the earlier 
measure of the mothers’ attitudes and their 
daughters’ 12-year-old aggressive responses 
would be quite understandable on purely 
statistical grounds. 

Aggression anxiety and prosocial aggression 
at age 12. Three possible effects of punishment 
have already been mentioned—instigation to 
aggression by frustration, and by conflict 
drive, and inhibition of the punished response. 
A fourth must also be considered—the estab- 
lishment of anxiety as a response to whatever 
instigators are present when punishment 
occurs. In the present context, the anxiety 
with which we are concerned is elicited by the 
same stimuli as aggressive responses, plus 
the response produced cues arising from overt 
or imaginal or affective aggressive actions. 
Hence with the present instruments we would 
expect a positive relationship to develop 
between punishment for aggression to parents 
and aggression anxiety. Table 6 shows that 
the expectation is fulfilled in the girls (p < 
-05), but not in the boys. Thus far, then, one 
can only say that the theoretical prediction is 
but half-supported, 

The relation of punishment to prosocial ag- 
Eression is the same as to aggression anxiety. 
High punishment is associated with high pro- 
Social aggression in girls, but is unrelated in 
boys. This positive effect of punishment is 
similar to that found in girls for aggression in 
the home at age 5; at that time, too, this re- 
lationship was nonsignificant in boys. 

So, what sifts out of these rather complex 
comparisons is that severe early punishment 
Íor aggression tended to reduce antisocial 
aggression (a little more clearly so in boys) 
and to increase prosocial aggression (but only 
in girls). Nevertheless, in both sexes, prosocial 
aggression is as unrelated to earlier aggression 
as is antisocial aggression. Since both types of 
aggression are Strongly related to aggression 
anxiety in both Sexes—prosocial positively and 
antisocial negatively—it comes as a shock to 
discover that the earlier aggression is positively 
related to aggression anxiety in girls and 
negatively in boys, The implications of this 
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finding are considered below in the section on 
Aggression Anxiety. 

Since the various attitude scale correlations 
referred to in this discussion of the follow-up 
sample are approximately replicated in the 
total tested population (with the single ex- 
ception of the size of the negative relation 
between antisocial and prosocial), we are 
forced to the conclusion that these sex dif- 
ferences surrounding the early measure of 
aggression and the aggression anxiety measure 
have something to do with changes that oc- 
curred between ages 5 and 12, and with dif- 
ferences in the sources of aggression anxiety 
for the two sexes. 

Mention must be made also of the role of 
permissiveness in connection with aggression 
anxiety and prosocial aggression. Both are re- 
lated negatively to permissiveness, though 
significantly so only in the girls. Since permis- 
Slveness and punishment are rather closely 
related to one another (see Table 10), there is 
some difficulty in being certain how inde- 
pendent the contributions of the two measures 
are, 

Projected. and. self-aggression. These two re- 
maining aggression scales are significantly 
associated with each other and with aggression 
anxiety and prosocial aggression in boys; the 
Same is true for girls, except that self-aggres- 
Sion seems unrelated to aggression anxiety and 
Prosocial (cf. Tables 3 and 4). 

The relationship to permissiveness and 
punishment is not entirely clear. In girls, pro- 
Jected aggression acts like part of the anxiety 
cluster with respect to permissiveness; non- 
permissiveness is associated with high projec- 
tion. In boys, the relation is nonsignificantly 
Opposite. For both, punishment is nonsignifi- 
cantly negative. The relations for self-aggres- 
Sion can best be considered later; an under- 
Standing of their antecedents will require 
analysis of other variables. 

Conclusions. When the children’s aggression 
at home was measured in the original mother 
interviews, there was evidence that both 
Permissiveness and punishment for aggression 
PM to facilitate such behavior. Permissive- 
en appeared to serve as both an instigator 
had i Symbolic reinforcer, while punishment 
n. ERIT effect as a further frustra- 
MM ence a goad to more aggression. The 

Trelations that displayed these relations in 
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the original group were significant but small; 
because of the reduced size of the follow-up 
sample, although the sizes of the correlations 
are similar, they are no longer statistically 
significant. However, the directions of the 
relationships in this smaller group are the 
same. 

The more differentiated measures at age 12 
appear to illustrate a rather radical shift in 
the role of punishment. Formerly an instigator 
to antisocial aggression, it now seems to have 
imposed an inhibitory influence on that be- 
havior and to have created aggression anxiety. 
The interesting thing about the various scales 
is that the three nonantisocial scales—which 
are positively related to anxiety—seem, if 
anything, to enjoy a facilitating effect from 
punishment. It looks as though the conflict- 
drive effect of punishment were still operating, 
producing higher prosocial and self-aggression 
while the inhibitory influence reduced the 
antisocial aggression. Projective aggression, 
which in terms of its correlations with the other 
scales appears to be a part of both the anxiety 
cluster and the antisocial, is influenced by 
permissiveness in the same way as is aggres- 
sion anxiety, and by punishment in the same 
way as is antisocial aggression. 

Having only two chronological points of 
measurement—ages 5 and 12—one falls easily 
into a manner of speaking that implies the 
operation of one effect of punishment (facili- 
tating) at one age and another effect (in-. 
hibiting) at the other. This is improper, of 
course. One can only suppose that both effects 
of punishment are always operative. The 
fact that at the earlier age the two interact 
in such a way as to allow the facilitative effect 
to be revealed in group averages, and at à 
later age the inhibitory effect, probably means 
that with the particular levels of punishment, 
permissiveness and other relevant variables 
occurring in this culture, the facilitating effects 
are more pronounced in early stages of social- 
ization and the inhibitory ones do not appear 
so generally until later. Indeed, the concept 
of conflict-drive itself (Sears et al., 1953) im- 
plies the simultaneous influence of both prop- 
erties. 

Tn summary, the influence of these two child 
rearing variables may be hypothesized to 


have been as follows: high punishment for. 


aggression reduced antisocial aggression, but 


à 
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increased aggression anxiety and prosocial 
aggression, and high permissiveness increased 
antisocial aggression, but reduced the other 
two. Only 2 of the 12 correlations on which 
these statements are based reach the .05 
level of significance, but the remaining 10 
nonsignificant r's are all in the right direction, 
and 5 of them are at the .10 level of signifi- 
cance. The evidence overall, in other words, 
is of just borderline significance, and though 
these hypotheses are congruent with the 
theoretical expectations concerning the in- 
fluence of punishment and permissiveness, 
the conclusions must be accepted with con- 
siderable caution, 


Alternative Responses: The Change from Age 
5 to Age 12 


One of the thorniest problems in aggression 
theory is that of alternative responses (cf. 
Berkowitz, 1958). It is really two problems, 
although these are often fused—and confused 
—into one. The first is catharsis. Both the in- 
stinct view and the frustration-drive concept 
assume a certain quantum of internal instiga- 
tion that requires expression. If overt aggres- 
sion is prohibited, then the impulse will be 
displayed in nonovert or fantasy or some 
other form of aggression not expected to elicit 
punishment (or counteraggression). This “dis- 
charge” (Freud, 1922), or “goal response” 
(Dollard et al., 1939), serves to reduce the 
strength of the residual impulse, thus, de- 
creasing the probability of further aggressive 
behavior immediately thereafter. The term 
caiharsis refers to this letting loose of drive 
instigated behavior, the lowering of tension, 
and by analogy to the original use of the term 
in connection with hysterical symptoms, the 
reduction of alternative expressions. An 
instinct theory that made no room for indi- 
vidual differences in the strength of the im- 
pulse would predict a negative correlation 
between the strengths of two temporally 
adjacent aggressive acts, A behavior theory 
(Dollard et al., 1939) that did assume indi- 
vidual differences—on the basis of either con- 
stitutional factors or stimulus input varia- 
tion—would make more equivocal predictions, 
because if the range of individual differences 
were large enough in comparison with the 
size of the cathartic reduction, a correlational 
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measure could be positive. Obviously corre- 
lational methods are inappropriate for this 
problem. Empirical investigation of this hy- 
pothesis has so far led to quite ambiguous 
results, but they need not concern us here, 
for the present data are irrelevant to the 
catharsis aspect of alternative responses. 

The other aspect has to do with traits, or 
habitual forms of aggressive action, Regard- 
less of which of the two foregoing views con- 
cerning internal instigation is chosen, there 
is a clear question as to whether the inhibition 
of one form of aggression is likely to be asso- 
ciated with the stronger development of others. 
The time interval implied in this question is a 
long one—months and years—in contrast with 
that implied in the catharsis hypothesis, 
Whereas catharsis refers to an action-theory 
construct of immediate drive reduction and 
ensuing performance strength, the habitual- 
form question relates to the learning process 
and the factors determining choice of ex- 
pressive form. 

With respect to this issue also, there are two 
Possible positions. They are alternative hy- 
potheses concerning the sources of instigation 
to aggression. One is a drive theory, including 
both the instinct and the frustration concep- 
tions of the drive's origin. The other is a cue 
stimulus theory, The drive theory would sug- 
Best that observed differences among people 
in their choice of forms of expression (e.g. 
antisocial, prosocial, self, etc.) are a function of 
reinforcement and inhibition, but that one 
major source of instigation to any given act is a 
drive. This drive could be constitutional or 
it could be produced by frustration of any act, 
including aggression. Given a frustration, 
then, the drive would be increased in strength 
and would, thus, strengthen whatever responses 
did occur. Concretely, it would be predicted 
that if one form of aggression (e.g., antisocial) 
Were repeatedly punished, and thereby in- 
hibited whenever external or internal stimuli 
aroused it, alternative forms of aggressive 
behavior (eg., Prosocial) that did not meet 
with punishment would be stronger than they 
would be if the first form (antisocial) were not 
inhibited. 

The cue stimulus theory, on the other hand, 
is based on the assumption that particular 
forms of aggression are specifically reinforced 
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or inhibited, and that the strength of any one 
form is causally related to the strength of 
others only by stimulus and response general- 
ization. Punishing one form would tend to 
depress other forms, on the average, but posi- 
tive or negative correlations among different 
forms would be assumed to result primarily 
from similar or different schedules of rein- 
forcement the occurrence of which would be a 
function of consistencies or inconsistencies in 
the actions of reinforcing agents (e.g. 
mothers). To be concrete again, we would 
expect, from this position, that because there 
in a demonstrable consistency of maternal 
handling of different forms of aggression, the 
strengths of alternative forms of aggression 
would tend to be positively correlated regard- 
less of the prolonged inhibition of any one of 
them, 

The data from the present study provide an 
opportunity to make one useful test of these 
two hypotheses. All children express some 
aggression toward their parents, and may be 
presumed to develop a measurable degree of 
habit strength for such changeworthy be- 
havior (Sears et al., 1957). They also learn to 
expect gratification because aggression pro- 
vides a certain control over the parents. 
Nevertheless, infantile forms of aggression 
must be relinquished eventually, and hence 
we may assume that by age 12 all the children, 
m varying degrees, had suffered inhibition 
of their changeworthy forms of aggression. 
That is, since nearly all the mothers had ex- 
Pressed some dislike of aggression when the 

ildren were 5, we can assume a continuing 
Pressure against its expression, even though 
We have no measure of the mothers’ behavior 
during these intervening years. 
are drive theory predicts that children 
S were still highly aggressive at age 5 but 
bs suffered inhibition of this behavior, and 
E: ace it, should show higher degrees 
m ternative forms than children who did not 
oe it. The cue stimulus theory predicts 
" Opposite; i.e., the inhibited group should 

ow lower degrees of the alternative forms. 

These differential predictions can be tested 
en of the mean scores on pro- 
by ee self, and*prosocial aggression obtained 
aes o subgroups of children who did and 

ot, respectively, show the presumed in- 
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hibitory effects of punishment. These are 
children who were all High (H) (upper half of 
the distribution) on aggression at age 5, and 
then were found to be either H or Low (L)— 
same defining basis—on antisocial aggression 
at age 12. 

From a developmental standpoint all these 
children may be considered as having posed a 
problem of control to their parents. They had 
been more severely punished, on average, and 
also treated more permissively than the other 
half of the group of children. We do not know 
how the mothers coped with this problem in 
the intervening years, but we do know that, at 
age 12, 29 boys and 23 girls were in the lower 
half of the distribution on antisocial aggres- 
sion, while 17 of each sex were in the upper 
half. One might say that the mothers had 
coped successfully with the former and un- 
successfully with the latter. 

Children at age 5. There were no differences 
at all between the means of these two groups’ 
reported dependency, aggression in the home, 
or conscience. There was one difference in the 
boys, however, that is relevant to aggression. 
On the mother rating scale (III, 27) called 
“extent to which parent has encouraged child 
to fight back," there was a nonscalar code 
labeled *Problem hasn't come up; child has 
always defended self." Of the 17 High-High 
(HH) boys, 7 were so judged; of the 29 High- 
Low (HL) boys, only 2 were. The difference 
by x? is at p < .01. Only 1 of the 7 HH boys 
was treated as nonpermissively as the 2 HL 
boys with respect to expressing aggression 
toward other children. The boys who did not 
reduce their aggression over these years were 
permitted to—and did—defend "themselves 
against other children by fighting, even at 
age 5. 

P Children at age 12. As is shown in Figure 2, 
the differences in test measures for the boys 
are much greater than for the girls, although 
both are in the same direction. They tend to 
support the drive theory, though not un- 
equivocally. The boys whose aggression had 
decreased (Group HL) were higher, by / 
test, on aggression anxiety (p = .001), 
self-aggression (p = .10), and prosocial aggres- 
sion (p < .001). For girls, the equivalent 
values are p < .05, ns, and .15. There was no 
difference on projective aggression in either 
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Fic. 2. Mean frequency of prosocial and self-aggression in two groups equally high on aggression at age 5 but 


one of 


Sex. These differences, particularly those with 
respect to anxiety and prosocial are what 
might be expected from more severe control 
by both punishment and nonpermissiveness, 
although the anxiety measure is not one of 
the alternative forms we are considering here, 
of course. 

Mothers! behavior when children Were age 5. 
The evidence for any such difference of treat- 
ment in the early years is minimal, however. 
For both sexes, there was Slightly less per- 
missiveness for aggression toward parents and 
toward other children, but none of these four 
differences reached the .05 level of signifi- 
cance, There was no difference in Severity of 
punishment for aggression toward parents, 
and only a slight nonsignificant difference on 
physical punishment, with the HL group 
having suffered the more severely. The latter 
group had also been less strongly encouraged 
to fight back when attacked by other children. 

These nonsignificant differences are all in 
the right direction to support the inference 
that the group of originally aggressive children 
who did drop into the lower half of the dis- 
tribution on antisocial aggression were ones 
whose mothers were initially more controlling 
and less inciting of aggression, But the dif- 
ferences in maternal behavior are nowhere 
near as great as the collateral differences in 

aggression anxiety and Prosocial aggression 
might lead one to expect. 


which (HH) was high on antisocial aggression at age 12 and the other of which was low (HL). 


It is worth adding, perhaps, that none of 
the following maternal variables showed any 
differences whatever between these two groups, 
in either sex: severity of toilet training, 
warmth, use of the four disciplinary tech- 
niques of tangible rewards, praise, deprivation 
of privileges, or withdrawal of love, not follow- 
ing through on threats of punishment, or who 
did the disciplining in the family. 

These findings suggest that the effective 
suppression of an aggressive child's early 
expression of aggression in the home tends to 
increase the strength of prosocial and self- 
aggression in the later years. This conclusion 
is in accord with the drive theory of instigation 
and in opposition to the cue stimulus theory. 
What little information can be gained from 
the mother interview scales suggests, further, 
that Donpermissiveness rather than punish- 
ment was the technique used for controlling 
the aggressive behavior. 

Aggression Anxiety 

In the discussion so far there has been re- 
peated reference to sex differences in the sizes 
of relationships between the permissiveness- 
Punitiveness antecedents and certain of the 
aggression scale consequent measures. These 

ifferences have been glossed over and treated 
as if they were Possibly no more than varia- 
tions in sampling. There are certain differences 


mM connection with aggression anxiety that 
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TABLE 7 
AGGRESSION ANXIETY AND  SELF-AGGRESSION: 
CORRELATIONS WitH CxILtp BEHAVIOR 
VARIABLES MEASURED AT AGE FIVE* 
(Girls: N = 84; Boys: N = 76) 


Aggression | Self- 
anxiety aggression 
Child behavior: age S | Code. 

Girls | Boys | Girls | Boys 
Aggression Til, 29 448 |—.21 | .04 | —.10 
Dependency IH, 10 | .04|-.16| os| .08 
Conscience II, 34 |-.4| .30|-.07 | .00 
Bed-vetting I, 50 | .00|—.17] .24| —.18 
Quarreling with sibling | II, 21 | -04 |—.19 | .00 | —.08 


^ See also Table 4. 


can no longer be ignored, however. The most 
curious ones are illustrated in Table 7, which 
shows a sharply defined sex difference in direc- 
tion of correlation between aggression anxiety 
at age 12 and both aggression and conscience 
at age 5, Both these correlational differences 
are significant at p < .01. 

What is surprising about this finding is that 
there is only one significant sex difference 
among the correlations within either of the 
two sets of child measures made at the re- 
Spective ages. As can be seen in Table 4, the 
SIX correlations between (age 5) aggression, 
dependency, conscience, and bed-wetting are 
in the same direction and of about the same 
Size for the two sexes except for that between 
Conscience and bed-wetting, and the differ- 
ence of .12 between the two nonsignificant 
Correlations of aggression with dependency. 

ikewise, among the five aggression measures 
taken at age 12, only 1 of the 10 possible com- 
parisons is significant at the .05 level, namely, 
that between the two correlations of antisocial 
en Prosocial aggression mentioned earlier; 
or the boys, the 7 is — .36, and for the girls, 
~.03. None of the r's relating aggression 
DEM to the other four aggression measures 
F significantly different for the two sexes. 
“urthermore, aggression anxiety is almost 
identically related (+.34 and +.37) with the 
Ga of femininity obtained, at age 12, 
as the self-administered M-F test derived 
inn Gough's CPI Fe scale. Paralleling this 

trasex evidence of the “feminine” quality of 
cum anxiety is a highly significant 
jue difference in the means on this scale (see 

able 5). 
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Now as to antecedents. The predictable 
effect of punishment as a determiner of ag- 
gression anxiety in girls has already been 
mentioned. It is to be noted, however, that 
only punishment for aggression toward par- 
ents, and the punishment factor score, are so 
related (see Table 8). Physical punishment 
and punishment for dependency are not. 
Neither are the various techniques of dis- 
cipline: praise, use of tangible rewards, dep- 
rivation of privileges, or withdrawal of love. 
Indeed, the last is somewhat negatively re- 
lated (—.17). 

None of the punishment measures shows a 
relation to aggression anxiety in boys except 
withdrawal of love, which surprisingly cor- 
relates +.21. 

Nonpermissiveness, however, is associated 
with aggression anxiety in both sexes (Table 
8), but only with respect to aggression toward 


TABLE 8 


AGGRESSION ANXIETY AND SELF-AGGRESSION: 
CORRELATIONS WrrH CHILD 
REARING VARIABLES* 


(Girls: N = 84; boys: N = 76) 


gression Self- 
Code ES aggression 
Child rearing variable | number 
Girls | Boys | Girls | Boys 
Permissiveness: 
Aggression to parents III, 30|—.20 |—.17 |-.09 | —.06 
Aggression to other | IIE, 28] .27| 14 |—.07 | ~.13 
children 
Aggression to siblings II, 22] .04|—.08 | .11 | —.19 
Dependency to mother XV, 5|—.02| .12|—.08 | —.02 
Factor score XVII, 5|-.13 |-.23 |-.17 | —.15 
Punishment: 
Aggression to parents II, 31 24| .10| .11 .08 
Dependency to mother | XV, 7| -13 |--53 0| .28 
Physical punishment IH, 46| .07 |-.03 | .12 B 
Factor score XVII, 6| .23| .04| .10 +22 
Child encouraged to fight nt, 27| .23| .07 oi | —.15 
back 
Reward for dependency XV, 6|—.18 | .03 |—.20 | —.15 
Withdrawal of love IH, 53|—.17 | .21 |—-40 48 
Use of praise II, 37| .06|—.08 | .03 | —.14 
Use of tangible rewards TIT, 36| .03 | .04] .19 +39 
Deprivation of privileges II, 47|—.06 | .13]| .06 .24 
Severity of toilet training IL, 51 10| .06| .12 27 
Sex role differentiation IH, 20|—.08 |-.06 | .15 10 
Mother warmth IH, 15|—.04 | .09 |—.06 | —.07 
Father warmth IH, 65| .09 |—.15 |—.14 —.17 
Amount of father care- I, 18| .11 |—.12} .02 .02 
taking: infancy 
Amount of father care- TH, 64| .28 |—.04 |-.12 | —.08 
taking: age 5 
"Which parent disciplines III, 66 |—.23 |-.06 |-.10 | —.16 
(father — 1) 
3 See also Table 6. 
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parents (or as measured by the factor score). 
Exactly the opposite relation obtains with 
permissiveness for aggression toward other 
children (excluding siblings); aggression anx- 
iety is slightly but nonsignificantly related to 
this scale in a positive direction for boys and 
significantly at p < .01 level in girls. This 
reversal in the role of permissiveness is the 
key to a possible explanation of the sex dif- 
ference. 

A glance down the boys’ column of Table 8 
shows but two significant 7’s (.05 level or 
better), namely, those with the permissive- 
ness factor score and with withdrawal of 
love. In the girls’ column, however, there are 
Six that reach that level, and three others 
(including withdrawal of love) close to it. 
These mother scales are by no means inde- 
pendent of one another, of course, and the 
actual numbers of significant 7’s are less im- 
portant than the kinds of variables that seem 
to be most connected with aggression anxiety, 

The hypothesis is suggested that aggression 
anxiety has quite different dynamic and 
genetic qualities for boys and girls. In boys, 

it is a part of the conformity syndrome, a 
close relative of conscience, and one of the 
Consequences of relatively high but non- 
punitive pressure toward socialization; it is a 
positive reaction to the same love oriented 
method of discipline and control that suc- 
ceeds at an early age in reducing both de- 
pendency and aggression toward parents and 
quarreling among siblings, It is a feminine 
quality only in the sense that most behavioral 
representations of rapid and effective socializa- 
tion in the preschool years are more charac- 
teristic of girls and women than of boys and 
men. It is established by a general aura, in 
the home, of honpermissiveness for infantile 
and destructive impulse expression; the sanc- 
tioning power when deviant behavior does 
occur comes from the threatened loss of the 
love relationship to the mother, Dynamically, 
it is one of many Tesponses to stimulation that 
arouses affiliation desires and conformity 
behavior, 

In girls, aggression anxiety is more complex 
and has a quite different genesis, Dynamically 
it is part of the syndrome of forced conformity 
and subordinated autonomy in which anxiety 
is evoked by any instigator, internal or ex- 
ternal, that serves to arouse incipient aggres- 
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sive actions or feelings. It is the product of an 
internalized Type II conflict (Sears & Hov- 
land, 1941) in which the positive response is 
a forbidden form of aggression, the negative is 
fear of punishment, and the source of instiga- 
tion is a frustration induced drive. 

This conception is in sharp contrast to that 
about boys, suggested above. The reasoning 
that favors some such difference rests on the 
quantitative reversal of the relationship be- 
tween aggression and aggression anxiety in 
the two sexes, Constitutionally, girls are prob- 
ably less overtly aggressive than boys, and 
certainly the conditions for their rearing are 
Such as to produce more rapid maturing and 
relinquishing of earlier changeworthy be- 
havior, aggressive and otherwise. They evi- 
dently receive fewer external instigators to 
aggression and have fewer response produced 
cues. Girls as a generality, then, should show 
less aggression anxiety than boys if both sexes 
have such responses evoked mainly by inter- 
nal instigations to aggression. On the con- 
trary, girls show much more. 

Genetically, the conditions that establish 
aggression anxiety in girls may be summarized 
under the label of conditions antithetical to 
appropriate sex typing. With the exception of 
nonpermissiveness for aggression to parents 
(which is appropriate), all the larger correla- 
tions in the girls’ column of Table 8 are with 
items of child rearing that either were shown 
originally (Sears et al., 1957, p. 401) to be dif- 
ferent for boys and girls or are related to 
father involvement and to the punishment of 
aggression or dependency. The direction of 
every one of these ^s is opposite to that im- 
plied by the sex difference in rearing. Thus, 
for example, girls are less frequently en- 
couraged to fight back when attacked, but the 
correlation between this scale and aggression 
anxiety is Positive; likewise, girls are more 
commonly disciplined by withdrawal of love, 
but this scale is negatively related to aggres- 
Sion anxiety, The amount of father caretaking 
at age 5 was not different for the two sexes 
in the original population, nor is it signifi- 
cantly so in the follow-up sample, but pro- 
portion of father discipline was; it seems 
reasonable to suppose that greater maternal 
care for girls and paternal participation with 
boys is appropriate for sex typing in each 
case. Both these scales show a positive rela- 
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tionship between father participation and 
aggression anxiety in girls. 

Not all of the sex differences in rearing ex- 
periences described in the original study are 
represented in the list of larger correlations 
with aggression anxiety. Time for completion 
of weaning, warmth to infant, use of maids 
and sitters in infancy, kinds of chores re- 
quired, expectancy of going to college, amount 
of use of praise and amount of physical pun- 
ishment, for example, do not show the rela- 
tion. Neither does sex role differentiation by 
the mother, the scale measuring the degree 
to which the mother conceived that there were 
differences between the sexes that required 
differences in rearing. One might expect that 
those mothers who were not aware of sex 
role differences might treat their daughters 
less appropriately for feminine sex typing 
and, thus, produce more aggression anxiety. 
The r of —.08 is of insufficient size to be said 
to support this expectation, however. 

To summarize, this conception of the dif- 
ferent dynamics and genetics of aggression 
anxiety in the two sexes is congruent with 
the age 5 correlations shown in Table 7. 
There it is seen that, for girls, high aggression 
anxiety is associated with higher aggression 
in the home and lower conscience. There is no 
relation to dependency or quarreling among 
siblings, as there is among the boys. For the 
boys, on the other hand, the anxiety is asso- 
dated with low aggression and high con- 
Science, On the antecedent side (Table 8), 
the different dynamics suggested for the sexes 
are congruent with the more specifically ag- 
gression connected nonpermissiveness in the 
girls’ experience and the more general (factor) 
Donpermissiveness in the boys’, and with the 
dear influence of punishment in the girls" 
and its irrelevance in the boys’ experience. 
The other relevant child rearing dimensions 
that correlate significantly with the anxiety 
Measures in girls are ones which have been 
shown previously to be relevant to sex-typing, 
and the direction of correlation in every in- 
Stance ls such as to associate high aggression 
anxiety with the inappropriate sex-typing 
Practice. The theory also accounts for the 
Sua! aggression anxiety of girls than of 

Oys, even though girls are less aggressive in 

oe overt behavior and fantasy (P. S. Sears, 

51); the explanation of this phenomenon is 
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TABLE 9 
AGGRESSION ANXIETY AND ADULT ROLE: CORRELA- 
TIONS IN TorAL TESTED POPULATION AND 
FOLLOW-UP SAMPLE 


Total nee Follow-up 
Adult role measures A EE 
Girls | Boys| Girls | Boys 
Enforcing rules RALA BAAS 1. $23 
Accepting rule enforcement| .19 | .24 | .10 | .36 
02 


.20 |-.04 | .37 
.28| .23 | .36 
261| 84 | 76 


Giving nurturance 
Adult versus child choice 


2 
Number of cases 264 


not in terms of a greater frequency of inap- 
propriate sex typing experiences in girls, of 
course, but is seen as a consequence of the 
development of internalized Type II conflict. 
One further set of differential relationships 
should also be demonstrable. These have to 
do with maturity of role choice and nurtur- 
ance. One would expect that boys would show 
substantial positive correlations of aggression 
anxiety with the mature and nurturant forms 
of behavior, while the girls would show no 
relationship. Table 9 gives evidence for these 
propositions in the form of correlations be- 
tween aggression anxiety and four measures 
reported elsewhere by Maccoby (1961). 
These measures relate to the positive choice 
of adultike roles in connection with using 
rule enforcement, accepting rule enforcement, 
giving nurlurance, and a general series of items 
having subordinate-superordinate role choice 
comparisons. In the total tested population 
the girls do have significant r's with respect 
to all but the giving nurturance scale, though 
the boys’ r's are slightly larger. In the follow-up 
sample, the sex differences are still larger. 
On giving nurturance, both sets of figures 
show a significant positive relation (p < .01) 
in boys and zero relation in girls, as expected. 
These figures support the expectation so far 
as nurturance is concerned, but do not with 
respect to the other forms of mature behavior. 
Conclusions. It would be an obvious ab- 
surdity to suppose that either of these dy- 
namic or genetic conceptions of aggression 
anxiety is applicable rigidly to one sex only. 
have had their 


Undoubtedly many males 
aggression anxiety somewh: acerbated by 
parental punishment for ag ion o; too 


severe demands for aggre: 
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their relations with other children. The vir- 
tually zero correlations between these dimen- 
sions and boys’ aggression anxiety cannot be 
interpreted as meaning that those variables 
have had no influence for any boy in the group, 
but simply that as a cumulative matter within 
the whole matrix of influences, these are not 
detectable for the group as a whole, Likewise, 
in the female, one must assume that the af- 
filiative conforming motivation created in 
part by nonpermissiveness and by appeal to 
maturity and conscientiousness is also a de- 
terminant of aggression anxiety. The rela- 
tions with the adult role scales favor this 
interpretation, Indeed, one cannot but wonder 
whether both sexes may not about equally 
derive aggression anxiety in this way, but 
that girls have overlaid a more severe degree 
of it by those socializing experiences which, 
in the exciting of aggressive impulses counter 
to girls’ proper nonaggressive sex role, are 
inappropriate to the sex-typing process, 

In any case, the evidence presented here 
Suggests two quite different dynamic and 
genetic forms of aggression anxiety. Whatever 
combinations of them may occur in individual 
cases, within the limits of our sample, one is 
predominant in boys and the other in girls at 
age 12. 


Self-Ageression 


Although the internal consistency of the 
self-aggression scale is virtually nil, the sizes 
and directions of its relationships with the 
other child measures are so consistent, both 
between the follow-up sample and the 1958 
classmates and, with one exception, between 
boys and girls, that its child rearing antece- 
dents seem worth investigation. The one ex- 
ception is in respect to aggression anxiety; 
for boys, these two Scales are positively re- 
lated in both groups (follow-up, r = .25, and 
total tested population, r = .24), while for 
girls the slight Positive relation is nonsig- 
nificant in both groups (respectively, .11 and 
.08). In view of the different dynamics of 
aggression anxiety described in the preceding 
section, this is Scarcely surprising, but does 
Warrant an attempt at interpretation, The 
other relationships of self-aggression are also a 
little smaller for girls than for boys, but not 
significantly so. The r with projected aggres- 

sion is clearly positive in both sexes, while the 
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T's with both prosocial and antisocial aggres- 
sion are effectively zero. 

From a theoretical standpoint (Dollard et 
al., 1939), self-aggression has been considered 
a displacement of an aggressive response from 
the frustrating agent to the self, produced by 
anticipated punishment for direct aggression 
toward the agent, Such an expectancy would 
arise if there had been repeated punishment 
for being aggressive toward the frustrator. 
Since aggression toward the self is intrin- 
sically painful, this would be the direction of 
aggression adopted only if most other objects 
of displaced aggression were also interdicted, 

In the present context, we would expect 
that high self-aggression would be associated 
with various child rearing experiences that 
both heightened frustration. and served to 
prevent the expression of overt aggression 
toward parents, siblings, and other children. 
In Table 8 are shown the relevant correlations 
for a test of this hypothesis. It is apparent at 
once that the boys’ self-aggression is signifi- 
cantly related to more socialization variables 
than the girls’, Indeed, the only significant 
correlation for the latter is with the nonuse of 
withdrawal of love as a disciplinary technique. 

For the boys, however, there is substantial 
Support for the hypothesis. First, we would 
expect any early socialization experiences 
that were strongly frustrating to create high 
instigation to aggression and, at age 5, high 
aggression toward the frustrator. In a pre- 
vious study (Sears et al, 1953), there was 
some evidence for a positive connection be- 
tween severe toilet training and aggression 
in the preschool; in the present follow-up 
sample of boys there had been a nonsignifi- 
cant relation (r = .14) between severity of 
toilet training and aggression in the home. 
Severity of toilet training was correlated .41 
With use of physical punishment, which 
Suggests that the hypothetical conditions 
outlined above were fulfilled—namely, high 
frustration, with some consequent aggression, 
and punishment for expressing aggression. 
As was predicted, there is a significant relation 
(r — .27) between severity of toilet training 
and self-aggression, None of the other aggres- 
sion scales is significantly related to this ante- 
cedent. 

A second source of early frustration was the 
existence of siblings; we may assume that 
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TABLE 10 


INTERCORRELATIONS AMONG TWELVE CHILD REARING MEASURES OBTAINED FROM THE 
MOTHER INTERVIEWS: FOLLOW-UP SAMPLE 


(Girls! mothers: VW = 84; Boys’ mothers: N = 76) 


Girls 
" A à Code 
Child rearing variable number |Item| 1 2 3 4 $1156 7 8 SPANO!) Ont ete 
Peimissiveness: Aggression m, 30} 1 +70 24 —.44 —28 —.30 —04 —.04 19 .0£ 15 —.29 
to parents 
Permissiveness factor XVI, s| 2] .67 64 —40 —.35 —.35 —.13 —.07  .16 -.07 .27 —.36 
Permissiveness for XV, 5| 3]|-05 .4 —.22 —.35 —.09 —.12 .00 05 —.14 .35 —.10 
dependency 
Punishment: Aggression to It, 31] 4 |—.61 —47  .08 162.58 0S IS. 00 —.02 =.37  .18 
parents 
Punishment factor XVIII, 6| 5 |—31 —.22 —.04 ól S0 0 0.00 0.18. 00 07 —.45 18 
Physical punishment II, 46| 6 |—.22 —12.—.04 .34 68 .08 .22 —.05 —.09 —.34  .21 
Deprivation of privileges III, 47 7 |—.07 —.04 .10 .06 .14 .25 335 22 —02 —.10  .20 
Tangible rewards IM, 36| 8 .01 —.17 —.07  .13  .16 .16 .2 .00 —08 .07 .21 
Use of praise II, 37] 9 | .10 —.07 —.13 —.04 —.10 —.12 —.09 —.05 42 446 1 
Withdrawal of love II, 53] 10 | —.19 —.27 —16  .36 3k  .40 34 32 —,20 =.20  .08 
i 

Mother warmth IH, 15| 11 .08  .23 40 —.01 —.24 —.30 —.08 —.06  .15 —.49 =18 
Severity of toilet training I, st 


Note.—Girls’ mothers above the diagonal; boys’ mothers below. 


other children in a family, either older or 
younger, add not only companionship but also 
competition for things, space, and people. 
Here again there is a detectable relationship; 
by ¢ test, only boys were less self-aggressive 
than those with siblings (p = .04). 

A third source of frustration at the earlier 
age was punishment for aggression and de- 
pendency. However, punishment should also 
have inhibited aggression eventually, so that 
on two counts a high level of punitiveness 
Would be expected to be positively related to 
Self-aggression. Interestingly enough, punish- 
ment for aggression toward parents shows no 
relation at all (r — .03) and physical punish- 
ment a nonsignificant amount (.11), while 
Punishment for dependency (.23) and the 
nent factor score (.22) are both sig- 
as related, Furthermore, two of the 
a niques of discipline—which are indirect 
a ue of punishment—are associated in 
Eo Same way, deprivation of privileges (.24) 
iin rona of love (.48). The use of 
io s © rewards, which is positively related 

€privation of privileges, is also related 


positively to self-aggression (.39). (In boys, 
the four disciplinary techniques have a median 
intercorrelation of .28, while in girls, the 
median is .05; see Table 10.) 

As is the case with nearly all the other 
antecedent-consequent relations that have 
been reported in this and similar studies, 
these correlations are small on an absolute 
scale. This is to be expected, of course, in 
view of the multiple antecedents responsible 
for a given item of behavior; no one variable 
contributes much of the total variance. 
The influence of these several types of ex- 
perience can be displayed more dramatically, 
however, if comparison is made of a few ex- 
treme cases. For example, if the upper and 
lower halves of the distributions on permissive- 
ness and punishment factor scores are com- 
bined as follows: 

High permissiveness, low punishment 


(N = 17) 
Low permissiveness, high punishment 
WN = 23) 


for the two 


the mean scores on ggr 
el, p = .02, 


subgroups are diff t 
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one-tailed test? with the second subgroup 
the higher. 

Of course, with the relatively low intercor- 
relations among the child rearing variables 
(Table 10), adding further variables to such a 
crossbreak dom. i reduces the number of 
cases. For example, adding severity of toilet 
training as a third selector to the above cross- 
break gives the following: 

High permissiveness, low punishment, low 

severity of toilet training (V = 10) 
Low permissiveness, high punishment, 
high severity of toilet training (V = 17) 
The second group has higher mean self- 
aggression at the level, ? < .01, one-tailed 
test. If one uses four variables instead of three 
for obtaining some cumulative effects on self- 
aggression, the number of cases becomes very 
low indeed, However, the second of the fol- 
lowing pair of subgroups has higher mean 
self-aggression at the level, p < .004, one- 
tailed test, even with so few cases; the range 
of scores for the first group is from 1 to 4 
and that for the second group is 4 to 9: 
High permissiveness, low punishment, low 
deprivation of privileges, low use of tan- 
gible rewards (V = 6) 
Low permissiveness, high punishment, 
high deprivation of privileges, high use 
of tangible rewards (V = 8) 

By the time a fifth variable is added, the I 
becomes so small as almost to nullify the 
advantages of dealing with extreme cases, 
For instance, if severity of toilet training is 
now added to the above crossbreak, the Vs 
for the first and second groups are only 2 
and 6, respectively, and the significance of 
the difference between means is p = 02, 
one-tailed test. 

In the foregoing comparisons, permissive- 
ness has been added to the several punish- 
ment measures. As Table 8 shows, neither 
permissiveness for dependency nor for aggres- 
sion toward parents is at all related to self- 
aggression, but there are nonsignificant nega- 
tive correlations with permissiveness for 
aggression toward siblings (—.19), toward 
other children (—.15), the permissiveness 
factor (—.15), and encouragement to fight 


® Significance values by one-tailed test refer to 
hypotheses formulated before the data were collected, 
and which were derived directly from the more general 
theory published 19 years earlier (Dollard, Doob, 
Miller, Mowrer, & Sears, 1939, Ch. 3). 
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back when attacked (—.15). In other words, 
nonpermissiveness in every direction (except 
towards parents) seems related to self-aggres- 
sion. If we use the same device for selecting 
extreme cases of nonpermissiveness alone, as 
was illustrated above in. connection with 
punishment plus nonpermissiveness, the fol- 
lowing tiny subgroups are shown to be signifi- 
cantly different at the level, p = .05, one- 
tailed test; the second group's being the higher, 
of course: 

High permissiveness, low punishment, 
high permissiveness for aggression to 
other children, high encouragement to 
fight back (N — 3) 

Low permissiveness and high punishment 
on same scales (N — 7) 

Restriction on early expression of aggression. 
According to the theory (Dollard et al., 1939), 
self-aggression should develop most vigorously 
when the early expressions of aggression are 
successfully inhibited, There are three ways ot 
testing this proposition in the present data, 
and all the appropriate comparisons show 
correlations or mean differences in the pre- 
dicted directions, though the levels of sig- 
nificance are not very convincing. 

1. The comparison of age 5 aggression with 
antisocial aggression offers one test. The 29 
boys who were judged high at age 5 but low 
at age 12 expressed more self-aggression «than 
the 17 boys who remained high at age 12 
(p = .05, one-tailed test). It will be recalled 
that the permissiveness factor score was sig- 
nificantly lower for the more inhibited of these 
subgroups; so this present rather tenuous dif- 
ference can be reasonably interpreted as re- 
lated to the restriction produced by such 
nonpermissiveness. 

2. A second test involves toilet training and 
bed-wetting. In the earlier study of these 
children, it was shown that severe toilet 
training was positively related to ineffective 
control of urination. In the present follow-up 
sample this correlation is only .12, however; 
and there are an almost exactly equal number 
of boys who had severe toilet training and 
who did and did not suffer from enuresis at 
age 5. If we may interpret the enuresis as à 
form of disguised aggression, then the 1 
boys who showed such behavior may be con- 
Sidered less restricted in early aggression than 
the 12 boys who did not suffer enuresis. AS 
predicted, the former had lower self-aggression 
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scores than the latter, but the difference does 
not quite reach the .10 level, one-tailed test. 
Now, if we add the differential effects of high 
and low frustration to this comparison, 
as in the following pair of groups, we find a 
much larger differenée in the predicted direc- 


| tion: 


High bed-wetting, low severity of toilet 
training (V = 12) 
Low bed-wetting, high severity of toilet 
training (V = 12) 

The second group—with restricted early ag- 
gression and high level of frustration—was 
more self-aggressive at age 12, p < .001, 
one-tailed test, than the first group, which had 
unrestricted aggression (in the form of bed- 
wetting at age 5) and a relatively low level 
of frustration. (nonsevere toilet training). 
_3. A third test takes us back to the most 
highly controlled subgroup of boys as measured 
by the factor scores for permissiveness and 
punishment, "These boys were subjected to low 
permissiveness and high punishment. The 
restriction hypothesis predicts that those who 
were already low in their expression of ag- 
gression at age 5 (W = 6) would show greater 
Seli-aggression at age 12 than those who were 
still high at age 5 (NV = 5); for this compari- 
Son, the 10 boys rated on the middle point of 
the hine-point aggression scale at age 5 are 
disregarded. As predicted, the restricted 
Broup does show higher self-aggression than 
the Nonrestricted group, but p = .20. If we 
add to this comparison the congruently ex- 
cow groups on amount of quarreling with 
; ngs that the mother reported, we get 
pe few cases in the cells to permit further 
€sting. However, it is worth noting that the 


_ boy who obtained the highest self-aggression 


Score in the entire follow-up sample was not 


_ only in the least aggressive-in-the-home-at- 


ES group but also received the lowest pos- 
siblin Tating on amount of quarreling with 
m 85, and was in the earliest toilet trained 
En although he suffered the third highest 
i FD of toilet training. In other words, 
TUR not only in the least permissive and 
B home atmosphere, but he 
fespond AA severe frustration very early, 
restricted toit by submission, and was highly 
age 5. (inhibited) in his aggression at 


fusa) various bits of evidence support the 
Position that, in boys, restricted expression 
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of aggression in the earliest years is associated 
with a higher level of self-aggression in middle 
childhood, but the statistical significance of 
the differences cited is too low to give great 
confidence in their replicability. Nevertheless, 
these findings, coupled with the predicted 
influences of punishment and nonpermissive- 
ness, do give one an optimistic sense of right- 
ness about the general theory that self- 
aggression is a displacement of strong aggres- 
sion from frustrating parental agents to the 
self as a consequence of anticipation of punish- 
ment for expressing aggression directly to- 
ward the frustrators or other immediately 
available objects. 

Aggression anxiety. It will be recalled that 
the correlation between aggression anxiety 
and self-aggression was the only one among 
the age 12 scales that was significantly dif- 
ferent for the sexes; the boys’ r was higher 
than the girls’. The findings reported above 
suggest a reason. In boys, nonpermissiveness 
is prominent among the child rearing ante- 
cedents for both responses; hence a positive 
relation between the two is to be expected. 
But for girls, what little suggestion there is of 
meaningful antecedents to self-aggression 
lies nonsignificantly in this same area (e.g., 
low permissiveness factor) and in only one 
narrow aspect of the antifeminine sex-typing 
area (low reward of dependency and low use 
of withdrawal of love). The other relevant 
antecedents to girls’ aggression anxiety are 
unrelated to self-aggression. Thus, it is to be 
expected that these two scales would them- 
selves be but weakly correlated in girls. 

Role of the father. So far we have been con- 
cerned mainly with theoretical properties of 
behavior, such as punishment, permissiveness, 
and restriction (inhibition) of action. How- 
ever, the measures of these dimensions, with 
the exception of the factor scores, have re- 
ferred to particular substantive dimensions, 
e.g, permissiveness for aggression toward 
parents, or toward other children, punish- 
ment for dependency, physical punishment, 
etc, Furthermore, all of these measures imply, 
even when they do not explicitly state, an 
agent of the action as well as a substantive 
quality, eg, punishment (by mother) for 
dependency (toward mother). There are, in 
other words, three aspects to every child 
rearing dimension: a property that represents a 
theoretical variable in the general theory of 
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action and learning (e.g., punishment); 
a substantive property that describes a dimen- 
sion within the special theory of human social 
interaction (e.g., dependency, aggression); 
an agent, also defined within this latter theory, 
who is the opposite member of a dyad in which 
the child is object (e.g., mother, father). 

In the discussion of permissiveness and 
punitiveness as determiners of antisocial 
aggression, aggression anxiety, and self- 
aggression, we have been concerned with the 
first of these three aspects. Evidence con- 
cerning the effects of these two antecedents 
was sought wherever it might be found, re- 
gardless of the substantive dimension being 
measured; sometimes the factor score gave 
the strongest correlation, sometimes one of 
the more specific dimensions. When the ante- 
cedents of aggression anxiety came under 
consideration, however, the marked sex 
differences required more careful considera- 
tion of the second and third aspects, the sub- 
stantive quality of the dimensions involved 
and the agents; conclusions concerning the 
sex-different dynamic and genetic charac- 
teristics of aggression anxiety were stated in 
terms of the sex appropriateness of socializa- 
tion procedures, This kind of principle belongs 
in the special theory of human social interac- 
tion, a theory that in its present degree of de- 
velopment is certainly “special” to human be- 
havior and may be more narrowly limited 
than that; recent efforts to define transcul- 
tural variables are promising, but their suc- 
cess has not yet been fully assessed (Sears, 
1961; Whiting & Child, 1953). 

Any consideration of the role of specific 
persons in socialization belongs to the special 
theory. In connection with girls’ aggression 
anxiety, evidence was presented that high 
father participation in a girl's rearing at age 
5 was conducive to high anxiety. There is 
some indication that fathers also have a spe- 
cial influence on the development of self- 
aggression in boys. The evidence will be de- 
scribed but briefly, and only for purposes of 
suggesting an hypothesis in the realm of the 
special theory (where fruitful hypotheses are 
harder to come by than in the general theory 
of action and learning); for while trends within 
the data are quite consistent, almost none 
of the crucial antecedent-consequent com- 

parisons reaches a satisfactory level of statis- 
tical significance. 
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It will be noted in Table 8 that two scales 
involving the father show small correlations 
with self-aggression, his coldness, and his 
degree of participation in discipline when 
both parents are present. These two measures 
are uncorrelated with one another, and hence 
it is theoretically possible to maximize their 
combined effects by a crossbreak as follows: 


% HIGH ON 
FATHER N — SELF-AGGRESSION 
Cold 
Father chief 13 46 
disciplinarian 
Parents about equal 12 8 
Mother chief 17 35 
disciplinarian 
Warm 
Father chief 14 28 
disciplinarian 
Parents about equal 12 25 
Mother chief 7 14 
disciplinarian 


Regardless of whether the father is de- 
scribed as warm or cold, if he is the chief 
disciplinarian of the 5-year-old boy, the boy 
is more likely to have a high self-aggression 
Score at age 12 than if his mother is the chief 
disciplinarian. Likewise, if the father is cold, 
regardless of which parent does the disciplin- 
ing, the self-aggression scores are higher. None 
of the comparisons approaches significance by 
either ¢ tests between means or by x*. The 
top and bottom groups of the list maximize 
these joint effects. 

A third variable that may be influential 
is the father’s strictness. In general, the parent 
who is stricter tends to be the chief discipli- 
narian (by x, p < .01), and there is a close 
Positive relationship (by x’, p< .001) 
between the father’s rated strictness and his 
adoption of the chief disciplinary role. In 
order to see the effects of his strictness per 
se, then, the first and fourth groups above 
must be used for the next crossbreak. 


% HIGH ON 
FATHER N SELE-AGGRESSION 

Cold, chief disciplinarian 

High strictness 7 28 

Low strictness 6 67 
Warm, chief disciplinarian 

High strictness 4 75 

Low strictness 10 10 


In spite of the large percentage difference 
in the top pair of groups, the actual means are 
almost identical; by both ż test and x? the 
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difference is not significant. Between the strict 
and lenient warm fathers, however, the per- 
centage difference in high self-aggression 
scores faithfully represents a large mean 
difference also; by x’, p < .02, and by ¢ 
test, p < .01. 

In summary, there are minimal indications 
that high participation of the father as dis- 
ciplinarian, and his coldness, are both condu- 
cive to self-aggression, but only in a warm 
father does his strictness have a facilitating 
effect. This latter finding is reminiscent of our 
earlier discovery (Sears et al., 1957) that a 
mother’s withdrawal of love was influential 
on the development of conscience only if the 
mother was warm to start with; she could not 
affect a child if she tried to withdraw some- 


| thing she had never given! With the fathers, 


one may suspect that somewhat the same 
principle holds—his being strict can frustrate 
his son only if there would be love gained in 
the absence of strictness, 

These hypotheses are presented blind, in 
one sense. We do not know to what extent 
the effects of the father’s behavior are simply 
reflections of the operation of the general 
principles of learning and action involving 
punishment and nonpermissiveness. It may 
be that fathers used more of the tactics that 
produced self-aggression than did mothers, 
and that the seeming effect of father behavior 


_ 88 such is but a reflection of the more intense 


use of them. Even if this is so, however, a 
Principle having to do with the influence of 
fathers, as such, would be a useful addition 
to the special substantive theory of socializa- 
ig our culture) if it could be substan- 


Prosocial Aggression 


There is little to be added, concerning 
Prosocial aggression, to what was mentioned 
m the initial discussion of punishment and 
Pimissiveness, Like aggression anxiety, with 
Which 1t is strongly correlated, it is significantly 

gher in girls than in boys (Table 5). Within 

lé boys’ group, it correlates significantly 


_ With the femininity measure, and within the 


tha tested group of girls, likewise; within 
posi follow-up sample of girls, however, this 
pud relation is reduced almost to zero. 
a We m i i is 
pling error re presumably dealing with sam 


So far as antecedents are concerned, the 
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same high permissiveness and high punish- 
ment associated with aggression anxiety in 
girls are found similarly associated with pro- 
social aggression, though none of the other 
anti-sex-typing variables appears to be re- 
lated. For boys, there is some slight evidence 
of the same nonpermissiveness that was found 
in aggression anxiety, and a complete lack 
of any other child rearing correlate. 


SUMMARY 


V A follow-up sample of 76 boys and 84 girls 
from the original population of 202 boys and 
177 girls described in Patterns of Child Rearing 
(Sears et al., 1957) was given a set of self- 
administering aggression scales at age 12. 
These included measures of antisocial, pro- 
social, projected, and self-aggression and 
aggression anxiety. Reliabilities of all but the 
self-aggression scale were satisfactory. 
Intersex comparisons showed significantly 
higher scores for girls on aggression anxiety 
and prosocial aggression, and for boys on 
antisocial aggression. Intrasex correlations 
with a measure of masculinity-femininity, de- 
rived from Gough’s (1957) Fe scale, were in 
directions congruent with these obtained sex 
differences in mean scores. : 
Among the aggression measures, antisocial 
aggression was negatively correlated with both 
prosocial aggression and aggression anxiety, 
and had virtually zero relation to projected 
and self-aggression. These latter, however, 
were significantly related positively to ag- 
gression anxiety. In other words, there ap- 
peared to be an anxiety cluster including 
four of the scales, with 7’s ranging from 0 to 
+.50, and antisocial aggression, with r's 
between it and the members of the anxiety 
cluster ranging from 0 to —.44. 
Antecedents of these age 12 scores were 
sought among the ratings of the mother inter- 
views that had been obtained when the chil- 
dren were in kindergarten (age 5). Primary 
attention was given to dimensions involving 
permissiveness and punitiveness, such as 
those for control of aggression and dependency 
and methods of discipline. j 
Antisocial aggression was found to be posi- 
tively related to high permissiveness and low 
punishment. The former of these two effects 
was the same as that with respect to aggres- 
sion in the home at age 5 (rated from mother 
interviews); the latter effect was just the 
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opposite from that found at age 5. At the 
earlier period punishment incited aggression, 
preponderantly, while at age 12 the negative 
correlations are interpreted as exemplifying 
the inhibitory influence of punishment. 

Prosocial aggression and aggression anxiety 
likewise were found to be associated with high 
permissiveness, but in these instances high 
punishment was related to the two aggression 
measures. This was the opposite of the relation 
with antisocial aggression. 

Evidence was presented to support an ex- 
pectation, based on a theory of frustration 
induced drive, that the successful inhibition 
of early forms of aggression in severely pun- 
ished children would produce heightened 
amounts of aggression anxiety, prosocial, 
and self-aggression at a later age. 

An important sex difference was found in 
the antecedents for aggression anxiety, a 
fact that led to a theoretical interpretation 
that this phenomenon has different dynamic 
and genetic characteristics for the two sexes, 
In boys, this anxiety appears to be a part of a 
general pattern of rapid socialization and 
high conformity in a love oriented atmosphere 
of tight control of aggression. In girls, such 
anxiety appears to be conflict induced by an 
internalized Type II conflict derived from 
socialization experiences that exacerbate their 
aggressive activities because the preponderant 
training is antithetical to appropriate femi- 
nine sex-typing. This interpretation is sup- 
ported by a series of antecedent-consequent 
relationships, as well as by a cluster of inter- 
correlations between aggression anxiety and 
earlier forms of behavior, that are almost 
entirely different for boys and girls. 

Self-aggression in boys was found to be 
strongest in those who had suffered most 
severe frustrations and the most severe con- 
trol of outward directed aggression in their 
early years. Early successful restriction of 
aggressive behavior, as in severe toilet training 
with early success and in low expressions of 
aggression toward parents and siblings, was 
also conducive to high self-aggression. 
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S report examines one aspect of 
= | "identification." It is one of a series 
J Æ of reports on a longitudinal study, the 
initial phase of which was reported in Patterns 
of Child Rearing (Sears, Maccoby, & Levin, 
1957), in which the child training practices 
of the parents of 379 kindergarten-aged 
children in the greater Boston area were 
described, At that time, we also reported some 
» relationships between early childhood parental 
Practices and several personality characteris- 
‘ties of their 5-6 year old children. A follow-up 
study was done 6 years later, with those 
"children from the original study whom we could 
locate at this later date. (A more detailed 
description of this study appears in Sears, 
1961.) During the follow-up study, we did not 
obtain new information concerning the parents 
and their methods of rearing their children, 
but we did obtain fairly extensive information 
about the characteristics of the children, who 
Were in the sixth grade at the time of our 
follow-up Measurements. We can therefore 
Ieport three kinds of findings: the interrela- 
; Honships between several personality measure- 
ns taken at age 12; the interrelationships 
-Bebween certain aspects of child behavior at 
‘two different ages we studied; the relation- 
a between early childhood parental prac- 
a and the characteristics children display 
ee middle childhood. The present report 
aS primarily with the first and third kinds 
Of relationships, 
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——THE TAKING OF ADULT ROLES IN MIDDLE CHILDHOOD! 


ELEANOR E. MACCOBY 
Stanford University 


During the process of early childhood inter- 
action with his parents, the child, of course, 
learns to perform a host of actions that his 
parents deem desirable for a child, ranging 
from manual skills such as tying his shoes to 
temperamental matters such as controlling 
his temper. These kinds of behavior he learns 
in large part through guided trial and error, 
retaining primarily the behavior his parents 
reward and eliminating what they do not 
approve of. There is another class of behavior 
which we believe the child learns, even though 
he seldom performs the behavior overtly and 
is therefore seldom directly reinforced for it. 
This is the behavior which forms part of the 
adult role performed by his parents in caring 
for him, but which would be inappropriate for 
him to perform toward them in return. 
Examples of adult role behavior would be 
setting rules, applying discipline, and giving 
medicine when the child is sick. 

At first the child’s learning of adult role 
behavior may be manifested overtly, as when 
a 2- or 3-year-old child is heard giving stern 
commands and admonitions to himself or his 
dolls, in a tone of voice patterned exactly after 
his parents when they say controlling phrases 
to him. But the child soon learns that he is 
not allowed to do or say certain things that 
he sees adults doing—he cannot scold adults, 
for example, nor start the car. We assume that 
although he ceases to do and say items of 
adult role behavior overtly, he does continue 
to practice them covertly, and continues. to 
learn a great deal of the content of the behavior 
that characterizes the people with whom he 
interacts. A detailed description of how we 
believe this learning takes place has been given 
elsewhere (Maccoby, 1959). The sentences and 
other units of behavior thus learned are ready 
for elicitation when the appropriate situation 
occurs, and there may be a considerable lapse 
in time between the time they are learned and 
the time they are overtly manifested. 

While all children presumably learn at least 
some elements of the adult role behavior that 
characterizes their parents, there should be 
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differences among children in the amount of 
adult role behavior they learn and the strength 
of the learning. As in all learning, the 
strength of a behavior tendency acquired 
through covert practice varies, we assume, 
with the amount of practice. Some children 
wil frequently imagine themselves saying 
parent-like phrases and doing parent-like 
acts; others will seldom think of these things, 
and will therefore establish only weak tend- 
encies to display parent-like actions later on. 
What are the conditions that govern the 
amount of a child's covert practice of adult 
roles? 

We believe that several motives underlie 
such practice. The first is the child's desire to 
reproduce in fantasy events that have been 
reinforcing or associated with reinforcement. 
A parent's soothing, affectionate, and praising 
statements should be remembered by the child 

' and rehearsed by him, especially in situations 
where the parent is not present so that the 
child must provide reassurance and social 
support for himself, 

The second motive underlying a high level 
of covert practice of adult roles is based, we 
believe, on a high degree of parental control 
over the child. If the child cannot satisfy his 
needs without getting some mediating behavior 
from his parents (getting their permission to 
visit a friend, spend money, etc.), then his 
“vicarious trial and error? will necessarily 
involve rehearsing the kinds of controlling 
phrases his parents would say to him if he 
asked for their help in getting what he wanted. 
If he is not required to get parental permission 
for most of the steps he follows in pursuing 
his goals, then he will not have to take parental 
reactions into account in making his plans 
and will not engage in extensive covert practice 
of parental phrases and strictures, 

We have suggested that there are two 
motives a child may have for practicing 
covertly the words and actions characteristic 
of his adult caretakers. One is to enjoy once 
more the satisfactions that come from reward- 
ing, nurturant, anxiety reducing behavior 
directed toward him by his caretakers. The 
second is to anticipate the consequences of 
possible acts of his own, in order to work out 
good strategies for reaching his goals, A third 
motive will be mentioned here only briefly: 
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to obtain vicarious satisfaction by pretending 
to be a person who is enjoying rewards that 
are denied to the child himself. The child's | 
fantasy role playing of fictional charac- 
ters has a large component of this motive, 
we think, The child likes to play that he is 
Davy Crockett or Superman, because then he | 
can vicariously win the admiration of others 
and perform a great variety of feats of skill 
and strength. Presumably, the child can also 
get some vicarious satisfaction by pretending, 
to be his parents as they interact with each 
other and the outside environment, since they 
have so much more power than he does and 
direct access to more sources of gratification. | 
But for most of the discussion below, we will 
concentrate upon the first two motives, which | 
have to do with the child’s learning of those 
aspects of adult role behavior that are directed 
toward himself, the child, and are elicited by 
some feature of the child’s own behavior. 

As just mentioned, much adult role behavior 
is not appropriate for a child to display overtly 
vis-à-vis his parents. Indeed, he may be 
punished for displaying it. If he continues to 
practice it covertly, and thus establishes 
behavior potentials (habits) that are ready for 
elicitation, what will the situations be in which 
we may expect the behavior to be overtly 
manifested? Of course, when the child grows 
up and has children of his own, the actions 
which he has learned that are appropriate to 
the role of parent should emerge. But we expect 
that some elements of adult role behavior will 
emerge earlier, as the child interacts with his 
age-mates, Specifically, a child's adult role 
repertoire should be called out in relation to 
his age-mates whenever one of his age-mates 
provides the stimulus by acting out some 
aspect of a reciprocal child role. Thus if an 
age-mate solicits nurturance by making 4 
dependent response, the child will make an 
adult role nurturant response, provided this 
Tesponse is well established through earlier 
covert role playing of a nurturant parent’s 
tole. Similarly, if a child sees his age-mate 
breaking a rule, this should elicit rule enforce- 
ment on his own part, provided again that his 
adult role learning has been strong. 

The reader may have noted that although we 
used the term identification in the opening 
sentence of this article, we have not since 
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referred to this concept explicitly. We con- 


ceive of covert practice of adult role behavior 
as a basic process underlying the child's 
acquisition of behavior tendencies that are 
matched to the perceived behavior of a model. 
We are therefore talking about a process, or a 
product, that has frequently been labeled 
"identification." We have avoided use of the 
term because it has such a wide range of 
meanings in the existing literature, Also, we 
wanted to concentrate on both conditions of 
acquisition and conditions of elicitation of the 
model-matched behavior in question, and felt 
that this distinction was better handled with 
the terminology we have employed. Neverthe- 
less our work took the concept of identification 
as its point of departure, and we consider that 
the phenomena we are studying do come under 
the heading of identification as the term is 
generally employed. 

The present study concentrates upon the 
following issue: is there a tendency for children 
to employ, in their social interactions with 
age-mates, the same kinds of behavior their 
parents used toward them in similar situations? 
Can we discover conditions that determine 
which children will have taken on their parents’ 
characteristic adult role behavior to a high 
degree and which will not? We have chosen, as. 
the aspect of adult role behavior for study, the 
Child's tendency to enforce rules when another 
child deviates, Our hypotheses are as follows: 
F iim à child sees another child breaking 
the D he Will be more likely to try to enforce 
ks e if his parents were strict rule enforcers 

uring his early childhood, To test this hypoth- 
E We employ two measures of the parents' 

e enforcing behavior: their restrictiveness 
and their punitiveness. 
jai FR match between parental behavior and 
E : the child with respect to rule enforcing 

€ closest in those families where the 
Parents were highly nurturant. 


METHOD 
Sample 


Our measures of sixth i 
Dri e graders were taken in the 
S a 1958, in selected schools in the greater Boston 
whom fee were a group of children some of 
bored 2 Ve included previously in the study 

5 dés alterns of Child Rearing (Sears et al., 1957). 
tudy AE and sampling procedures for the earlier 
escribed in some detail in that book. Similar 


495 


details for the follow-up study appear in Sears (1961). 
Most of the children in our original sample had, by the 
spring of 1958, reached the sixth grade. We adminis- 
tered most of our measuring instruments to all the 
children in the sixth grades in the schools where we did 
our work. We obtained measures for the majority of 
our scales on 525 children, of whom 165 had been in 
the original sample 6 years earlier. In the following 
report, our analysis is based primarily upon the children 
who were in the original sample, although occasionally 
findings from the larger sample can be used. 


Measures 


The primary scale to be considered in the following 
report is Enforcing Rules. The measuring instrument 
that included the items for this scale also contained 
items for a number of other scales, which we describe 
briefly since we will report their interrelationships 
with the Rule Enforcement scale. 

There were seven scales scored from our so-called 
“Role Taking" questionnaire. This questionnaire was 
administered during a regular classroom hour by a 
research assistant. Most of the items were two or three- 
choice items, and each child checked his own choices 
after a silent reading of the items. For each scale, an 
item analysis was carried out on 100 cases from our 
final sample, not including any of the children in our 
original study. We correlated each item with total 
scores on the scale of which the item was intended to 
form a part, and on the basis of this analysis, discarded 
a few items that proved unreliable on the basis of 
internal consistency. For the scales where item analysis 
revealed generally poor scale reliability, we examined 
the inter-item correlations to see whether we could 
detect coherent groups of items that could be used to 
form one or more subscales. In Table 1 appears a de- 
scription of each scale finally employed, giving an 
example of an item, the number of original items, and 
the number dropped during item analysis. In each 
case, the example given is taken from the boy's form. 
For the girls form of the questionnaire, the items 
were changed to refer to another girl with whom the 

bject was interacting. 
e The Adult-Child Role Choice scale had two classes 
of items. The first involved the child’s choosing be- 
tween adult-like functions and child-like functions 
when both could not be chosen. The second involved 
situations in which another child deviated and was 
caught by an adult, and the question was which of the 
two (adult or deviating child) the subject “sided with, 
or empathized with. While we feared these diveres 
items might not turn out to form a unidimensional 
scale, item analysis did reveal positive relationships 


2The seven scales and a table presenting their 
intercorrelations have been deposited with the Ameri- 
can Documentation Institute. Order Document No. 
6869 from AD1 Auxiliary Publications Project, 
Photoduplication Service, Library of Congress; Wash- 
ington 25, D. C., remitting in advance $1.75 for 
microfilm or $2.50 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 


of Congress. 


496 


ELEANOR E. MAccosy 


TABLE 1 
SCALES SCORED FROM ROLE TAKING QUESTIONNAIRE 


Scale 


Example of an item 


Rule enforcement 


Acceptance of rule 


enforcement 


Give nurturance 


Accept material help| 


Accept comfort 


Beg for return of 
nurturance 


Adult-child role 
choice 


You are on your way to school. A boy in your class is walking in 
Íront of you. He drops a bottle on the sidewalk, and it breaks in 
many pieces. The boy walks on. What would you do? 

— — Say nothing; it’s his business, not mine. 

— — Tell the boy to pick up the pieces. 


By accident, you knocked down a bunch of coats in the coat room. 
Another boy said to you, “Hey, you better pick them up.” What 
would you say? 

— — Don't worry, I was just going to. 

— — O. K. Thanks for telling me. 


A boy in your class asks you to help him with his homework. You 
think this boy could do his homework by himself if he really 
tried. What do you do? 

— — Tell him I haven't time. 

— Tell him to do his own homework and stop bothering me. 

— —Help him. 


You and your friend have each bought a bag of candy. You finish 
yours first, and you are still hungry. He offers you one of the 
pieces of candy he has left. What do 

— Thank him and take one. 

— — Say, “No, thanks." 


Suppose you fall down and hurt yourself. You aren't hurt badly, 
ut you've skinned your arm and leg and you feel shaky. One of 
the other boys offers to stay with you till you feel better. Would 
you rather someone would stay with you, or would you rather 
they'd leave you alone? 
— I'd like someone to stay with me. 
— ld rather they'd leave me alone. 


Suppose there was a boy who had been quite a good friend of yours. 
He starts acting less friendly. He doesn't wait for you after 
school as often as he used to. What do you do? 

——Tell him I think he's being unfair. 

——Ask him to wait after school the way he used to. 

— —Don't do anything—it will blow over. 


1, You are at a school movie. You can either sit with your friends 
and watch the movie, or be an usher and show people to their 
seats. Which would you rather do? 

——Ait with my friends and watch the movie. 
———Be an usher and show people to their seats. 

2. In your school, it’s against the rules to slide down the banisters. 
One day, a boy you know slides down, and at the bottom he is 
met by a teacher, You are watching. How do you feel? 

— Sorry for him, because he got caught. 
——Not sorry for him; it’s his own fault for breaking the rules. 


Original | y | 
| 
6 6 
6 6 
7 5 
5 4 
you do? 
4 4 
5 5 
9 9 


between the two kinds of items, and between individual 
items and total score. Consequently, we did not sub- 
divide our scores on Adult-Child Role Choice. 

In the present report, we consider the relationship 
of rule enforcement not only to the scales just de- 
Scribed, but to a set of scales on aggressive tendencies. 
These scales are described in a separate publication 
(Sears, 1961). 

During the follow-up study, we interviewed the 
classroom teacher about each of the children in her 
room who had been included in our original study 6 
years earlier. We asked about a number of aspects of 
the child's behavior in the classroom, including his 


relationship to the other children in the class as well 
as to the teacher. In the present report, we present 8 
few findings from the teacher interview as they relate 
to rule enforcing tendencies. 


RESULTS 
Relation of Adult Role Functions to Other Per- 
sonality Traits 


First we consider the meaning of rule 
enforcement in the sense of its interrelations 
with other characteristics of the children- 
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TABLE 2 


CORRELATIONS BETWEEN ScORES ON RULE 
ENFORCEMENT SCALE AND ÜTHER 
PERSONALITY MEASURES 


(Total sample) 
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TABLE 3 


RELATIONSHIP BETWEEN TEACHER-INTERVIEW 
REPORTS AND CHILD'S SCORES ON 
RULE ENFORCEMENT 


(Follow-up sample only) 


= j 
Mekrüre wio wE 
Accept rule enforcement :24 -06 
Accept comfort .19 ll 
Accept material help —.05 .06 
Beg for return of nurturance 13 .16 
Adult-child role choice 34 325 
Give nurturance -10 —.08 
Self-criticism —.02 —.10 
Aggression anxiety .18 17 
Projected aggression .06 .00 
Self-aggression .02 .02 
Prosocial aggression .24 .08 
Antisocial aggression —.18 —.12 


Table 2 shows the several scales derived from 
the role taking and aggression questionnaires, 
and their intercorrelations with “enforcing 
rules,” 

Table 2 shows that, for both sexes, the 
correlations between “enforcing rules” and the 
other personality traits we measured are low. 
But quite a few are significant: a correlation 
of .17 is significant at the .01 level with a 
sample of this size. The table, then, shows the 
following small but significant relationships: 
ae children of both sexes who select the 
a enforcing alternative to our questions 
€nd to be “well socialized” in other respects. 
heen ae With a choice between adult 
rid child-like forms of behavior, the rule 
ond child chooses the adult forms more 
m an other children. Furthermore, rule 
m cement for both sexes is positively related 

gression anxiety, 
a pote of rule, enforcement appears 
Er [ed in some respects for the two 
ills d oys, requiring other boys to obey 
Docs T to be accompanied by a willingness 
bus a rule enforcement from other boys 
eroas has deviated oneself. This is the 
3 ud which Piaget (1948) believed was 
M t: element in mature moral develop- 

— En we do not find a correlation 
Others n tendency to enforce rules on 
t d e the willingness to accept rule 

evelop eu in turn. Piaget noted that moral 
ua ent among girls did not appear to 

pendent upon the development of a 


Teci 3 
Cliproca] relationship with peers as was the 


Boys rule en- | Girls rule en- 
forcement score | forcement score 
Low | High i 
E3] ed | CS | 49 
Teacher's report, percentage 
who: 
Are moderately or mark- | 45%** | 13%** | 18% 25% 
edly aggressive toward 
peers 
Misbehave when teacher is | 35% 31% 10%** | 35%** 
out of room 
Are relatively highly de- 45% 33% | 28% 36% 
pendent on teacher 
Are highly motivated to do| 33%" | 60%* | 79%* | 57%* 
their school work 
Number of cases 32 45 50 37 


* Significantly different at <.05 level. 
** Significantly different at <.01 level. 


case with boys, and our finding confirms his 
observation. 

The sexes also differ in the relationship of 
“Tule enforcement” to characteristics that 
relate to the handling of aggression. For boys, 
rule enforcement again shows itself as a symp- 
tom of successful socialization in that boys 
who give many rule enforcing responses are 
low in antisocial aggression and high in pro- 
social aggression, as measured by our self- 
administered scale. For girls, the relationships 
between these traits are in the same direction 
but weaker and not statistically significant. 

We see some confirmation of the trends we 
have reported in Table 3 when we study the 
teacher interviews and relate the characteris- 
tics the teachers report to the "rule enforcing” 
scores derived from the self-administered 
scales. We see that, for boys, high scores on 
both these scales are associated with “good” 
behavior in the classroom; that is, high-scoring 
boys are reported by the teachers to be un- 
likely, as compared with other boys, to start 
fights with their peers; and the teachers report 
that they work especially hard at their class- 
room assignments. None of these relationships 
holds for girls. In fact, the girls who score 
high on rule enforcing tendencies are lower than 
other girls in their interest in their school 
work, and more likely to misbehave when the 


teacher leaves the room. 
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In view of the fact that the tendency to 
enforce rules forms part of somewhat different 
constellations of traits for the two sexes, we 
must be prepared for the possibility that differ- 
ent aspects of early childhood training may 
predict it for each sex. Nevertheless, we 
proceed similarly for the two sexes with an 
analysis that follows from the theoretical 
considerations outlined at the beginning of 
this report. 


Antecedents of Rule Enforcement 


We predicted earlier that if parents had 
been strict rule enforcers toward a child during 
his early years, the child would in his turn be 
a strict rule enforcer with his peers in middle 
childhood. For our first test of this hypothesis, 
we employ the “summary permissiveness” 
scale derived from our early mother interviews. 
This scale was based on a factor analysis of a 
large number of mother-interview scales, and 
is composed of a sum of the following scores: 
sex permissiveness, permissiveness for aggres- 
sion toward parents, permissiveness for de- 
pendency, and low restrictions on the child’s 
use of house and furniture. High scores on this 
scale mean that the mother was permissive 
(nonrestrictive) in all or most of these spheres 
of behavior. Our prediction is that if the mother 
was restrictive (scored on the low end of our 
scale), her child should have relatively strong 
tendencies to enforce rules upon other children. 

As may be seen in Table 4, the prediction is 
borne out for boys, but not for girls. Boys 
whose mothers were restrictive in early child- 
hood give significantly more rule enforcing 
responses at age 11 than boys whose mothers 
were permissive during the child's early years. 
For girls, the aspect of child rearing that 
predicts the number of the girl’s rule enforcing 
Tesponses is not how restrictive the parents 
were in early childhood, but how punitive they 
were. Punitiveness is again measured by a 
summary scale, based on our factor analysis 
of the mother interviews. We added scores on 
the three variables that proved to have the 
highest loadings on our punitiveness factor: 
punishment for aggression toward parents, 
amount of physical punishment, and demands 
for the child to be aggressive toward other 
children, We also added the scores on the 
mother’s irritation over the child’s dependency. 


ELEANOR E. MaccoBY 


TABLE 4 


RELATIONSHIP OF EARLY CHILDHOOD SOCIALIZATION 
PRACTICES TO CHILDREN’S SCORES ON 
RULE ENFORCING 


(Follow-up sample only) 


Mean score: rule enforcement 
Boys Girls 
Mother’s permissiveness: 
Low (1-4) 4.28 3.26 
t = 2.64 
High (5-9) 3.39 3.07 
Mother’s punitiveness: 
Low (1-5) 3.76 2.61 
1233 
High (6-9) 4.08 3.61 
Correlation with rule enforcement scores* 
Mother’s permissiveness —.25 —.10 
Mother’s punitiveness .03 o7 
Number of cases 78 87 


* With a sample of this size, a correlation of .22 is significant f 
at the ,05 level, and one of .28 at the .01 level, A 


This scale was not included in the factor 
analysis, but in intercorrelations done sub- 
sequently, it proved to be part of the puni- | 
tiveness cluster. As Table 4 shows, when 
mothers were punitive toward their daughters 
in early childhood, these daughters tended, 
several years later, to be high rule enforcers 
with peers who deviated. 


Boys: Conditions Predicting a Close Match to 
Parental Behavior 


Let us concentrate now on the relationship à 
between maternal restrictiveness and rule 
enforcing among boys. Our original hypothesis 
was that children should resemble their parents 
in the rule enforcing tendencies they displayed 
But we further predicted that some boys — 
should resemble their parents more than 
others. We said that a condition governing à 
close match should be high parental warmth. 
Our reason for expecting that parents who | 
were warm and nurturant toward the children 


Ra 


would have children more similar to themselves | 
than less warm parents was that the child's i 
covert role practice of the parents’ behavior | y 

7 


should be increased if the child could give 
himself pleasure by imagining things that his 
parents had said and done toward him. We are 
not saying that warm mothers should havé 
tule enforcing children. We are saying that if 
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TABLE 5 
RELATIONSHIP BETWEEN PARENTAL RESTRICTIVENESS AND THE CHILD'S RULE ENFORCING 
(Boys) 

E Moor ie bars N Difference between Difference between 
T enforcing percentages differences 
p relatively warm: 

"Mother restrictive 81% 16— 

ther permissive 38% BE (p< 2n 

other relatively cold: us 

"Mother restrictive 64% 22—, nsum 

"Mother permissive 53% wa oo 


Mother is warm, her level of rule enforcing 
d be closely reflected in that of her child, 
‘that if she is warm and restrictive, the child 
d be high in rule enforcing, and if she is 
and permissive, her child should be low 
e enforcing. If the mother is cold, on the 
er hand, we do not expect her degree of 
itictiveness to make as much difference in 
Tule enforcing tendencies of her child. We 
that our prediction is at least partially 
€ out, for (among boys) the relationship 
En parental restrictiveness and the 
Der rule enforcing is indeed somewhat 
| a When the mother is warm (see Table 5). 
ES group, there is a significant 
ap lonship between maternal restrictiveness 
T the son's rule enforcing, despite the 
um number of cases per cell. When the 
i E me relationship is small and 
T ant. However, analysis of the 
second-order differences? does Bn show a 
acant effect of maternal warmth on the 
Of the relationship between restrictiveness 
Tule enforcing, so the interaction we 
ted is suggested but not demonstrated, 


B e and in the instances which follow, the signifi 

i jecond-order differences has been computed 
is p ‘Owing procedure: The x* value for each of 
p -order differences was computed, and from 
» 4 Standard error of the difference was obtained 
n mula: Vx2 = D/ôd for fourfold tables). 
Ea ard error for the difference between differences 
‘en obtained from the formula: 


i Võn+ ôn: 
a criti H 
om Critical ratio computed. One-tailed tests of signifi- 


nu. dave been employed in these second-order 
Since the direction of differences was pre- 


EAR 


$a = 


There are, of course, other conditions than 
parental nurturance that should cause the 
child to have his attention fixed on his parents 
and thus learn their characteristic responses, 
We can make the prediction that whatever 
the origin of dependency in a particular child, 
the children who were highly dependent during 
the preschool years should have acquired more 
of their parents’ characteristics and therefore 
show a closer match in their rule enforcing 
tendencies at a later time. This prediction 
is moderately well borne out (see Table 6). 

The analysis of second-order differences 
shows that there is a tendency (significant at 
p = < .05) for the boy’s early-childhood 
level of dependency to make a difference in 
the closeness with which his rule enforcing be- 
havior is matched to that of his parents, and 
the nature of the relationship is as predicted, 

In our earlier study (Sears et al, 1957, 
p. 168), we did not find any direct relation- 
ship between the mother’s warmth and the 
child’s dependency; so we can regard the two 
foregoing sets of tabulations as tests of two 
independent hypotheses. 

We see that it is the child's dependency on his 
parents that is associated with a close match 
between their behavior and his. We should not 
expect the same results if we consider a child's 
dependency on his peers. That is, the child 
who is dependent on emotional support from 
his age-mates at school, in the sense of being 
anxious to have their approval and anxious 
to regain their approval when it is withdrawn, 
will have taken new models for his behavior. 
We would expect that the child who shows 
the closest match to his parents! behavior 
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TABLE 6 
RELATIONSHIP BETWEEN EARLY DEPENDENCY AND THE CHILD’S RULE ENFORCING 
(Boys) 
Pei on rule N Difference between Difference between 
enforcing percentages differences 
Boys relatively dependent at pre- 
school age, and: 
Mother restrictive 70% AS] " 
Mother permissive 25% 20— (p « .01) " 
s o 
Boys relatively nondependent at (p < .05) 
preschool age, and: 
Mother restrictive 73% 15— 
596 
Mother permissive 68% 19— (ns) 
TABLE 7 
_ RELATIONSHIP BETWEEN NURTURANCE BEGGING AND RULE ENFORCING 
(Boys) 
Pe on ien N Difference between Difference between 
enforcing. percentages differences 
Boys low on begging for return of ! 
nurturance from peers, and: 
Mother restrictive 14% 19—, 
41% 
Mother permissive 33% 21— (p « .01) 
—34% 
Boys high on begging for return of (p = .06) 
nurturance from peers, and: 
Mother restrictive 68% sns % 
7%: 
Mother permissive 61% 18— (ns) 


should be the one who was both dependent 
upon them during early childhood, and not 
especially dependent upon peers currently. 
If he has shifted from parent dependency to 
peer dependency, we expect that although he 
will have learned earlier to act as his parents 
did, these tendencies will have been supplanted 
by new learning based upon his new focus for 
dependency. To discover whether current 
dependency on peers does, indeed, weaken the 
match between parent and child behavior, we 
employed our measure of the child's tendency 
to beg for the return of nurturance when peers 
have withdrawn it (see Table 7). We see that 
the relationship between the mother's strict- 
ness and her son's tendency to enforce rules 
on peers is weaker when the boy is dependent 
on his peers in the sense of dependency we 
have used. This difference in the size of the 
relationship under the two conditions of early 
dependency is strongly suggested by our data, 


although a chance hypothesis cannot be 
rejected with a p value of .06. 

We have seen, then, that a boy's tendency 
to act toward peers as his parents acted toward 
him is greater when he was highly dependent 
upon them. It further follows from our theory 
that the similarity in behavior between the 
two generations should be greater when the 
parents exercised close control over the 
children, This is a difficult hypothesis to test 
if the dependent measure we are using is the 
child’s restrictiveness toward peers. The only 
way we can approach the question is through 
the standard deviation of the child's responses. 
We predict that when the parent is restrictive; 
the child should be restrictive (rule enforcing) 
toward peers, and that there should be rela- 
tively little variability in rule enforcing among 
the children of restrictive parents. But when 
the parents are nonrestrictive, our prediction 
is that the children will have failed to copy 
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them, so that the child may or may not be a 
tule enforcer, His behavior should simply be 
unrelated to that of his parents. What we are 
predicting, essentially, is a scatterplot with 
an empty quadrant. That the relationship 
between the parent's restrictiveness and the 
son's rule enforcing is somewhat of this nature 
is indicated by the following facts: when the 
parents are restrictive, the sons are not only 
higher in rule enforcing, but the standard 
deviation of their scores is relatively low 
(5 = 1.69); for boys whose parents were 
permissive, their mean score on rule enforcing 
is lower, as we saw in Table 4, but the disper- 
sion of their scores is somewhat, but not 
significantly, greater (ô = 2.55). This suggests 
that when a parent is restrictive, his sons will 
be restrictive. When the parent is permissive, 
the son tends to be either permissive or restric- 
tive toward peers. His behavior tendencies 
in this sphere will, we suggest, have been 
acquired in some other way than by copying 
the behavior of his parents. 

We had a subsidiary hypothesis, not previ- 
ously mentioned, that the child’s behavior 
Would be more like that of his parents if 
authority in the household had been primarily 
exercised by the same-sexed parent. Our 
Tationale for this prediction was that, for 
Teasons already given, we would expect a 
ue to acquire more of his action tendencies 
e the more powerful of the two parents; 
a = action tendencies were acquired from 
ead parent, there would be little 
ann between them and other things the 
eo iv learned about the appropriate sex- 
rn g o behavior; so they should be more 

pes find their way into the child's overt 
m A at a later time than would be the case 
Sos. avior tendencies acquired from an 
En -sexed parent. Our prediction on this 
m Poe out. We classified the families 
Sine c UN we studied according to the 
ho CHEN the parent who was stricter and 
Bis didnt uk likely to be the one to adminis- 
N Dos e when it was required. There was 
Eileen due between the boy's rule 
his id. and the parent's restrictiveness when 

er was the authority figure in the 


ho 
ERU than when the mother played this 
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Girls: Conditions Related to the Closeness of 
Matching to Parental Behavior 


We have been pursuing the conditions that 
govern the closeness of the relationship be- 
tween a boy’s rule enforcing and his parents’ 
permissiveness or restrictiveness. Let us now 
turn to the girls, and see whether any similar 
regularities can be discovered. We have 
already seen that it is punitiveness rather 
than restrictiveness on the part of the parents 
that is associated with a girl’s being a strict rule 
enforcer with her peers. Why there is this sex 
difference in the home conditions associated 
with a child’s rule enforcing tendencies we do 
not know. We suspect, however, that in view 
of the difference in antecedent conditions 
underlying rule enforcement, and the differ- 
ence in the pattern of other personality traits 
associated with it, rule enforcement is a 
qualitatively somewhat different thing when 
girls do it than when boys do it. For girls, 
it probably has more aggressive overtones. 
In any case, we assume for the ensuing dis- 
cussion that a girl who is high on rule enforcing 
is displaying toward her peers the same kind 
of behavior her mother displayed toward her 
when the mother was punitive during the 
girl’s early childhood. Having already shown 
that there is an association between maternal 
punitiveness and the daughter’s later rule en- 
forcing tendencies, we now explore the condi- 
tions that govern the closeness with which the 
daughter’s behavior is related to her mother’s, 
That is, as we did with boys, we now try to 
discover what other conditions (either in the 
child training practices prevailing in the home, 
or in the personality traits developed by 
the child) increase the correlation between 
mother’s and daughter’s behavior, and what 
conditions reduce it. 

When we subdivide the girls according to 
whether their mothers were predominantly 
warm or cold toward them in early childhood, 
we do not find that this makes any difference 
in the size of the relationship between parental 
punitiveness and the girl’s rule enforcing 
tendencies. The girls’ earlier dependency, how- 
ever, does make a difference, Just as we 
found with boys, the girl’s behavior toward 
her peers is a closer reflection of her mother’s 
earlier behavior toward her, if the girl was 
relatively dependent as a young child (see 
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TABLE 8 


(Girls) 


Percentage of girls 
“high” in rul 
enforcing 


N Difference between 


percentages differences 


Girls relatively dependent at pre- 


RELATIONSHIP BETWEEN DEPENDENCY AND THE CHILD’S RULE ENFORCING 
Difference between 


school age, and: 
Mother punitive 16% su 659, 
Mother not punitive 11% 18— (p « .001) 31% 
Girls relatively ondepeadeny at (p < .001) 
reschool age, and: 
Mother punitive 40% ud L 
o 
Mother not punitive 32% 19— (ns) 
TABLE 9 
RELATIONSHIP BETWEEN NURTURANCE BEGGING AND RULE ENFORCING 
(Girls) 
Percentage of girls T 4 b 
torig ^ Tu mean — diem 
Girls low on begging for return of 
nurturance from peers, and: 
Mother punitive 71976 17— 
i RN 
Mother not punitive 13% 15—i (p « .001) 359% 
pu 0 
Girls high on begging for return of (p< .06) $ 
nurturance from peers, and: 
Mother punitive 50% pu 239; 
Mother not punitive 21% es (p < 11) 
Table 8). And again, as we did with boys, we from peers. Therefore, our findings do nol 
inquired whether the girl's behavior toward mean that when a child is anxious to keep | 


her peers is more closely matched to that of her 
mother if the girl is not currently especially 
dependent on her peers. We find that the 
relationship between maternal punitiveness 
and the daughter's tendency to enforce rules 
toward peers is greater if the girl is not 
dependent on her peers—at least as we have 
measured this dependency through our Begging 
for Return of Nurturance scale. The second 
order difference reaches only the .06 level of 
significance, however (see Table 9). 

It should be noted that the over-all propor- 
tion of high rule enforcers is about the same 
for the girls who are high on current 
dependency toward peers as it is for those who 
are low, disregarding the mother's punitive- 
ness. In fact, there is a small positive correla- 
tion in the sample as a whole between rule 
enforcing and begging for return of nurturance 


the good opinion of age-mates he or she wil Y 
avoid enforcing rules, for fear of alienating i 
his peers by a show of seeming bossiness 
Instead, the foregoing table (and the paralld 
one for boys) means that among these peer 
dependent children, the mother's punitivenes* 
is not the determining factor in whether they 
will be rule enforcers, while among the children 
who are more independent of their pen 
approval, rule enforcing tends to be matche 
to what their parents would do in similat_ 
situations, } 
For girls, too, we inquired whether the 
mother-daughter similarity in rule enforcing 
is affected by whether the mother is the na ! 
agent of discipline during the child's eat)” 
years. We find, to our surprise, that among 
girls the correlation between parental puni 
tiveness and the child's rule enforcing P | 


I 


somewhat (though not significantly) closer 
"when the father is the agent of discipline. The 
hypothesis, then, that having a same-sexed 
"agent of discipline produces greater matching 
of the child's behavior to that of the parent is 
p not sustained for either sex. 


SuMMARY 


_ This is one of a series of reports on a follow- 
"up study done with some of the families origin- 
ally described in Patterns of Child Rearing 
"(Sears et al., 1957). The present report deals 
“with one aspect of moral development in 
"Middle childhood (sixth grade): namely, the 
hild’s tendency to enforce rules when other 
Children break them. Scores on a scale measur- 
ing this tendency were correlated with 
Measures, taken concurrently, of other per- 
Sonality traits. Then the subjects’ rule 
enforcing tendencies were related to certain 
‘Aspects of the child rearing practices employed 
- by their parents during the children's preschool 
years. The major findings are as follows: 
d. Rule enforcing is part of a somewhat 
different constellation of personality traits for 
€ two sexes. For example, among boys it 
‘tends to be accompanied by a reciprocal 
tendency to accept rule enforcement from 
_ Peers when the subject himself has deviated, 
- While among girls this reciprocity is not found. 
2. Rule enforcing boys have parents who 
Were fairly strict rule enforcers during the 
: child's earlier years, in the sense that they were 
Testtictive (nonpermissive) about a variety of 
“childish impulsive behavior, The match 
etween the son's rule enforcing tendencies 
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and the earlier behavior of his parents is 
significantly closer when the child was rela- 
tively dependent during early childhood, and 
tends to be closer when (a) the parents were 
relatively warm, and (b) the child is not cur- 
rently especially dependent upon peers in 
middle childhood. 

3, Rule enforcing girls tend to have had 
relatively punitive parents during early child- 
hood. The match between parental punitive- 
ness and the daughters’ rule enforcing tend- 
encies is significantly greater when the child 
was relatively dependent upon the parents in 
early childhood, and tends to be greater if the 
child was not strongly dependent upon peers 
at the time of the follow-up study. The findings 
are interpreted as supporting the view that 
the child acquires a set of adult-like behavior 
tendencies during early childhood, through 
covert practice of his parents’ actions, and 
that these tendencies find their way into 
overt expression during interaction with 
peers at a later time when peers provide the 
necessary stimulating conditions, 
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GROUP THERAPY OF PHOBIC DISORDERS BY SYSTEMATIC 
DESENSITIZATION! 


ARNOLD A. LAZARUS 
Psychiatric Unit, Witwatersrand University Medical School 


HE increasing demands for psychological 
and psychiatric services dictate the 
need for effective short-term therapeutic 
techniques and the extension of the existing 
services. Consequently, group techniques have 
grown in clinical stature, and the past decade 
has witnessed the development of numerous 
divergent procedures. A most promising 
variety of short-term therapy is Wolpe’s (1958) 
system of “reciprocal inhibition,” by which he 
achieved the recovery of 188 out of 210 
neurotic cases in an average of 34.8 sessions. 
A double economy can be achieved by 
combining the advantages of Wolpe’s (1958) 
expedient clinical procedures with the addi- 
tional time- and effort-saving properties of 
group therapy. This paper describes the 
adaptation of Wolpe’s most important thera- 
peutic procedure—the technique of systematic 
desensitization based on relaxation—to the 
group treatment of phobic disorders. In 
addition, the therapeutic effects of group 
desensitization were compared with more 
conventional forms of interpretive group 
psychotherapy on matched pairs of phobic 
subjects. 


METHOD 
General Procedure 


The sample consisted of 35 middle-class urban 
white South Africans who were handicapped by 
phobic disorders.? Social class membership was defined 
in terms of education, vocation, and income. There 
were 7 university graduates, 16 matriculants, and 12 
patients with at least 3 years of secondary schooling. 
Apart from 3 professional women, the majority of 


! This paper is an outline of the experimental section 
of a thesis entitled “New Group Techniques in the 
Treatment of Phobic Conditions,” which was accepted 
by the University of the Witwatersrand in December 
1960, for the degree of Doctor of Philosophy. 

2The sample was not drawn from psychiatric 
hospitals or institutions as it was felt that extraneous 
variables would be introduced. Since the rules of the 
South African Psychological Association forbid regis- 
tered psychologists to advertise in the press, the 
patients were obtained with the generous aid of friends 
and colleagues who made announcements at lectures 
and contacted their own associates. 
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female patients were housewives whose husbands’ 
average earnings were the equivalent of $550 a month, 
The mean income for the rest of the group was approxi- 
mately $600 a month. In all, there were 12 men and 23 
women, the mean age being 33.2 years with a standard 
deviation of 9.87. 

The entire group included 11 acrophobics, 15 
claustrophobics, 5 impotent men (treated as suffering 
from sexual phobia), and a mixed group of 4 phobic 
patients. The latter comprised a girl with a fear of 
sharp objects, a man with a fear of physical violence, 
a woman who was afraid to be a passenger in a moving 
vehicle, and a woman with a phobia for dogs. 

The basic experimental design was to compare 
group desensitization therapy with more conventional 
methods of group treatment (or therapy based on 
“group dynamics”). The group desensitization tech- 
nique consisted of systematically counterposing by 
relaxation graded lists of anxiety evoking stimuli 
which the separate groups of patients were asked to 
imagine. 

The efficacy of group desensitization was first 
compared with group interpretation. The same therapist 
(the investigator) conducted all the therapeutic groups. 

The initial comparison was made on a group of five 
acrophobic patients, two of whom received desensitiza- 
tion therapy, and three who were treated by interpretive 
group procedures. 

Throughout the experiment, pairs of phobic patients 
were matched in terms of sex, age (within a 4-year 
range), and the nature and objective severity of the. 
phobic disorders. A coin was tossed to decide whether 
a given member of each matched pair would be treated 
by desensitization therapy or by group interpretation. 
Extra (unmatched) individuals were always placed in 
the interpretive groups because it is generally agree 
that these groups require a minimum of three members 
(Corsini, 1957). 

When, after six sessions, treatment with the acro- 
phobic groups was well underway, the next group (five 
impotent men) was selected and similarly subdivide 
into two additional groups, treated by desensitization 
and interpretation, respectively. ; 

"Three months later, the group of six claustrophobic 
patients was selected and equally subdivided to form 4 
third Separate desensitization-interpretation COM- 
parison. t 

Thus, at the end of 6 months, a total of seven patients 
had received group desensitization and nine had been 
treated by group interpretation. 

Seven months later, additional acrophobic and 
claustrophobic patients were obtained in order t9 
investigate the effects of relaxation per se. It was 
hypothesized that individuals who received training 1 
relaxation at the end of each interpretive session woul 
show a greater diminution of phobic reactions tha? 
those patients who had been treated solely by gtUP 
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interpretation. The suggestion that interpretation-plus- 
relaxation might be more effective than desensitization 
was also tested. 

Accordingly, an additional six acrophobic patients 
were equally divided into two matched groups, the one 
receiving desensitization therapy and the other receiv- 
ing interpretation-plus-relaxation. The latter group 
was trained in an accelerated version of Jacobson’s 
(1938) progressive relaxation for about 15 minutes at 
the end of each interpretive therapeutic discussion. 

A few weeks later, an additional nine claustrophobic 
patients were similarly subdivided. Group desensitiza- 
tion was administered to four patients and group 
interpretation-plus-relaxation was applied to five. 

Finally, the mixed phobic group was treated by 
desensitization in order to determine whether desensi- 
tization could be successfully applied to a heterogeneous 
phobic group. 

Thus, group desensitization was applied to 18 
patients; group interpretation was applied to 9 patients, 
and 8 patients were treated by group interpretation- 
plus-relaxation. 


Selection of Phobic Patients 


4 Although there were numerous volunteers for 
inclusion in the investigation, only those people whose 
phobias imposed a severe limitation on their social 
mobility, jeopardized their interpersonal relationships, 
or hindered their constructive abilities were admitted 
to the therapeutic groups. 

Several people who were greatly handicapped by 
Dhobic disorders were excluded because they had 
received previous psychiatric treatment, ranging from 
Psychoanalysis to electroconvulsive therapy. These 
People were given individual treatment in order to 
avoid ambiguity concerning the effects of the thera- 
peutic groups. 

The character and severity of the phobias were 
assessed in the following manner: Patients reporting 
pavenobic symptoms were privately and individually 

uted to climb a metal fire escape. The experimenter 
climbed the stairs directly behind the patients and 
oe them to see how high they could climb. Few of 
te neu were able to proceed higher than the first 
h ang (approximately 15-20 feet from ground level). 
€ patients who were admitted to the acrophobic 
groups were all able to achieve a pretherapeutic height 
of between 15 and 25 feet. 
ad ue patients with claustrophobic traits were 
wh F individually into a well-ventilated cubicle 
Nass arge French windows which opened onto a 
ae i The patient sat facing the open windows. To 
Eur of the patient was a movable screen which 
thus € pushed as far as the centre of the cubicle, 
Rane ee a sensation of space constriction. The 
HER ite EE told that the experimenter would first 
EE : tench windows and then proceed to push the 
E Owards the centre of the room. They were 
BE 0 remain in the cubicle for as long as possible 
need mu the windows only when they felt that the 
patients, ae had. become unbearable. Most of the 
Aes aa Showed visible signs of discomfort as soon as 
thee lows were shut, and no one was able to tolerate 
creen at a distance of less than 20 inches. 
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. Detailed information regarding the purpose of the 
investigation was withheld from the patients to avoid 
possible prejudice to the results. They were merely 
informedthatthe experimenter was conducting research 
into the alleviation of phobic disorders by group 
methods. 

Apart from the initial screening procedures, in- 
dividual contact with the patients was avoided in 
order to exclude the influence of any additional thera- 
peutic factors. It is thought, for instance, that history 
taking and psychometric investigations may in them- 
selves be therapeutic. In order to determine the value 
of group therapy per se, it was considered necessary to 
eliminate as many of these extraneous variables as 
possible. Attention was deliberately focused, therefore, 
on the specific techniques under investigation, avoiding 
the use of any supplementary measures which might 
facilitate therapeutic progress. For instance, in clinical 
practice it is customary to precede the application of 
systematic desensitization by a brief outline of the 
theoretical rationale behind the technique. Since it 
could be argued that this practice has a direct bearing 
on the results, the patients were directly desensitized 
without any preliminary explanation. 


Group Desensitization 


Anxiety hierarchies (graded lists of stimuli to which 
the patients reacted with unadaptive anxiety) were 
constructed (Wolpe, 1958). In preparing these hier- 
archies, the experimenter extracted common elements 
from remarks which individual patients wrote on the 
questionnaires they filled out, and a group hierarchy 
was constructed. It must be emphasized that the 
hierarchical situations were imaginary ones, listed on 
paper and presented only symbolically to the patients. 

The acrophobic group hierarchy, for example, 
consisted of the following situations: looking down 
from a very high building, seeing films taken from an 
airplane, looking down from a height of 80-100 feet, 
looking down a well, sitting high up on a grandstand 
during a football game, looking down from a 60-foot 
balcony, sitting on a narrow ledge at a height of 60 feet 
with a safety net a few feet away, looking down from a 
height of about 55 feet, seeing someone jump from a 
50-foot diving board, sitting on a wide ledge about 35 
feet from the ground, looking down from a height of 
approximately 20 feet, looking down from a height of 
about 10 feet. X 

The claustrophobic group hierarchy consisted of 16 
situations ranging from “sitting in a large and airy 
room with all the windows open,” to “sitting in front 
of an open fire in a small room with the doors and 
windows shut.” The group hierarchy ied to the 
impotent men contained 10items referri. gressively 
intimate sexual situations requiring incr 
of initiative. 

The first therapeutic session was d 
to training the patients in intensive musci 
At the end of the session, the patients 

to practise specific relaxation exercises 
minutes morning and night. 

The second session was held 3 da; 
further training in relaxation was provi 
the end of this session, desensitizatio 
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with the presentation (in imagination) of the two 
weakest items of the relevant anxiety hierarchy. The 
acrophobic patients, for example, were first told to 
picture themselves looking out of a window about 10 
feet from the level of the street. It was impressed upon 
them that if any scene proved upsetting or disturbing, 
they were to indicate this by raising their left hand. 
When any patient signaled in this manner, the scene 
was “withdrawn” immediately. 

When the two least disturbing items in the relevant 
anxiety hierarchy had been presented, each of the 
patients was asked to report on the clarity of the 
imagined scenes and their accompanying levels of 
disturbance. The second session ended after the patients 
had been told to practise relaxation twice daily for 
periods of about 10 minutes. 

The subsequent desensitization sessions followed a 
set pattern. The therapist named the various muscle 
groups to be relaxed. When a deep level of relaxation 
was reached, the patients were presented with succes- 
sive items from the hierarchical series. The desensitiza- 
tion procedure was conducted at the pace of the “slow- 
est” (i.e., most anxious) subject. 

The third session was terminated only when all the 
patients were able to tolerate an exposure of about 10 
seconds to the first three items on the hierarchy without 
signaling anxiety. Thereafter, new items were intro- 
duced only when a 10-second tolerance to the preceding 
item had been achieved. It took several sessions before 
the entire group was able to visualize a given item for 
as long as 10 seconds without one or another member’s 
signaling some disturbance. 

Apart from occasional restlessness in those who were 
ready for more “difficult” anxiety items but who were 
constantly re-exposed to stimuli which they had long 
since mastered, no harm seemed to ensue from proceed- 
ing at a pace that was obviously too slow for part of 
the group. On the other hand, experience has shown 
that too rapid a pace can prove extremely antithera- 
peutic and lead to increased levels of anxiety. 

Therapy in the desensitization groups was termi- 
nated when the final item on the hierarchy was tolerated 
by the patients for 10 seconds without signaling, 
Patients often reported a marked amelioration of their 
phobic responses when the anxiety hierarchies were 
only half completed. It was insisted that each member 
would nevertheless have to undergo desensitization 
of the entire hierarchy in order to consolidate and 
reinforce their therapeutic gains. 

The treatment of the claustrophobic groups was 
conducted out of doors. The patients in the mixed 
phobic group were handed the items of their relevant 
anxiety hierarchies on slips of paper. Here, the relaxa- 
tion procedure adopted was as previously outlined, 
but instead of describing the items, the therapist handed 
a typewritten anxiety scene to each group member and 
instructed him to read the description of the scene, to 
close his eyes and to try to imagine the situation with 
tranquility. The patients were instructed to signal in 
the usual manner when a given situation became 
disturbing, and then immediately to stop imagining the 
scene and to continue relaxing. After about 10 seconds, 

all the patients were told to stop picturing the scene 
and remain relaxed. Those who had successfully 
imagined their item without undue disturbance were 
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then handed a new anxiety situation. In this manner, 
each group member was able to proceed at his own 
pace. No more than two successive items were presented. 
at any one session. 


Group Interpretation 


The approach used in the interpretive groups was a. 
form of insight therapy with re-educative goals (Wol- 
berg, 1954). Leadership was basically democratic, and 
the therapist's primary role was that of a participant 
observer. The groups passed through two phases: 
First, there was an introductory period during which: 
the group situation was structured with the emphasis 
on a free and permissive emotional atmosphere. Feelings 
of initial tension and reticence were dealt with by 
open discussion, emphasizing group tolerance and 
acceptance and clarifying numerous misconceptions, 
Second, descriptions of phobic symptoms preceded 
intensive discussions which focused attention on 
emotions and on current interpersonal relationships. 
The emphasis shifted from a situational to a personal 
exploration. A considerable amount of historical data 
emerged and frequently provided abreactive and 
cathartic responses. The recall of forgotten memories 
was often accompanied by violent emotional reactions. 

The group of impotent men displayed a high degree 
of empathy for one another, and frequently expressed 
feelings of hostility and resentment towards the thera- 
pist. These feelings were accepted by him and clarified 
for the patients; they were followed by discussions of 
the effects of frustration. 

At the end of each session, the therapist provided a 
summary of the proceedings. He attempted not only to 
recapitulate the remarks of the subjects and to reflect 
back to the group the emotional significance of their 
statements, but also to suggest possible connections 
between their symptoms and their feelings. Interpretive 
remarks dealt mainly with possible motives behind the 
facade of manifest behavior. Premature interpretations 
were vigilantly avoided. Obvious rationalizations, 4$ 
well as statements of overprotestation, were challenge 
by the therapist only when he sensed a readiness on 
the part of the group. y 

Both the interpretive groups and the desensitization 
sessions were usually conducted three times a week. 
The desensitization groups were disbanded when all 
the patients were able to tolerate the most severe 
anxiety producing stimulus in the hierarchy without 
undue disturbance. 

Members of the interpretive groups were given the 
same number of sessions as the corresponding desensi- 
tization groups. Since very few patients recover 
from their phobias by means of the interpretive 
procedures, the ones whose phobic symptoms persist 
were provided with an opportunity of undergoing group 
desensitization. (There were too few desensitization 
failures to satisfy the minimum numerical requiremen 
for comparable interpretive groups.) Although t 
main response was an immediate willingness to intro- 
duce the “different group technique,” the grouP 
impotent men decided to continue employing interpre- 
tive procedures a while longer. Group desensitization 
was then administered to those patients who were 00 
rendered symptom-free by the interpretive methods 
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RESULTS 


Assessment of Recovery 


One month after therapy had terminated, 
the acrophobic and claustrophobic patients 
who claimed to have recovered from their 
phobias were required to undergo additional 
stress tolerance tests. 

The acrophobic subjects were required to 
climb to the third landing of a fire escape 
(a height of about 50 feet). From the third 
story, they were required to go by elevator 
with the experimenter to the roof garden, 
eight stories above street level, and then to 
count the number of passing cars for 2 minutes. 

The claustrophobic subjects were required 
to remain in the cubicle with the French 
windows shut and the movable screen a few 
inches away. Those who were able to endure 
the situation with no apparent distress for 5 
minutes were regarded as recoveries, provided 
that they were also able to present satisfactory 
evidence that they were no longer handicapped 
in their life situations. The tests were con- 
ducted individually in the presence of a 
witness, 

With two exceptions, all the patients who 
Stated that they had recovered from their 
Phobias were able to face the tolerance tests 
with outward tranquility, although some of 
the acrophobic patients later admitted that 
they had felt “a trifle anxious” when looking 
7 down. from the edge of the roof garden. 

Neither the impotent men nor the members 
of the mixed phobic group were objectively 
tested, 

The most rigorous criteria were used in 
assessing therapeutic results. For instance, 
only those patients who displayed an unam- 
biguous posttherapeutic freedom from their 
Tespective phobic disorders were classified as 
recoveries, These criteria were, of course, 
essentially symptomatic. If a claustrophobic 
Patient for instance, was still unable after 
therapy to visit the cinema for fear of suffoca- 
tion, his treatment was considered a failure, 
Tegardless of any ex parte testimony to the 
contrary. Merely to enable a patient to 
«accept his neurosis” or to achieve a so-called 

Personality reintegration” without sympto- 
ya relief was considered not good enough. 

€covery from a phobic condition implies total 
Neutrality or indifference to the formerly 
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TABLE 1 


NUMBER or PATIENTS Assicnep TO EACH CONDITION 
AND THE THERAPEUTIC OUTCOME 


Treated| pe- [Treated el 

R in. | Re- | b Re 
Patients PUE gt Pr ERES uated peed 
tization | €€d | tation | cred |and relax- | ered 

ation 
Acrophobics 5 4 3]|0 3 1 
Claustropho- 7 4 3 |0 E) T 
bics 

Impotence 2 2 3 | 0 — -— 
Mixed group 4 Ser Hes n 

Total 18 | 13 9/0 8 2 


anxiety generating stimulus constellation. 
The present study made no provision for 
moderate or slight improvements. The latter 
were all classified as failures. 


Statistical Analysis of the Results 

Results are summarized in Table 1. As 
shown, there were 13 recoveries and 5 failures 
for desensitization, 2 recoveries and 15 failures 
for other forms of treatment. The resulting chi 
square is 10.69, which is highly significant 
(p < .01). 

Additional statistical comparisons were 
computed for the matched pairs of acrophobic 
and claustrophobic patients who received 
group desensitization or group interpretation, 
respectively. There was a total of five matched 
pairs in these groups. Both members remained 
unimproved in one pair; in four, the desensiti- 
zation patients recovered but the interpreta- 
tion cases failed. There were no pairs in which 
both recovered or in which only interpretive 
methods succeeded. By applying the null 
hypothesis that the two methods are equally 
effective, the probability of obtaining this 
result is .0625. 

Tn the case of the impotent men, no matching 
was carried out. Fisher’s (1946, p. 97) exact 
test, which gave a probability of .1, was 
employed for testing significance. When the 
two probabilities, .0625 and .1, were combined 
(Fisher’s 1946, p. 99), the resulting level of 
significance was .03, favoring desensitization. 

Tt is interesting to note that when com- 
parisons were made between the matched 
pairs of acrophobic and claustrophobic patients 
who received group desensitization as opposed 
to group interpretation-plus-relaxation, the 
level of significance in favor of desensitization 
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was only 12.5%. There were three matched 
pairs in which desensitization proved success- 
ful and interpretation-plus-relaxation failed. 
There were no cases where group interpretation 
plus-relaxation succeeded while group desensi- 
tization failed. Both methods failed with 
three matched pairs and both methods 
succeeded with one matched pair. 

Since a significance level of 12.5% falls 
outside the conventional limits, the obvious 
conclusion is that there is no evidence of 
differences between the desensitization and 
interpretation-plus-relaxation conditions. Of 
course, seven matched pairs provides one 
with little leverage and the statistical analysis 
of such a small number cannot be conclusive. 
It is worth noting, however, that of the six 
individuals who were initially unsuccessfully 
treated by interpretation-plus-relaxation, four 
later recovered from their phobias after a mean 
of 9.8 group desensitization sessions. 

Of the total of 15 patients who had derived 
no apparent benefit from the interpretive 
procedures, 10 recovered from their phobias 
after a mean of 10.1 group desensitization 
sessions, as compared with the mean of 20.4 
sessions which were necessary for effective 
group desensitization when only this pro- 
cedure was employed. 


Follow-Up Studies 


Follow-up studies were conducted by means 
of the following questionnaire: 


1. Has your original phobic disorder returned? 

2. If you haye had a relapse, is it slight, moderate, 
or severe? 

3. Since receiving treatment have you developed 
any new symptoms? (If so, please elaborate.) 

4. Please underline all the following complaints 
which apply to you: 

Tension Depression Anxiety Palpitations 
Dizziness Insomnia Nightmares Headaches 
Tremors Sexual problems Fatigue Stomach 
trouble Other symptoms (specity) 

5. Please indicate whether any of the above com- 
plaints commenced after your participation in the 
therapeutic groups. 

6. Are you still handicapped in any area of your 
daily living? (Specity.) 

7. Have you consulted another therapist? 


The duration of after-study history varied 
from group to group, and ranged from 15 
months to 1.5 months with a mean of 9,05 
months. All those subjects whose follow-up 
reports revealed even slight phobic recurrences 
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were considered to have relapsed. Particular. 
attention was devoted to the question of 
possible symptom substitution, but no evi- 
dence of this phenomenon was encountered, | 

When the follow-up evaluations were taken | 
into account, 10 of the 13 patients who had 
recovered by means of group desensitization 
still maintained their freedom from phobic 
symptoms. Thus, 3 patients were regarded as 
having relapsed. 

Of the 2 patients who had recovered after 
undergoing group interpretation-plus-relaxa- 
tion, 1 maintained his recovery. 

Eight of the 10 patients who recovered after 
undergoing postinterpretive group desensitiza- | 
tion maintained recovery. 


Summary of Findings 

Group desensitization was applied to 18 
patients of whom 13 initially recovered and 
3 subsequently relapsed. 

Group interpretation was applied to 9 
patients. There were no recoveries in this 
group. 

Group interpretation-plus-relaxation was 
applied to 8 patients of whom 2 recovered and 
1 subsequently relapsed. 

The 15 patients who had not benefited from 
the interpretive procedures were then treated 
by group desensitization. There were 10 
recoveries of whom 2 subsequently relapsed. 


DISCUSSION 


Wolpe (1958) has expressed the basis of his 
"reciprocal inhibition" therapy as follows: 


If a response incompatible with anxiety can be made 
to occur in the presence of anxiety-evoking stimuli so 
that it is accompanied by a complete or partial sup- 
pression of the anxiety-responses, the bond between 
these stimuli and the anxiety-responses will be 
weakened. 


His method of systematic desensitization 
based on relaxation incorporates Jacobson's 
(1938) finding that muscular relaxation 
inhibits anxiety and that their concurrent 
expression is physiologically impossible. $ 

The deliberate use of the parasympathetic 
accompaniments of skeletal muscular relaxa- h 
tion to inhibit neurotic anxieties reciprocally 
may be termed “specific reciprocal inhibition.” 
There is, however, a broad range of stimuli 
which have nonspecific properties for inhibiting 
neurotic responses reciprocally. The more 
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usual clinical medium of verbal interchange, 
for instance, may in itself bring about the 
incidental or nonspecific reciprocal inhibition 
of neurotic responses. In other words, it is 
postulated that interview situations can some- 
times evoke autonomic responses similar to 
those of deep muscle relaxation. 

The fact that far fewer sessions were re- 
quired to desensitize those subjects in the 
present sample who had previously received 
interpretive therapy may be explicable by the 
notion that interpretive group situations 
evoked appropriate emotional responses in 
most of the subjects to inhibit some of their 
anxieties. In other words, it is probable that 


, some of the anxiety responses evoked by the 


group discussions underwent a measure of 
nonspecific reciprocal inhibition. Furthermore, 
those patients who received postinterpretive 
group desensitization had the advantage of 
having established a therapeutic relationship 
with the experimenter. It is postulated that 
the therapeutic atmosphere of empathy and 
acceptance may in itself reciprocally inhibit 
neurotic anxieties" (Lazarus, 1959). 

It should be mentioned that the interpretive 
groups apparently enabled many of the 
patients to achieve a constructive modification 
of their self-evaluation, often clarified their 
evaluation of others, and enhanced their 
potentialities of interpersonal integration. 
These gains, however, appeared to have little 
bearing on their phobic symptoms, which 
Usually persisted until desensitization pro- 
cedures were administered. 

The comparatively high relapse rate in the 
Present series is probably related to the fact 
that the treatment was rather narrowly con- 
fined to a single range of stimuli which could 
In some cases have been a small part of a 
broad constellation, other elements of which 
May have afforded additional and possibly 
More useful bases for desensitization. In a 
Proper clinical setting, the group desensitiza- 
tion Procedures would have been preceded by 
individual history taking and the compilation 
of detailed clinical information for use either 
o aay or in the group situations. 
P uay „the conditions for the applica- 
lon of desensitization therapy were far less 
e Optimum, a point which suggests that 
Wu l outcomes are only minimally 

cative of the utility of this therapeutic 
approach to phobic symptoms. 
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The concept of “experimenter bias" is a 
relevant consideration in any study of this 
kind. It is difficult to determine the extent to 
which the present results were influenced by 
the therapist's theoretical affiliations. In terms 
of subjective interest, however, it should be 
noted that the experimenter's preferences were 
decidedly in favor of the interpretive methods. 
Fortunately, the ennui which is generated 
while applying desensitization procedures is 
adequately offset by the gratifying results. 

If another therapist had treated the inter- 
pretive groups, a significant difference in the 
results might merely have reflected the 
superiority of the individual therapist rather 
than the methods employed. The treatment of 
phobias by interpretive methods, however, is 
well known to be difficult. Curran and Partridge 
(1955), for instance, state that “phobic 
symptoms are notoriously resistant to treat- 
ment, and their complete removal is rarely 
achieved." Similar views are expressed by 
Maslow and Mittelmann (1951), Henderson 
and Gillespie (1955), and Mayer-Gross, 
Slater, and Roth (1955). By contrast, phobias 
respond to desensitization exceedingly well 
(Eysenck, 1960; Lazarus & Rachman, 1957; 
Wolpe, 1958). It is contended, therefore, that 
the superior results achieved by group desensi- 
tization are not a function of the therapist’s 
disproportionate skills (or unconscious preju- 
dices) but a reflection of the intrinsic value of 
desensitization per se in the treatment of 
phobic disorders. 

The point may legitimately be raised as to 
whether desensitization achieves any result 
other than the elimination of the phobic 
symptom. Comments on the general reper- 
cussions of desensitization are not possible in 
the context of the present study. No attempt 
was made to study changes in personality or 
general adaptation. Many patients, however, 
made remarks which suggested that the elimi- 
nation of a phobic symptom is not an isolated 
process, but has many diverse and positive 
implications. As Eysenck (1959) states: 


The disappearance of the very annoying symptom 
promotes peace in the home, allays anxiety, and leads 
to an all-round improvement in character and behavior. 


The extent to which desensitization is a 
method of general applicability (i.e., whether 
this method would benefit any neurotic 
patients other than those suffering from 
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phobic disorders) is also worthy of mention. 
The value of desensitization is limited to those 
conditions wherein appropriate hierarchies 
can be constructed and where specific rather 
than pervasive anxiety is present. In other 
words, it is only where reasonably well-defined 
stimulus configurations can be identified that 
desensitization techniques should be applied. 
For example, patients whose interpersonal 
relationships are clouded by specific fears of 
rejection, hypersensitivity to criticism, clear- 
cut areas of self-consciousness, or similar 
specific anxiety evoking stimuli often derive 
benefit from desensitization procedures. By 
contrast, desensitization cannot readily be 
applied in such cases as character neuroses, 
hysterical disorders, and chronic inadequacy. 
A further prerequisite for the effective appli- 
cation of desensitization is the ability to 
conjure up reasonably vivid visual images 
which elicit emotional reactions comparable 
to the feelings evoked in the real situation, 

While dealing with the limitations of 
desensitization procedures, one should not 
lose sight of the fact that systematic desensiti- 
zation appears to be a most valuable technique 
in the alleviation of phobic disorders. The fact 
that this method can be effectively adminis- 
tered in groups suggests greater availability 
with little loss in economy or effectiveness for 
phobic sufferers. 


SUMMARY 


Wolpe's (1958) technique of systematic 
desensitization based on relaxation was 
adapted to the treatment of phobic disorders in 
groups. Of the 18 subjects who were treated 
by direct group desensitization, 13 recovered 
in a mean of 20.4 sessions. Follow-up inquiries 
after an average of 9.05 months revealed that 
3 of the subjects had relapsed. With a more 
traditional form of interpretive group psycho- 
therapy applied to 17 subjects, after a mean 
of 22 therapeutic meetings, it was found that 
only 2 patients were symptom-free. Both these 
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patients had attended groups in which relaxa. 
tion was employed as an adjunct to the inter 
pretive procedures. The 15 subjects who were 
not symptom-free after interpretive group 
therapy were then treated by group desensiti- 
zation. After a mean of 10.1 sessions, 10 of 


them recovered. The very much shorter time 


required to effect a recovery by desensitization 
in those patients who had previously received 
interpretive therapy suggests that the thera 
peutic relationship and additional nonspecific 


inhibition of neurotic anxieties motivating the 
phobic symptoms. There is some basis for the 


factors may have facilitated the z 


idea that therapists of every persuasion could 
helpfully employ systematic desensitization 
as an adjunct to their traditional techniques 
in the management of phobic disorders. 
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SOME PARAMETERS OF THE CONDITIONING OF VERBAL AFFECT 
RESPONSES IN SCHIZOPHRENIC SUBJECTS: 
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techniques to verbal behavior has 
proved fruitful in a variety of situations 

(see recent reviews by Krasner, 1958; Salzinger, 
1959b). The specific task of the experimental 
analysis of the interview has been outlined by 
Salzinger (1959a), and this approach has met 
with experimental success. 

It has been shown that the output of verbal 
affect responses of schizophrenic patients can 
be manipulated in an interview situation by 
the use of discriminative stimuli such as 
questions (Salzinger, 1956; described in 
Zubin, 1958) and by the use of reinforcement 
such as verbal expressions of agreement 
(Salzinger, 1956; Salzinger & Pisoni, 1958; 
described in Zubin, 1958). Verbal reinforce- 
ment was also shown to be effective in increas- 
ing the frequency of occurrence of verbal 
affect in normal subjects (Salzinger & Pisoni, 
1960), and a comparison of a schizophrenic 
and a normal group revealed a difference only 
in their extinction rates. 

: In the following studies a number of addi- 
tional parameters of the conditioning of 
verbal affect in an interview situation were 
investigated. 


T= application of operant conditioning 


EXPERIMENT I 


The following study? was undertaken to 
compare the effectiveness of verbal reinforce- 
ment when given at the beginning of the 
interview with its effectiveness when given in 
the middle, 
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Procedure 


Subjects for this experiment were 10 female and 4 
male hospitalized schizophrenics, ranging in age from 
21 to 48 years with a median of 34. Subjects were 
selected from admissions to a state hospital and were 
accepted as schizophrenic on the basis of the diagnosis 
given at the hospital distribution center. Eight patients 
had been previously hospitalized. None had received 
insulin, electroshock, or tranquilizing drugs for at 
least one week before the interview. 

Each subject was interviewed one week after arrival 
in the hospital. All interviews were recorded with the 
recording equipment in full sight of both the subject 
and the experimenter. 

The interview was presented to the subject as a 
routine mental hospital procedure. The experimenter 
brought the subject into the experimental room and 
explained that the interview was being conducted to 
help him. 

The experimenter questioned the subject about his 
name, age, marital status, children, and siblings. The 
subjects had little difficulty answering these questions, 
Any response given was reinforced by the experimenter 
repeating the answer, writing it down, and saying 
“mm-hmm,” “ubha,” or “I see.” This procedure was 
adopted in an effort to obtain factual information 
upon which the remaining interview questions could 
be based and to establish the experimenter as a source 
of reinforcement, thus encouraging the subject to 
speak in the presence of the experimenter. Then the 
experimental part of the interview was initiated with 
the question, “Would you tell me why you are here in 
this hospital?” 

All subjects were interviewed for a period of 30 
minutes, During the first 10-minute period (condi- 
tioning), the experimenter questioned the subject and 
reinforced each affect response (as defined below), By 
reinforcement is meant verbal agreement by the 

imenter through the use of such words as “‘mm- 
hmm,” “ubha,” “I see,” “yeah,” “yes,” etc. immedi: 
ately following the affect statement. Ae 

This was followed by 20 minutes of extinction 
(divided into two 10-minute periods in the analysis 
presented below). The experimenter continued asking 
questions but withheld all reinforcement. 


Definition of Response 

The response class of affect was defined as any 
statement by the subject describing or evaluating his 
own state in other than intellectual or physiological 
terms. In other words, included jn the response class 
were only those affect statements in which the subject 
referred the affect to himself by using the pronouns I or 
We, as indicated in the following examples: “I am 
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satisfied,” I’m happy,” “We enjoyed it,” “I like him” 
(Salzinger & Pisoni, 1958). 

A count per minute was taken of affect statements 
and general self-referred statements (all statements 
beginning with I or We). The latter served as a measure 
of each subject’s general output of self-referred state- 
ments. 


Interviewer Questions 


After the question, *Why are you here?" the 
experimenter asked additional questions only when the 
subject ceased talking for at least 2 seconds, The 
following topics were discussed during the interview: 
reasons for being in the hospital, causes for illness, 
patient’s relationships to his parents, siblings, fellow 
employees, employers, fellow students and teachers, 
wife or husband, children and friends, patient’s activi- 
ties during free time, and plans for the future. The 
experimenter made an attempt to balance these topics 
over the different conditions of the interview by bring- 
ing up the same topics in each 10-minute period. As 
long as the topics were approximately balanced over 
the three 10-minute periods, however, the experimenter 
took his cues as to topic from the content of the subject's 
Statements. Thus both the number of topics and the 
order in which they were discussed varied from inter- 
view to interview. Questions dealing directly with 
affect—e.g., “How did you feel about that?" or “Were 
you happy?"—were not used. 

The results obtained from this group (Group B) 
were compared with those obtained from two previously 
reported groups (Salzinger & Pisoni, 1958). For one 
group (Group A) the 30-minute interview was divided 
in the following way: During the first 10 minutes 
(operant level), the base rate of affect statements was 
determined—i.e., the experimenter did not reinforce 
any statement; during the second 10 minutes (condi- 
tioning), the experimenter reinforced all affect state- 
ments; during the third 10 minutes (extinction), the 
experimenter again withheld reinforcement. For the 
second group (Group C), no reinforcement was given 
for the entire 30 minutes of the interview. 


Results 


Conditioning effect for total groups. Fried- 
man’s nonparametric analysis of variance 
(Siegel, 1956) was applied separately to each 
group in order to determine whether reinforce- 
ment produced an increase in affect statements 
in the period where it was delivered, Inspection 
of these total group analyses in Table 1 shows 
that the largest number of affect responses 
occurred during conditioning when the rein- 
forcement was given in the middle of the 
interview (Group A).? There is a nonsignificant 


3 For this paper, all interviews were carefully typed 
and later checked by the investigators to insure the 
accuracy of the frequency counts. In the course of this 
process, small changes were made in the coding of the 
interviews previously reported (Salzinger & Pisoni. 
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TABLE 1 
Sum or RANKS or ArrkcT STATEMENTS FOR EACH oF 
THE TEN-MINUTE PERIODS FOR THE THREE 
Groups or PATIENTS 


Successive ten- 
minute periods 


Group xr > N 
First | Second | Third 

A | 41.5 | 50.5 | 28.0 | 12.82 | .002| 20 

B 31.5 | 25.5 | 27.0 | 1.39 | .50 14 

Cc 30.0 | 37.0 | 29.0 | 2.37 | .30 16 


concentration of affect statements in the first’ 
10 minutes when the reinforcement was given 
in the beginning of the interview (Group B). 
The control group (Group C) shows a non- 
significant clustering of affect statements in 
the second 10-minute period of the interview. 
Conditioning effect as a function of a minimum 
number of reinforcements. A comparison of the 
number of reinforcements given to Groups A 
and B reveals that members of the former 
received a slightly larger number of reinforce- 
ments than the latter (although p = .50). 
Furthermore, examination of Group C (which 
received no reinforcement) shows a distri- 
bution (p = .30) in a direction opposite to 
that expected from the experimental paradigm 
of Group B and in the same direction as that 
expected in Group A. Although neither of 
these two differences is statistically significant, 
the combination of both trends would tend to 
obscure the effect of the reinforcement in 
Group B for two reasons: Conditioning may 
not take place if the number of reinforcements 
is too small, and the expected larger number 
of affect statements in the first 10 minutes may 
not be apparent because of the “natural” 
trend (see Group C) for the second 10 minutes 
to exceed the first 10 minutes. J 
For these reasons, and also because à 
previous study (Salzinger & Pisoni, 1958) 
had already shown that the strength of a 
response varies as a function of the number of 
reinforcements applied to it, it was decided to 
search for the minimum number of reinforce- 
ments necessary for conditioning to take place. 


1958), thus producing discrepancies between the sums 

of ranks reported in that paper and in this study. It is 

to be noted, however, that the new counts showed no 

difference in the substantive results; all the conclusions 

H reported then are still supported by the corrected 
lata. 


| 


CONDITIONING OF VERBAL AFFECT RESPONSES 


This was accomplished by successively re- 
moving from Group B those individuals who 
received the lowest number of reinforcements 
until the group showed the conditioning effect. 
When those subjects receiving the lowest 
number of reinforcements (two) were removed, 
the sums of ranks for each of the succeeding 
10-minute periods (conditioning period and 
first and second extinction periods) were 28.5, 
22.5, and 21.0, respectively (p = .27). When 


| the individuals having the next highest number 
| of reinforcements (four) were removed, the 
| sums of ranks changed to 25.0, 18.5, and 16.5 


(p = .15). Finally, when subjects receiving the 
next highest number of reinforcements (six) 
were removed, the sums of ranks were 22.0, 
15.5, and 10.5 (p = .02). In other words, 
subjects receiving six reinforcements or less 
did not give evidence of conditioning, whereas 
subjects receiving a larger number did. 

A reasonable question arising from this 
analysis is whether the selection of individuals 
with a larger number of reinforcements is not 
simply a matter of biased sampling. Conse- 
quently, the equivalent procedure was applied 
to Group C, which received no reinforcement. 
Removing subjects with less than seven affect 
statements in the first 10 minutes produced the 
following sums of ranks: 11.5, 15.0, and 9.5 
(p = .27). No significant difference, then, was 
found between the periods of the interview, 


_ and the trend was quite different from the one 


found in Group B after removing subjects who 
Were given less than seven reinforcements. 
Comparison of conditioning and extinction 
effects in Groups A and B. A comparison of 
the number of affect statements emitted during 
the conditioning periods of Groups A and B 
showed no significant difference (p = .72). A 
Comparison of the number of affect statements 
emitted during the last 10 minutes of Group A 
(extinction) with the second 10 minutes of 
Group B (first extinction period) yielded no 
Significant difference (p = .50), nor was there 
a Significant difference (p = .22) between the 
extinction period of Group A and the last 10 
Minutes of Group B (the second extinction 
Period). 
4 In view of the fact that conditioning was 
demonstrated in Group B for only those 
Individuals who received more than six 
Teinforcements, Group A was subjected to a 
Similar analysis by removing all subjects who 
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had received six reinforcements or less during 
conditioning. The resulting sums of ranks for 
the three conditions were operant level, 26.0; 
conditioning, 38.0; and extinction, 20.0 (p = 
.002). Inspection of the individuals who had 
received six or less reinforcements showed that 
most of these individuals did not condition. 
In other words, for Groups A and B, more than 
six reinforcements seemed necessary for 
conditioning. 

Self-referred statements. Application of 
Friedman’s analysis of variance to the change 
in the general class of self-referred statements 
revealed no statistically significant difference 
among the three 10-minute periods for Group 
C (p = 44). There were significant differences 
for Group B (p = .03) and for Group A (p = 
.01). The largest number of self-referred state- 
ments occurred during the first 10 minutes for 
both groups. 

The Friedman analysis of variance was then 
applied to the general self-referred statements 
of those subjects in Groups A and B who had 
received more than six reinforcements. This 
analysis yielded no significant difference 
among the three 10-minute periods (p = .12 
for Group A, p = .45 for Group B), In Group 
C, those individuals who emitted more than 
six affect responses in the first 10 minutes 
showed no significant difference in self-referred 
statements among the three 10-minute periods 


(p = .35). 
Discussion 

Although Group A emitted a larger number 
of affect statements than Group B during 
conditioning, the difference was not significant. 
Since Groups A and B showed significant 
conditioning effects, taking into account the 
number of reinforcements, it must be con- 
cluded that the effectiveness of reinforcement 
is the same whether the period of conditioning 
is placed at the beginning or in the middle of 
the interview. Different parts of the interview 
assume their importance not because of their 
temporal placement, but rather because of the 
presence or absence of reinforcement. 

The procedure of selecting those subjects 
who received more than a minimum number of 
reinforcements to investigate the effectiveness 
of reinforcement was based on the assumption 
that a variable must be sufficient in magnitude 
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to be effective. Furthermore, the fact that the 
control group showed no “conditioning” when 
subjects are selected in an analogous manner 
suggests that a minimum number of reinforce- 
ments must be administered before a condi- 
tioning effect can be expected. 

In these interview studies, however, the 
period of time during which reinforcement was 
available was held constant (10 minutes), and 
the number of reinforcements received was 
therefore dependent, in part, upon the sub- 
Ject’s "normal" (operant level) output of 
verbal affect. 


EXPERIMENT IT 


In a second experiment, a given number of 
reinforcements was arbitrarily assigned to each 
subject prior to his entering the experimental 
situation. In this way the number of reinforce- 
ments administered was made independent of 
the subject’s output. Furthermore, by allowing 
the duration of the conditioning period to vary, 
it was possible to study the rate of acquisi- 
tion—i.e., the time between successive affect 
Statements during conditioning—as another 
parameter in the conditioning of verbal affect. 


Procedure 


Subjects were 22 female and 10 male hospitalized 
Schizophrenics, ranging in age from 19 to 51 with 
median age of 32.5 years. Eighteen subjects had been 
previously hospitalized. The selection of subjects 
was the same as in Experiment I. 

The interview procedure was identical with that of 
Experiment I except that during the first 10 minutes 
(operant level), the base rate of verbal affect was 
determined. The experimenter asked questions but did 
not reinforce any statement made by the subject. 
During the Second part of the interview (conditioning), 
the experimenter continued asking questions whenever 
necessary, but reinforced only a predetermined number 
of successive self-referred affect Statements. Each 
subject was arbitrarily assigned a specific number of 
reinforcements; 4, 6, 8, 10, 15, or 20. The length of the 
conditioning period depended, therefore, upon the 
length of time it took the subject to receive his assigned 
number of reinforcements up to a maximum of 15 
minutes, 

Of the original 32 subjects, 17 received the assigned 
number of reinforcements in the allotted time, the 
conditioning periods ranging in length from 30 seconds 
to 14 minutes. The remaining 15 subjects did not 
receive their assigned number of reinforcements in the 

allotted time; therefore, their conditioning periods ran 
the full 15 minutes.* 


‘This paper was presented in part at the Eastern 
Psychological Association, Philadelphia, 1958. 
5 The 2 subjects assigned to the 4reinforcement 
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Immediately following the conditioning period, all 
subjects were given 10 minutes of extinction during 
which the experimenter withheld all reinforcement, 


Results 


Conditioning effect as a function of minimum 
number of reinforcements. The analyses were 
restricted to those subjects who received the 
assigned number of reinforcements within the 
allotted time. Subjects who did not fall within 
this category were excluded because the 
number they did receive was determined not 
by the experimenter’s assignment but by their 
operant levels. This was done in order to 
exclude operant level as a determining factor 
for number of reinforcements. 

The mean number of affect statements per 
minute was found for each condition, The 
group was then divided into subjects who 
received 10 or more reinforcements and those | 
who received 8 or less. A Friedman nonpara- 
metric analysis of variance, applied to the 
group receiving 10 or more reinforcements, 
revealed a significant difference among the 
three conditions of the interview (p = .05). 
The largest mean number of affect statements 
occurred during the conditioning period (sum 
of ranks = 21.5), the next largest during 
extinction (sum of ranks = 14.5), and the | 
smallest during operant level (sum of ranks = | 
12.0). | 
When the Friedman test was applied to the 
three conditions of the interview for the group | 
receiving eight or less reinforcements, nO | 
significant difference was found (p = .40).. 
Examination of the trend revealed that the 
largest number of affect statements occurred 
during the extinction period (sum of ranks — 
21.0) the next largest during conditioning 
(sum of ranks — 17.5), and the smallest during 
operant level (sum of ranks — 15.5). 

When the same statistical procedures were 
applied to the general self-referred statements, 


group and the 6 subjects assigned to the 6-reinforcement 
group received their allotted number of reinforcements. 
For the 4 subjects assigned to the 8-reinforcement 
Broup, one did not get the allotted number; of the 6 
assigned 10 reinforcements, 3 did not; of the 7 assigned 
15 reinforcements, 5 did not; of the 10 assigned 20 |, 
reinforcements, 6 did not. i 
Comparisons of the subjects who received their | 
assigned number of reinforcements with those who di 
not revealed no statistically significant differences 
>. .05) in terms of sex, age, educational level, or ad- 
mission status. 


CONDITIONING OF VERBAL AFFECT RESPONSES 


no statistically significant differences were 
found either for subjects who received 10 or 
more reinforcements (p = .65) or for subjects 
who received 8 or less (p = .38). 

Number of reinforcements and resistance to 
extinction. In order to investigate the relation- 
ship between number of reinforcements and 
| resistance to extinction, a Spearman rank-order 

correlation coefficient (Siegel, 1956) was 
computed between number of reinforcements 
and the number of affect statements in the 
extinction period. For the total group, the 
correlation was not significant (r, = +.29, 
p > .05). For subjects with 10 or more rein- 
forcements, the correlation between number of 
reinforcements and number of affect state- 
ments in the extinction period was —.04 
(p > .05); for subjects who received 8 or less, 
the correlation was 4-.68 (p « .05). 

Rank-order correlations were also computed 
for the relation between the rate of acquisition 
and resistance to extinction. The ratio of 
number of affect statements to the time taken 
to emit them was correlated with the number 
of affect statements during extinction. For the 
total group, the correlation was +.28 (p > 
05); for subjects receiving 10 reinforcements or 
more, the correlation was +.64 (p < .05); for 
subjects receiving 8 or less, the correlation was 
+.22 (p > .05). 

To investigate further the effect of rate of 
acquisition upon resistance to extinction, 11 
subjects from this sample were matched with 
11 subjects selected from a previous study 
(Salzinger & Pisoni, 1958). The interview 
procedures were identical except that the 
length of the conditioning period in the earlier 
study was held constant (10 minutes) and the 
number of reinforcements administered varied 
as a function of the subject’s verbal behavior. 
These 11 pairs were matched in terms of 
diagnosis (schizophrenic), sex, operant level 
(no difference greater than four responses in 
10 minutes), and number of reinforcements 
teceived (no difference greater than two). 
They varied only in terms of their rate of 
Acquisition, ie. time taken to receive the 
Same number of reinforcements. 

Nine of the 11 subjects who had the faster 
Tate of acquisition also showed a larger number 
of extinction responses. A paired replicates test 
(Siegel, 1956) of these differences was signifi- 
cant (p = .03). The median difference in 
number of extinction responses was seven. 
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Discussion 

In this experiment, where the number of 
reinforcements administered was independent 
of the subject’s behavior, it is clear that a 
minimum number of reinforcements is neces- 
sary for conditioning to take place. Subjects 
who are given less than this minimum number 
generally show no evidence of conditioning. 
This result corroborates the findings in 
Experiment I. 

Allowing the duration of the conditioning 
period to vary permitted an evaluation of the 
parameter of rate of acquisition. It appears 
that number of reinforcements, within the 
range used in this experiment, in and of itself, 
does not relate to resistance to extinction. A 
more important factor is the time taken to 
receive these reinforcements, i.e., the rate of 
acquisition. When individuals were matched 
in terms of their operant levels and the number 
of reinforcements received, but allowed to 
differ in the rate of acquisition, it was shown 
that the faster the acquisition, the greater is 
the resistance to extinction. Very likely, as the 
number of reinforcements given increases, the 
rate of acquisition becomes a less important 
factor in determining resistance to extinction. 

This finding, however, has importance for 
the large number of brief interviews utilized 
by many psychologists. The interviewer who 
uses verbal reinforcement influences the 
interviewee's behavior, but his delivery of 
reinforcement is influenced in turn by the 
patient's behavior. This implies that the 
amount of influence (bias) that a given inter- 
viewer exerts will vary differentially with the 
rate of acquisition of each particular subject. 
Before an investigator can say that two groups 
of subjects, whether patients or normals, 
differ from each other in terms of the informa- 
tion, opinion, or feeling he has elicited, he 
must be able to show not only that his rein- 
forcement procedure is identical in both groups, 
but also that their reaction to reinforcement 
(rate of acquisition) is the same. 


SUMMARY 


The present studies were designed to 
investigate some parameters in the condition- 
ing of verbal affect in an interview situation 
with schizophrenic subjects. In the first experi- 
ment it was shown that the process of condi- 
tioning verbal affect is independent of the 
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temporal placement of the conditioning 
period—at the beginning or in the middle of a 
30-minute interview—and that the administra- 
tion of a minimum number of verbal reinforce- 
ments is necessary for the conditioning effect 
to take place. 

In a second experiment, where the number 
of reinforcements administered was independ- 
ent of the subject’s behavior and determined 
by the experimenter’s arbitrary assignment, it 
was shown that conditioning of verbal affect 
Statements occurs for those individuals who 
receive 10 reinforcements or more, but not for 
those who receive less. Furthermore, rate of 
acquisition was related to resistance to extinc- 
tion; those subjects who had the faster rates of 
acquisition showed the greater resistance to 
extinction. 
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EYELID CONDITIONING AND THREE PERSONALITY PARAMETERS? 


J. G. FIELD? anp J. C. BRENGELMANN?® 
Institute of Psychiatry, Maudsley Hospital, University of London 


ccEPTING the Hullian system, the 
A group has argued that high 
drive subjects will, under conditions 

where a single habit tendency is evoked, have 
a higher performance level than low drive 
subjects. One measure of performance was 
derived from conditioning the eyelid response. 

To measure drive, a questionnaire, the 
Taylor (1953) Manifest Anxiety (MA) scale 
was constructed. No attempt was made to 
make the MA unidimensional and there is 
strong evidence that it measures neuroticism 
(Eriksen & Davids, 1955; Kerrick, 1955) and, 
to some extent and negatively, extraversion 
(Franks, 1956b). 

Five experiments comparing extreme scorers 
on the MA are construed as supporting the 
Towa hypothesis (Spence & Farber, 1953; 
Spence, Farber, & Taylor, 1954; Spence & 
Taylor, 1951, 1953; Taylor, 1951). When the 
entire range of MA scores is used, the correla- 
tion with conditioning is .25 (Taylor, 1956). 

; Eysenck (1955) and Franks (1956a) ques- 
tioned the construction placed on these experi- 
ments. Also basing themselves on Hull's 
system, they suggested that the Iowa findings 
could equally well be accounted for by a 
different explanation, Individual differences in 
the performance of conditioned responses may 
be due not to differences in drive, but to 
differences in the growth and decay of habit 
Strength. This change in emphasis is supported 
by an experiment by Franks (1957a) in which 
ho differences in eyelid conditioning were found 
between a satiated (low drive) student group 
and one deprived of food and water (high 
drive), 

In an effort to unify diverse experimental 
findings in the psychology of personality, 

E The authors are indebted to Her Majesty's Prison 

ommission for permission to publish this article, but 
n does not imply that it represents the 
È mmissioner's views. The investigation was supported 

Y the Medical Research Council and the Home Office. 


E * Now at Research Services Ltd., the London Press 
Xchange. 


$ 
5 Now at the State Department of Institutions and 
&encies, State Colony, Woodbine, New Jersey. 


Eysenck (1957) has attempted to show that 
one major source of individual differences is 
attributable to differences in the excitation- 
inhibition balance of the central nervous 
system. Eysenck postulated that those indi- 
viduals in whom excitatory potential is 
generated slowly and is weak and in whom 
reactive inhibition is developed quickly, is 
strong, and dissipates slowly, tend to exhibit 
extraverted behavior patterns; conversely, 
those individuals in whom excitatory potential 
is generated quickly and is strong and in whom 
reactive inhibition is developed slowly, is 
weak, and dissipates quickly, tend to exhibit 
introverted behavior patterns (p. 114). 

A number of experiments have attempted to 
assess the relative importance of drive and 
extraversion-introversion in the contribution 
they make to the individual differences in 
eyelid conditioning. The measures of drive have 
been the MA, the Cycloid and Depression 
scales (Guilford, 1940), the Maudsley Medical 
Questionnaire (Eysenck, 1952), or the 
Maudsley Personality Inventory (MPI) 
Neuroticism scale (Eysenck, 1956). These 
scales are highly correlated with the MA and 
have been independently validated against 
psychiatric diagnoses. The measures of 
extraversion have been the Rhathymia scale 
(Guilford, 1940) and a development of it, the 
MPI Extraversion scale (Eysenck, 1956). 

Two experiments support the hypothesis 
that it is the differences in the excitation- 
inhibition balance, as measured by extra- 
version scales, which make the more significant 
contribution to eyelid conditioning. The MA 
or equivalent scales fail to make a significant 
contribution, or do so to a significantly lesser 
degree (Franks, 1956a, 1957b). When subjects 
are unselected instead of in extreme groups, 
the correlation between extraversion (MPI) 
and conditioning is —.46 (acquisition) and 
—.34 (extinction trials) (Franks, 1957b). 

Two other experiments provide consistent 
but not significant results favoring the 
Eysenck-Franks rather than the Iowa hypoth- 
esis (Das, 1957; Willett, 1960). In the in- 
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vestigation reported below, a further attempt 
is made to assess the relative importance in 
eyelid conditioning of drive and excitation- 
inhibition balance as measured by the MPI 
Neuroticism and  Extraversion scales. In 
addition, the contribution oj "rigidity," a 
source of individual differences not previously 
considered in this context, is investigated. 

Rigidity has been variously described as 
resistance to change, stereotypy, inflexibility, 
extremeness, dogmatism, intolerance, etc. 
Such concepts have been repeatedly criticized 
on empirical grounds and their generality 
questioned (Jenkins &  Lykken, 1957). 
Brengelmann (1960c, 1960d) has followed up 
their suggestive importance with regard to 
learning. Apart from the concepts mentioned, 
achievement motivation (“drive”) has been 
confirmed as one aspect of rigidity by the 
finding that scores on performance output 
(Brengelmann, 19603) and aspiration level 
(Brengelmann, 1959a, 1960a) are correlated 
significantly with rigidity scales. 

According to Brengelmann (1960a), the 
relationship between learning and rigidity 
depends on two effects of this personality 
Syndrome: interference or inhibition, on the 
one hand, and drive or facilitation, on the 
other. The interference aspect has been 
repeatedly confirmed in the form of a negative 
relationship between performance and rigidity 
for perceptual-motor learning, reproduction 
learning, and immediate recall and recognition 
learning (Brengelmann, 1957, 19602). 

The drive effects require the assumption of a 
curvilinear relationship between rigidity and 
task difficulty (novelty, stress), similar to the 
Yerkes-Dodson law recently revived by 
Eysenck (1955) and Broadhurst (1959). 
According to this law, learning efficiency 
increases with task difficulty until a certain 
optimum level is reached and decreases there- 
after. At the extreme levels of difficulty, drive, 
either absolutely or relatively, inhibits per- 
formance. This interpretation of rigidity has 
been confirmed in experiments using normal 
and abnormal subjects with regard to improve- 
ment with practice or output (Brengelmann, 
19602). For the high rigid subjects, improve- 
ment increases with difficulty of task up toa 

point of high difficulty and sharply decreases 
thereafter. For the low rigid subjects, the 
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opposite effect is obtained. Medium levels ç 
rigidity assume an intermediate position. 

Many investigators equate a high rate o 
conditioned responses with good performance 
Conditioning may also be considered as simpl 
learning—learning not to respond to stimul 
irrelevant, distracting, or interfering with 
respect to the task in hand. With respect te 
another more complex task, the same stimuli 
may be relevant. Situations favoring condi 
tioning in the former situation will be detri 
mental in the latter. This view is supported by 
the generalization emanating from Brengel- 
mann’s work that whenever a personality 
variable correlates negatively with learning, 
its correlation with conditioning is found to be 
Positive and vice versa. Accepting the widely 
practiced equation of eyelid conditioning with 
simple learning, rigidity is predicted as corre- 
lating positively and linearly with eyelid 
conditioning, 

This prediction is consistent with the argu- 
ment of Bindra, Paterson, and Strzelecki 
(1955) that the threshold to the blink is 
lowered by muscle tension, which, it has been 
suggested (Brengelmann, 1960b), underlies 
the excessive expressive movement found in 
the highly rigid. 

The prediction may also be thought com- 
patible with the Iowa hypothesis that higher 
drive favors the response with the greatest 
habit strength to the detriment of other 
responses. As applied to complex tasks, 
however, the argument loses consistency (Hill 
1957). Further, its predictions for a negative 
relationship between measures of drive and 
performance in learning are not significantly 
supported in Brengelmann's (19602) experi- 
ments. 


METHOD 


The subjects of the experiment were 75 young 
men aged 25.81 + SD 3.21 years. They were 
unselected samples of consecutive receptions at one 9 
Her Majesty’s prisons, where they were investigat 
at a research laboratory set up with the permission 0 
the prison commissioners. These men had sevi 
previous convictions (6.45 = 2.54), of which five-sixths 
were for property offences, and some previous experi 
ence of prison sentences (22.31 + 1.59 months), Abou 
one-third of each sentence would have been remitted: 
Their intelligence, measured by a timed version of the 
Progressive Matrices (Raven, 1954), is normal in 
mean value and somewhat platykurtic in distribution: 


3 Their occupations, if any, were mostly unskilled. 
Whether such subjects are more or less representative 
fl of the universe conventionally sampled by student 
| volunteers or mental patients will not be discussed here. 
The eyelid conditioning procedure of the Franks 
| experiments was duplicated. The CS was a 1110 cps 
| tone at about 65 db above threshold, established by 
| the method of limits, of 800 msec. duration. After 350 
| | msec. the UCS, a puff of air, lasting 500 msec., at a 
Í pressure of about 65 mm. of mercury at not less than 2 
cms. from the right eye was administered by an elec- 
tronic device. A partial reinforcement schedule was 
T used, giving an overall reinforcement ratio of 60%. The 
| intertrial interval varied between 20-30 seconds. To 
"reduce the validity of the criticism that the individual 
differences in the performance of the blink are present 
before the acquisition trials, any subject blinking to any 
of six CS of nine pre-acquisition trials (3 CS, 3 UCS, 
3 CS in that order) were excluded. This left 42 subjects. 
Franks (19562) only excluded subjects if they blinked 
to either or both of the last two of these CS. The con- 
| ditioning Score was the total number of CRs over 18 
test and 10 extinction trials between 150 and 625 
msec. from the start of the CS. The maximum score 
possible was 28, 
Three sets of four questionnaires each were used to 
assess the contribution of neuroticism, extraversion, and 
íl rigidity to individual differences in conditioning. The 
neuroticism and extraversion scales were administered 
in groups within 14 days of the conditioning. The 
Tigidity scales were administered by post some months 
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later. Retaining those who adequately completed the 
rigidity scales reduced the experimental group to 38 
subjects. When the conditioning records were analyzed, 
it was found that five subjects had provided no measur- 
able conditioned response. It is arguable that these 
subjects should be excluded because the absolute 
failure to condition may be due to lesions in the cns, 
misunderstanding, lack of cooperation, apparatus 
defect, or other extraneous factors. This would leave 33 
subjects in the experimental group. The results are 
reported separately for the groups of 42, 38, and 33 
subjects. 

The four neuroticism questionnaires used were the 
MPI Neuroticism (N) scale, the Cattell second-order 
Anxiety (Anx) factor scale (Cattell, 1957), and the 
MMPI Psychasthenia (P#) and K scales (Hathaway, 
1946; Hathway & McKinley, 1943). High scorers on the 
K scale are expected to score low on neuroticism. These 
scales contained 24, 40, 48, and 30 items, respectively. 
The Anx, Pt, and K scales were selected from a factor 
analysis on the 42 subjects as having the highest load- 
ings on a putative factor of neuroticism. The four 
questionnaires used to measure extrayersion-introversion 
were the MPI Extraversion (E) scale, the second-order 
extraversion factor scale or Contact Personality Factor 
(CPF), Forms A and B summed (Cattell & King, 1955), 
the MMPI Hysteria (Hy) scale (Hathaway & 
McKinley, 1943), and the Activity Preference (AP) 
scale, designed to measure “anxiety” independently of 
neuroticism and shown to differentiate groups differing 
in conditioning performance (Lykken, 1957). Extraverts 


TABLE 1 
MEANS AND STANDARD DEVIATIONS OF PERSONALITY SCALES AND CONDITIONING FOR EXPERIMENTAL AND 
CONTROL GROUPS 
M 2- SD 
f Experimental group Controls 
z Group II 
k (W = 42) (N = 38) QN = 33) Group I guna 
Neuroticism 
MPIN 25.12 + 11.10 | 25.37 +: 11.37 | 25.33 + 12.13 | 14.48 + 9.40 | 20.99 + 10.12 
| Bor 30.07 + 10.56 | 30.65 + 11.07 | 30.90 + 11.52 — 2 z5 Pp 
n. 13.60 3: 7.79 | 14.13 £ 8.33 | 14.61 Æ 8.39 m 9:67 £ 622 
ae 10:83 4 5.22 | 10.92 3 5.38 | 10.94 Æ 5.71 zn 15.04 £ 4. 
TSlon 
MPLE 27.43 4. 8.55 | 27.97 + 8.78 | 27.79 + 9.29, | 24.00 + 7.25 | 29.30 - 8.36 
67.00 + 10.69 | 68.00 + 10.73 | 67.45 + 11.37 — EST E P 
16.31 + 4.97 | 16.82 + 4.78 | 16.24 + 4.54 = 16. 1 Hert 
19.69 + 4.31 | 20.10 + 4.15 | 20.24 + 4.35 = 19.37 + 4. 
a 17.68 + 5.27 | 17.27 + 4.91. | 13.16 + 4.91 — 
ru 16.61 + 5.43 | 16.15 + 5.32 9.44 + 6.26 — 
T 20.68 + 4.54 | 20.67 + 4.35 | 15.28 + 6.80 — 
f 15.79 + 5.88 | 16.00 + 6.16 7.56 + 7.10 —— 
— 4.37 
8.21 4 8.13 + 5.40 | 9.36 + 4.69 6.40 + 
3.69 = 348 3.55 + 3.23 4.09 + 3.13 — 2.57 m 2.65* 
11.62 2: 8.20 | 11.55 + 8.38 | 13.30 + 7.71 ae 


a 
p From Cattell (1957), obtained on 759 normals. 


: E tom Carpenter (1959), obtained on 73 United States servicemen. 
tom Franks (19572), obtained on 55 male students. Their Acquisition score was 7.18 +: 5.12. 
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TABLE 2 
CORRELATIONS OF SCALES WITHIN EACH PERSONALITY PARAMETER AND WITH CONDITIONING 
Fere iy soes Conditioning 

WEES 

(N = 33)| N | An | Pt | K | E | CPF | Hy | AP* |CPIR| NR | IA | Do P 1i 1 

5 E E 
-> ao ** je —04 —09 -11 

dur EY fis ES e pus | 05} 03 |-01 

Pt S682 3t zw [6292 07 | 08 00 

Ke 59** | 62** | 70** | — 02 |-04 |—01 

—| 62**| 21 | 40* —08 |-17 |-17 

GEF 64" — 00 | 43** —13 |-15 |-09 

Hy 34*|-04 | — | 14 01 | 04 | i 

AP* 43* |} 42*) 11) — —06 | 06 12 

— |64**| 19| 29) — 15 33 

NR 60**| — 19 | 29 — 09 27 
IA ZOU 27: —| 33%) — 06 09 1 
Do 40* | 34* | 44*| — | — | da** 46" 


* Sign reversed. 
* Significant at beyond the 5% level. 
** Significant at beyond the 1% level. 


are expected to score low on this scale. These scales 
contained 24, 34, and 34, 60, and 32 items, respectively. 
The CPF, Hy, and AP scales were selected from the 
factor analysis mentioned as having the highest loadings 
on a putative factor of extraversion. None of the MMPI 
scales were T corrected. 

Rigidity was measured by four questionnaires used 
in the Brengelmann studies (1959b, 1960c, 1960d). 
These were mainly derived from the California Per- 
sonality Inventory Rigidity (CPIR) Scale (Gough), 
the Rigidity (NR) scale and Intolerance of Ambiguity 
(IA) scale (Nigniewitsky), and the Dogmatism (Do) 
scale (Rokeach). After item analysis by Brengelmann, 
these scales contained 15, 14, 17, and 16 items, respec- 
tively. The four scales have been found to correlate 
significantly and positively in a number of experiments 
(Brengelmann, 1959b; 1960c). In normal samples, 
rigidity does not, as a rule, correlate significantly with 
neuroticism and extraversion. In abnormal samples, 
rigidity and neuroticism are positively and significantly 
related, though to a slightly lesser extent than neuroti- 
cism and introversion. 


Rrsurrs 


Table 1 shows the values on the personality 
scales and on conditioning for the present ex- 
perimental subjects. The data from two other 
groups (males) are included for comparison 
purposes. Group I (V = 25) resembles the 
experimental group in age (27.16 + 4.26) and 
occupational level. Group II (N — 80) was 
composed of industrial apprentices in the age 
range 16-19 years (Willett, 1960). It is appar- 
ent from Table 1 that the experimental group 


was significantly more neurotic and rigid 
(p < .01) than Group I. The means in Group 
II are consistent with this finding. 

"Table 2 shows the relationships between the 
three personality parameters and conditioning: 
for the experimental group. The neuroticism 
scales fail to account for any significant portion 
of the conditioning variance. Although the 
scales intercorrelate significantly and con: 
sistently, only some of the correlations with 
conditioning are in the direction expected on 
the basis of the Iowa hypothesis. The extra 
version scales are not all significantly intet- 
correlated, nor entirely consistently. None 
significantly correlate with conditioning ue 
only some of them correlate in the predio 
direction. Comparing the MPI N and P 
scales, however, E does contribute more ip 
the conditioning variance than N in E 
direction predicted on the basis of the Eysenc 4 
Franks hypothesis. All rigidity scales bk 
correlate positively, but only partly s 
cantly. Similarly, the correlations between t 
rigidity scales and conditioning are all p. 
sistent with the prediction, and dogmat* 
provides a highly significant correlation y 
conditioning, occupying about a fifth of E 
conditioning variance. For the acquisiti? 
trials alone, this correlation is .44 (N = 3 a 

As only one of the four rigidity scal 
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correlates significantly with conditioning, a 
multiple correlation was run between the four 
rigidity scales and conditioning. The multiple 
correlation was .46, significant beyond the 5% 
level (or .55, NV = 33, p < .05). 

Figure 1 presents the conditioning curves 
for acquisition and extinction for two sub- 
groups high and low on rigidity, selected from 
the experimental group of 33 subjects. The 
high rigidity group comprises the 11 subjects 
who obtained the highest total scores summed 
over the four rigidity questionnaires, The low 
rigidity group comprises the 11 subjects who 
obtained the lowest total scores summed over 
the four questionnaires. The broken lines in 
Figure 1 represent the mean number of CRs, 
taken four at a time for each subgroup. 


Discussion 


1 Of „the three personality parameters, 
euroticism, extraversion, and rigidity, only 


| sid contributed a significant amount of 
| iine to individual differences in eyelid 
Conditioning, Tt should be remembered that 


ES Way in which the experiment was carried 
Was biased against these findings. The 


rigidi . 3 Eth 
‘tigidity questionnaires were administered 


wi ES . 
Eos Supervision, when the subjects had 
moved to other surroundings, and when 


E. s E 
Onsiderable time since the rest of the testing 


ha, E 
Rae peed Further, the rigidity scales were 
^ shorter than the neuroticism and extra- 


. Version scales, 


p neither extraversion nor neuroticism 
ures reliably correlate with conditioning, 
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TABLE 3 


COMPARISON OF SAMPLES AND EXPERIMENTS 
TESTING THE EYSENCK HYPOTHESIS 


Condi- 

Experiment Sample | Extrayersion | "Ene 

version 

correlation 

Franks (1956b) | 20 Dysthymics | 20.60 + 6.2 | —.15 
Franks (1956b) | 20 Hysterics | 49.60 + 11.228) —.40 
Franks (1956b) | 20 Normals | 40.45 + 11.63%) —.05 
Franks (1957b) | 55 Students | 26.18 + 8.989 | —.46* 
Das (1957) 63 Students | 25.58 + 9.52] —.08 
Willett (1960) 80 Adolescents | 29.30 +: 8.36°|] —.08 


® Rhathymia scale of Guilford (1940). 
> Extraversion scale of Eysenck (1956). 
* Significant at beyond the 1% level. 


the original repeated findings with the MA 
can no longer be explained by the loading on 
extraversion noted by Franks (1956b). An 
alternative explanation is that the MA has a 
loading on rigidity. Maltzman, Fox, and 
Morrisett (1953) provide experimental support 
for this notion. Brengelmann (1960c) also 
found that rigidity correlated with the MA 
among neurotics. It may be that among 
normals the MA scores are too positively 
skewed for a correlation to appear (Wesley, 
1953). On the other hand, although the 
correlation between extraversion and condi- 
tioning in the present investigation is not 
significantly different from zero, it is also not 
significantly different from the correlations 
found by Franks (1956a, 1957b). The samples 
and experiments testing the Eysenck hypoth- 
esis differ from each other in a number of 
respects, as is shown in Table 3. The somewhat 
unconventional setting of the present experi- 
ment can readily be supposed to influence the 
subject-experimenter interaction in a way that 
is untypical of university laboratories. There 
is, nevertheless, a consistent negative tendency 
in whatever relationship has been found 
between eyelid conditioning and extraversion 
quéstionnaires. 
SUMMARY 

The relative contributions to individual 
differences in eyelid conditioning of neuroticism 
and extraversion were assessed on the basis of 
the Spence-Taylor and the Franks-Eysenck 
hypotheses. In addition, a new parameter, 
rigidity, was predicted as positively related to 
conditioning on the basis of Brengelmann’s 
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previous work. Personality was measured by 
12 questionnaires. The Franks conditioning 
procedure was repeated, except that a more 
severe criterion of pre-acquisition blinking was 
used. Conditioning was measured by the 
number of blinks occurring to 28 tones, of 
which 18 were a part of a partial reinforcement 
schedule, and 10 were consecutive extinction 
trials. Seventy-five unselected normal male 
prison inmates were conditioned; 42 met the 
criterion, and 38 completed all the question- 
naires. Only the rigidity measures provided a 
significant positive correlation with condition- 
ing as predicted. The finding for extraversion 
and neuroticism was consistent with the 
Eysenck-Franks rather than the Spence-Taylor 
hypothesis, but not significantly so. 
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SOCIAL AND PSYCHOLOGICAL VARIABLES IN 
LEARNING HEBREW: 


MOSHE ANISFELD anp WALLACE E. LAMBERT 
McGill University 


HE role of social attitudes in the learning 

of foreign languages has caught the 

attention of social scientists as well as 
language teachers (Nida, 1957-58; Vachon- 
Spilka, 1959; Whyte & Holmberg, 1956). 
Gardner and Lambert (1959) and Gardner 
(1960) have carried out empirical studies of 
this aspect of second language learning. The 
available findings indicate that the attitude of 
a student of a second language towards the 
other language community plays an important 
role in his acquisition of the other language. 
Students who hold sympathetic attitudes 
toward the other language community are 
found to be more successful in acquiring a 
second language than those holding unfavor- 
able attitudes, 

Several theoretical positions have been 
presented to account for these findings (Anis- 
feld, 1960; Ervin, 1954; Gardner, 1960). Our 
own theoretical scheme, presented in detail 
elsewhere (Anisfeld, 1960), emphasizes the 
generalization of attitudes developed toward a 
linguistic group to the language they speak 
(and vice versa). From this perspective, a 
language has potential positive or negative 
reinforcing value. 

Since Hebrew is associated exclusively with 
the Jewish people, unlike many other languages 
that are spoken by various cultural groups, the 
generalization hypothesis seemed particularly 
appropriate for this language. The specific 
hypothesis was posed that the relation between 
attitudes toward the other language group and 
language achievement, found to hold for 
other languages, should be particularly promi- 
nent in the case of Jewish children learning 
Hebrew, in which their own language is 


1 This research was supported by the Canadian 
Defence Research Board, Grant Number D77-94-01-10. 

We are grateful to S. Cytrynbaum for his assistance 
in the administration and scoring of the tests, to D. 
Solomon (McGill) and L. Rosenberg, research director 
of the Canadian Jewish Congress, for their critical 
readings of a preliminary draft relating to the charac- 
terization of the various Jewish congregations in 
Montreal. 


learned as a second language. A favorable 
attitude towards belonging to the Jewish 
community should facilitate the childrens 
acquisition of Hebrew. On the other hand, ai 
attitude of marginality vis-à-vis the Jewish 
group should interfere with achievement in 
Hebrew. 

Another related prediction follows from 
Gardner and Lambert's (1959) distinction 
between "integrative" and “instrumental! 
orientations towards the study of a foreign 
language. In their study, students’ purpos 
for studying a foreign language were classified 
as integrative when the aim was to “lea 
more about the language group, or to meet 
more and different people,” and as instrumentil 
when the reasons reflected the “more utili 
tarian value of linguistic achievement" such) 
as getting a better position, becoming bettet) 
educated. In this study, we attempt to test]. 
in the case of Hebrew, the findings of Gardnel 
and Lambert (1959) and Gardner (1960) tha! 
integratively oriented students are bettet 
learners of a second language than are instru- 
mentally oriented students. 


results similar to those found for other lan 
guages in similar training situations. 


METHOD 


Achievement Measures 


Scores on the following tests were obtained: d 

1. Teachers’ ratings. The Hebrew instructor of e% 
class was asked to assign to his students the ust 
school marks for achievement in Hebrew with as a 
distribution as possible. In some cases, teachers Wel 
approached a second time and asked to rank ost 
students who were given the same mark. Since oie 
teachers tended to stress reading ability and um " 
standing of written language, we asked them ur 
special attention when assigning ratings to e 
students’ ability to speak and comprehend Hebr 
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Jn one class (Class A) which was at a sufficiently ad- 
vanced level to enable the teacher to make a distinction 
between her students' comprehension of Hebrew and 
their oral skills in the language, she was asked to rate 
them separately on these two aspects of performance. 

2-4. Voice recordings. For one school 2-minute tape 
recordings of the subjects' readings of a difficult Hebrew 
passage were obtained. These recordings were eval- 
uated by two Hebrew experts on each of the following 
three aspects: Reading Fluency (2), Pronunciation 
Accuracy (3), Purity of Accent (4). 

The score on each skill was taken as the combined 
"oa of both judges. The interjudge agreement was 

lo. 


Ability Measures 


|... Verbal Reasoning test. This test, taken from the 
Differential Aptitude Tests (Bennett, Seashore, & 
Wesmen, 1959), was used as a measure of intelligence. 
Because of time limitations the time allowance for this 
test was shortened from 30 to 15 minutes. 

6. IQ. Estimates of IQ were available for one class 
(Class A) only, These estimates were based on the 
Otis Self-Administering Tests of Mental Ability which 
had been administered by the Jewish Vocational 
Guidance Centre in 1958. 

7. Psi-Lambda Foreign Language Aptitude Battery, 
Short Form (Carroll & Sapon, 1956). The tests used 
were: Spelling Clues, Words in Sentences (shortened 
from 15 to 10 minutes), and Paired Associates. 


Attitude Measures 


8. Anti-Semitism (A-S) scale. This scale is composed 
of seven items, of which six were selected from the 
CNN A-S scale (Adorno, Frenkel-Brunswik, 
vinson, & Sanford, 1950) on the basis of their fitness 
Dr Jewish subjects. Minor changes in the wording of 
d items were introduced to make them appropriate 
or Canadian Jewish students. We employed this scale 
5 a measure of Jewish “self-hatred.” 

.9. Orientation Index. On the basis of preliminary 
NO os with Jewish educators and analysis of 
Ompositions written by Jewish pupils in Montreal on 
Opics related to study of Hebrew and other languages, 
Er reasons for learning Hebrew were com- 
nents Nine judges were asked to classify these state- 
2 ihe m either integrative or instrumental according 
ad lefinitions of Gardner and Lambert (1959). The 
Ee ion of inclusion in the scale was agreement of at 
ix DM judges on the categorization of an item. 
M. AR NEC integrative and three instrumental, 

; ils criterion and were finally used to make up the 

entation Index? For each of these statements the 


* The integrative reasons were: 

2. Hebrew is necessary to keep the Jewish 

People together. 
- I want to become more a part of the Jewish 

culture. 

5. In order to be a good Jew I have to know 
Hebrew. 

e Instrumental reasons were: 

1. One can enjoy a more successful life in the 
community with a knowledge of Hebrew. 
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subject was requested to state, along a three-point 
scale, the degree of importance the particular reason 
had for him. 

The integrative and instrumental items were scored 
separately, and then the instrumental score was sub- 
tracted from the integrative score to obtain an index 
of the balance between the two orientations for each 
individual. Thus the Orientation Index measures how 
much more (or less) intense the student’s integrative 
motive is than his instrumental motive for studying 
Hebrew. 


Subjects 

Although 11 classes in seven Jewish parochial 
schools in Montreal were contacted, for reasons de- 
scribed elsewhere (Anisfeld, 1960), only 7 classes were 
used for the final analysis. The main reason for not 
analyzing all classes was that the samples were so 
small. In this paper we report the results of 6 classes 
from three schools in different settings in Montreal. 
(Because of doubts regarding the validity of the achieve- 
ment measures of the seventh class, it is not included 
in the present discussion.) The schools as well as the 
classes so obviously differed in level of training, ex- 
perience required, and regularity of attendance, in 
addition to having different instructors, that it was 
appropriate to treat each class as a separate sample 
(Guilford, 1950, pp. 352-354). A description of each 
class is therefore in order. 

Class A (N = 16) is a Grade 9 class in an all-day 
Hebrew school. All children have attended the ele- 
mentary branch of this school, so that when tested 
they were in their ninth year of experience in Hebrew 
studies. The children study the Hebrew language for 
5 hours weekly in addition to their contact with the 
language in other Jewish subjects, such as Bible and 
Talmud study, and reading of prayers. 

Class B (N = 11) is a Grade 8 class in another 
school. All students except for one have already gone 
through the all-day elementary branch of the school, 
and they now attend only Sunday and afternoon 
classes. They receive 3.5 hours of instruction in Hebrew 
subjects and 2.5 in Yiddish subjects during the week. 

We also tested four classes in School C (V = 54) 
which is the largest and wealthiest Jewish afternoon 
school in Montreal. We will refer to these classes as 
Class C (8a), Class C (8b), Class C (9a), and Class C 
(9b), with the number denoting grade, a and b standing 
for the “better” and the “poorer” classes of students, 
respectively. In this school the children receive 4 
hours of instruction in Jewish subjects during the 
week. Most pupils have been attending this afternoon 
school since Grade 1. 

The students were divided into better and poorer 
classes when they completed Grade 2. We wanted to 
understand on what basis this classification was made. 
The classes were therefore compared on the following 
variables: Verbal Reasoning (Variable 5), and Anti- 
Semitism (Variable 8). The ¢ tests indicate that only 


3. In the future it may prove useful in getting a 
job. de 

4. 1 wish to enter a profession in which it is 
necessary to be familiar with Hebrew. 


526 


the differences in language aptitude are significant. 
The ¢ values for the differences between the a and 6 
classes on each grade level are significant beyond the 
.02 level (on two-tailed tests). It appears reasonable to 
conclude that the factor underlying the differences in 
achievement between the a and b classes is language 
aptitude, 


RESULTS AND DISCUSSION 


In order to test our hypotheses relating 
both ability and attitude to achievement in 
Hebrew, the correlations of these two classes 
of prediction variables with achievement 
measures were computed for each of our six 
subsamples. 

The heterogeneity of the classes in our 
study not only called for statistical treatment 
of each group separately but also prevented us 
from averaging the coefficients to obtain a 
single estimate of the population correlation. 
Since some of the correlations of the prediction 
variables with achievement measures differed 
significantly from class to class, we found it 
unjustified to assume that “the several r’s did 
arise by random sampling from the same popu- 
lation,” an assumption necessary to allow the 
averaging of coefficients (Guilford, 1950, p. 
355). Furthermore, there is also other evidence, 
to be presented later, that suggests that the 
Schools investigated represent samples from 
different populations. 


Ability and Achievement 


Table 1 presents the correlations of our two 
indices of ability (intelligence and linguistic 
aptitude) with achievement measures, Linguis- 
tic aptitude as measured by the Psi-Lambda 
battery correlates quite highly with achieve- 
ment measures for all classes. We notice in the 
Class A rubrics in Table 1 that the correlations 
between Foreign Language Aptitude and 
Purity of Accent and Oral Skill do not reach 
significance. This appears to be in line with the 
findings of Gardner (1960) who reports in- 
significant correlations between each of the 
subtests of the Short Form Psi-Lambda bat- 
tery (the same that was used in the present 
study) and accent characteristics, However, 
the Short Form does not include the Phonetic 
Script test which measures the "ability to 
‘code’ auditory phonetic material in such a way 
that this material can be recognized, identi- 
fied, and remembered over something longer 
than a few seconds. ...[an ability which is 


MOSHE ANISFELD AND WALLACE E. LAMBERT 


TABLE 1 


CoRRELATIONS OF ABILITY VARIABLES 
WITH ACHIEVEMENT MEASURES 


Ability measures 
Achievement measures e Verbal 
Lambda | Reason- | 10d 
Class A (N = 16) 
Fluency .599** 079 2M 
Pronunciation Accuracy 613°" +098 164 
Purity of Accent +420 +093 +068 
Comprehension (Teacher’s .667*** .492 546" 
ratings) 
Oral Skill (Teacher’s ratings) | .432 377 «321 
Class B (N = 11) 
"Teacher's ratings B17*** .663* 
Class C (8a) (N = 13) 
"Teacher's ratings 532 409 
Class C (8b) (N = 11) 
Teacher’s ratings .621* .T00*** 
Class C (9a) (N = 14) 
"Teacher's ratings 417 —.245 
Class C (9b) (N = 11) 
Teacher's ratings .647* Tute 


* Significant at .05 level; two-tailed test. 
** Significant at .02 level; two-tailed test. 
*** Significant at .01 level; two-tailed test. 


important not only in] remembering phonetit 
material (words, forms, etc.) but also in mim 
icking speech sounds” (Carroll, 1960, pp 
46-47). The auditory coding ability and thé 
ability to mimic are obviously necessary for 
speech production; it therefore seems that 
Gardner's conclusion is not justified. As 4 
matter of fact, in Gardner’s (1960) study thé 
correlation between the Spelling Clues testi 
which gauges the ability to produce sounds, 
and accent comes very close to significance, 
do also the correlations of the total Short For 
with Accent and with Oral Skill in the present 
study (Table 1). 

Turning to intelligence, we observe in Table 
1 that for Class A (which is at a sufficiently 2% 
vanced level of proficiency in Hebrew to make 
possible a distinction among several aspects 4 
linguistic skill) both Verbal Reasoning 2% 
IQ show differences in their correlations am 
various aspects of performance in Hog 
Both indices of intelligence correlate big y 
with Comprehension of Hebrew and with J. 
Skill but not with Fluency, Pronunciation Ac 
curacy, and Purity of Accent. This finding H 
pears to support the view (Dunkel, 1948) tha 
intelligence plays a differential role in M 
language domains, with success in the m° 


r 
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. cognitive tasks, represented by Comprehen- 


sion, being more dependent on intelligence 
than success in tasks involving mainly oral 
production. The considerable magnitude of the 
correlations of Verbal Reasoning and of IQ 
with the Oral Skill ratings is probably due to 
the fact that these ratings were assigned by 


- the teacher who, very likely, took into account 


to some extent the cognitive aspects of oral 
proficiency, such as the ideas a student ex- 
presses and the way he presents them. 


Altitudes and Achievement 


Table 2 summarizes the relations between 
each of the attitude variables and the measures 
of achievement. The correlations of the Anti- 
Semitism (A-S) scale and of the Orientation 
Index with achievement measures seem to be 
dependent upon the particular classes ex- 
amined. For Classes A and B, the A-S scale 
1s a reliable predictor of achievement in 
Hebrew whereas the Orientation Index is not; 
the reverse is generally true for the C classes. A 
possible explanation for this patterning is sug- 
gested by a consideration of the differential 
Social characteristics of our samples. Classes A 
and B are located in and draw their students 
from a dwelling district in Montreal (Outre- 
Mont and vicinity) which is lower in the socio- 
economic class hierarchy than School C (lo- 
cated in the Westmount suburb)? 

Briefly, the parents of our students in the 
Outremont district are first generation North 
oetans; they still bear revealing traces of 

eir old-fashioned Jewish background which 
they apparently dislike and from which they 
aye to free themselves. The Outremont 
E are or have been day students for 7 
Heb years, and they are advanced enough in 

E rew that both the teachers’ ratings and the 
objective tests of achievement can be re- 
Eu as reliable discriminators of their ability 
Ed. anguage. Consequently, our finding of a 
D. “ed between attitudes toward the 
thould TD (A-S) and proficiency in Hebrew 
h € taken as encouraging support of our 

Ypothesis, On the other hand, the failure to 
zu the same relation for three of the West- 

Unt classes is not necessarily conflicting evi- 


3 A 
E following characterization of the Jewish 
on a ae y these districts is based to a great extent 
Seidel (193 logical study of Montreal Jewish areas by 
939) and draws on the theory of Tryon (1955). 
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dence. Because the Westmount students are 
at a very elementary level of Hebrew and the 
individual differences between the students in 
achievement are slight, it seems likely that the 
teachers, when asked to rank their students in 
Hebrew skill, resort to considerations of other 
classroom behaviors which actually differen- 
tiate these students. The Anti-Semitism 
variable does not seem to be one of these be- 
haviors. It is difficult to imagine why West- 
mount Jews should be anti-Semitic. They are 
already second-generation North Americans, 
have adapted to the American culture to a 
considerable extent, and are economically and 
socially secure. The dimension of own-group 
devaluation is comparatively meaningless for 
Westmount Jews. It is our belief that they are 
relatively unconcerned with feelings of self- 
devaluation. It follows that the variation ob- 
tained on the A-S scale for the C classes may 
well be chance variance,‘ and the scale should 
not be regarded as measuring for the West- 
mount students a latent variable around which 
psychological processes are organized—a con- 
ception of attitudes presented by Green (1954). 

The Orientation Index, measuring a desire 
tointegrate inthe Jewish culture, also seems to 
manifest differential sensitivity from class to 
class. The Outremont children apparently take 
their Jewishness as a matter of obvious fact 
and are not concerned with problems of inte- 
gration in the Jewish culture. For them the 
variance on the Orientation Index is pre- 
sumably chance variance and consequently 
this index does not predict achievement in 
Hebrew. In contrast, the problem of under- 
standing what being Jewish means, of estab- 
lishing a cultural identity, is of major concern 
to our Westmount students and their parents. 
It would not be surprising, therefore, if the 
Westmount teachers in evaluating a student’s 
standing in the Hebrew language, a major sub- 
ject in the Hebrew school, took into account 
the degree a student exhibits a willingness to 
integrate in the Jewish culture through the 
Hebrew school. Furthermore, to the extent 
that the teachers’ ratings do reflect true in- 
dividual differences in Hebrew achievement, the 
integration dimension would be a salient one 

4The two districts do not differ in mean scores 
and variances on either the A-S scale or the Orientation 
Index. 
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TABLE 2 


CORRELATION OF ATTITUDE VARIABLES 
WITH ACHIEVEMENT MEASURES 


Attitude measures 
Achievement measures 3 
gam a [Orientation 

Class A (N = 16) 

Fluency —.504* | —.329 

Pronunciation Accuracy — .488 —.143 

Purity of Accent — .438 — .078 

Comprehension (Teacher’s | —.589** +203 

ratings) 
Oral Skill (Teacher’s —.495* | —.054 
ratings) 

Class B (N — 9) 

"Teacher's ratings —.682* | —.405 
Class C (8a) (N — 15) 

"Teacher's ratings —.576* | —.612** 
Class C (85) (N — 11) 

"Teacher's ratings — .084 .632* 
Class C (9a) (N — 15) 

"Teacher's ratings —.160 —.514* 
Class C (9b) (N = 11) 

"Teacher's ratings —.075 —.371 


* Correlation is positive if “integrative” orientation is related 
to achievement; correlation is negative if “instrumental” orienta- 
tion is related to achievement. 

* Significant at .05 level; two-tailed test. 

** Significant at .02 level; two-tailed test 


for the Westmount students and therefore 
contribute to differences in success in Hebrew. 
These two explanations may account for the 
correlations of the Orientation Index found in 
the Westmount school. 
Two of the three significant correlations of 
the Orientation Index with teachers’ ratings 
are negative. We predicted a positive correla- 
tion between the Orientation Index and 
achievement, indicating that the integrative 
motive—which expresses a desire to become 
better integrated in the Jewish culture—is 
more conducive to success in Hebrew than the 
instrumental motive. This discrepancy sug- 
gests a reconsideration of the integrative-in- 
strumental distinction as defined by our index, 
It seems that in the case of Hebrew the instru- 
mental items really indicate a greater desire to 
integrate in the Jewish culture than the in- 
tegrative items, Wanting to get a job requiring 
knowledge of Hebrew (an instrumental reason 
for studying Hebrew) actually means be- 
coming a rabbi, a Hebrew teacher, or engaging 
in other Jewish professions which involve more 
participation in the Jewish culture and com- 
munity than the more idealistic and vague 
purposes of becoming “more a part of the 
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Jewish culture" (an integrative reason). The. 
situation is different for English-speaking 
children in Catholic schools in bilingual Mont- 
real where Gardner and Lambert (1959) and 
Gardner (1960) found that the integrative 
orientation led to superior achievement than 
did the instrumental orientation. In Montreal 
there are positions requiring knowledge of 
French which are clearly instrumental and do 
not imply any integrative feelings towards the 
French-Canadians. Our results may suggest 
that the reasons for studying a language, 
whether instrumental or integrative are mot 
important in themselves: a particular orienta- 
tion influences language acquisition only to the 
extent that it is indicative of attitudes towards 
the language community. 


Interrelations among Prediction Variables 
"This discussion suggests that for a student 
to do well in Hebrew, and probably in other 
languages as well, he needs to possess intelli- 
gence, language aptitude, and a proper at- 
titude toward the language community. In 
spection of the intercorrelations among these 
variables indicates that some of the correla- 
tions between the prediction variables and the 
achievement measures may be artificially high. 
The correlations of Psi-Lambda with Verbal 
Reasoning for the six classes under our con- 
sideration range from .47 to .83 with only one 
exception of a correlation of .19. (Similar re 
sults with a different intelligence test are doct 
mented by Gardner & Lambert, 1959.) Par- 
tialing out the intercorrelation from either of 
these variables would certainly reduce their 
correlations with achievement measures. 
Though much lower in magnitude, there is also 
some relation between A-S and Psi-Lambda. 
The correlations for Classes A, B, C (82), 
C (8b), C (9a), and C (95) are —.27, — 48, 09 
—.84, .01, and .01, respectively. A partial 
correlation between Psi-Lambda and achieve 
ment measures and between A-S and the same 
measures would lower some of the correlations 
reported in this paper. However, the preset! 
study has demonstrated the relevance of att 
tudes to the acquisition of a new language, bu 
because of the smallness of samples and 1 | 
equality of Ns for different variables it is 1 
possible to determine more precisely the 10% 
of each variable through factor analysis ” 
partial correlations. 5 
The relation between Psi-Lambda and A- 
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warrants interest apart from its possible in- 
fluence on the correlations of these variables 
with achievement. There appear to be at least 
two plausible cause-effect interpretations of 
this finding. The causal relation may rest be- 
tween language aptitude and anti-Semitism. 
The ability of a pupil to master the Hebrew 
language is a prerequisite to whatever success 
he may have in Hebrew school. The Hebrew 
school plays a central role in the Jewish com- 
munity, and whatever attitudes a student de- 
velops toward the Hebrew school hold for the 
Jewish community at large. The student with 
language aptitude would find it easier to learn 
Hebrew, and consequently do well in Hebrew 
school, be rewarded for his achievement and 
come to attach positive value to the Hebrew 
school and the Jewish community—an indirect 
Source of reinforcement. His favorable atti- 
tude to the Jewish cultural group could express 
itself as a low score on the A-S scale. In com- 
Parison, students who lack language aptitude 
would find it more difficult to learn Hebrew; 
Possibly they associate the Hebrew language 
and school with frustrating experiences which, 
when generalized as an attitude to the Jewish 
Community, would yield high scores on the A-S 
Scale. Such an explanation is consonant with 
our generalization hypothesis, suggesting that 
the generalization process works not only from 
a community to its language but also from a 
language to the language community. An al- 
ternative interpretation is that the negative 
Correlations between anti-Semitism and Psi- 
Lambda are due to the influence of some com- 
mon factor. Since the F Scale (which is highly 
related to A-S) does not correlate with the Psi- 
Lambda tests (Gardner, 1960; Gardner & 
Lambert, 1959), it is difficult to speculate 
about the nature of such a common factor. 


SUMMARY 


4 A study was conducted with Jewish eighth 
hd ninth grade students learning Hebrew. 

Sasures of achievement in the Hebrew lan- 
guage were correlated with scores on two 
Groups of tests: attitude tests which measure 
E Teasons for learning Hebrew and their 
ae udes toward the Jewish culture and com- 
toD» and ability measures including in- 
a ae and language aptitude tests. The 
amm RE our predictions about the rela- 
E of both ability and attitude to success in 

ming Hebrew. However, whereas intelli- 
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gence and linguistic aptitude are relatively 
stable predictors of success, the attitude meas- 
ures are less stable and vary in their relation- 
ship to achievement from school to school and 
from class to class. We attempted to explain 
these differences through considerations of the 
sociopsychological characteristics of the Jews 
in the various districts of Montreal where the 
schools are located. 
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THE EXTINCTION OF FEAR AS A FUNCTION OF DISTANCE - 
VERSUS DISSIMILARITY FROM THE ORIGINAL 
CONFLICT SITUATION: 


THOMAS ELDER, CHARLES D. NOBLIN 


Louisiana State University 


AND BRENDAN A. MAHER? 


Harvard University 


URRAY and Berkun (1955) have com- 
bined Miller's (1944, 1948) conflict 
and displacement models to form a 
three-dimensional scheme that generated pre- 
dictions concerning the simultaneous opera- 
tions of conflict and displacement. Conflict is 
derived from a set of assumptions relating re- 
sponse strength to the distance of the subject 
from the goal (Miller, 1944); displacement is 
based upon a similar set of assumptions con- 
cerning the difference in similarity between an 
original and other stimulus objects (Miller, 
1944). Murray and Berkun diagrammatically 
represent this three-dimensional system by two 
horizontal axes at right angles to each other, 
representing distance and similarity scales, 
respectively. A third axis extends vertically 
from the point of intersection of the two hori- 
zontal axes and represents a scale of response 
strength. By placing two points on each of the 
three axes, two planes are obtained: one repre- 
senting approach and one representing avoid- 
ance tendency, from which are derived specific 
predictions of behavior. 

Applying the model to the behavior of rats, 
they deduced that subjects trained to ap- 
proach-avoidance conflict in an original alley 
would, whenever possible, leave that alley to 
enter a different one and that the more dis- 
similar the new alley the nearer subjects would 
approach the goal. As a test of these deduc- 
tions, they first trained rats to run an alley 
for food and on later trials established conflict 
by administering shock while the subjects 
were eating. These subjects were then placed 
at the starting end of the original alley and 


1 This study was supported by funds from Grants 
G 3852/6433 from the National Science Foundation 
to Brendan A. Maher. 

It was read in abbreviated form at the annual 
meeting of the Southeastern Psychological Association, 
Atlanta, Georgia, 1960. 

2 In the Department of Social Relations at Harvard 
University. 
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could either remain in this alley or could leave 
and enter new and different alleys sica 
and parallel to the original. Windows were 
spaced at equal intervals along the common 
walls of adjacent alleys to permit crossing 
from one to another. Tracings of the move 
ments of the rats through the maze follow 
the pattern predicted; i.e., the subjects ran 
farther down the most dissimilar alley than 
down the moderately dissimilar alley, and 
farther down the moderate alley than down 
the original. They also reported that goal re 
Sponses occurred first in the most dissimilar 
alley, second in the intermediate, and finally 
in the original alley. Third, they observed that 
subjects entered the most dissimilar alley! 
through windows farther down the intet 
mediate alley than the windows used whél 
crossing from the original to the intermediate 
alley. These results were interpreted as coni 
firmation of deductions from, and support fo; 
the three-dimensional model insofar as it pit 
dicts the locus of "displacement" to be a f 
tion of the similarity between the original an 
the test situations. y 
A recent study (Maher, Elder, & Noblis 
1962) in which it was necessary to replicaté 
the Murray and Berkun (1955) experiment has 
confirmed all but one of these findings. D 
this it was found that rats tend to enter mort 
dissimilar alleys through the windows nene 
the start end of the runways rather than WI? 
dows spaced farther down the alleys. 
finding was combined with the observatio 
that Murray and Berkun always located : 
most dissimilar alley farthest away from ds 
original, thus confounding distance and à à 
similarity and making it impossible to dec! 
if the subjects are entering new and di x 
alleys as a response to distance cues OF eo 
response to similarity cues. d 
Tf distance is the critical factor, then by d 
leting the principle of generalization from 
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three-dimensional formulation, and extending 
the assumptions about distance of the subject 
from the goal to include the distance of other 
alleys from the original, a simplified model 
evolves from which the same prediction (i.e., 
the “therapeutic effect”) can be generated. 

The purpose of the present study was to de- 
vise a situation in which the spatial distance of 
dissimilar alleys and the dissimilarity of distant 
alleys from the original could be systematically 
altered, thereby subjecting to empirical tests 
two alternate deductions: Subjects trained to 
approach-avoidance conflict should either enter 
and run farther down the most dissimilar alley 
as predicted by the three-dimensional model; 
or as the simpler formulation predicts, they 
should enter and run down the alley most dis- 
tant from the original. 


METHOD 
Apparatus 


The apparatus (see Figure 1) consisted of four alleys 
each of which was 48 inches long, 5 inches wide, 6 
inches deep, white, grey black, and white, respec- 
tively. The alleys and walls of the alleys contained 
four 2.5 inch square windows equally spaced along the 
walls of the alleys, each of separate construction so the 
parallel arrangement could be varied. A food cup 1.5 
inches in diameter was attached to an end wall of each 
alley with the rim of the cup 3 inches from the floor. 
The floors of the alleys were covered with .25 inch 
hardware cloth. Hooks were placed on the outside of 
the end walls and a spring extended from the hooks 
on each end of the first alley to the hooks on the ends 
of the fourth alley to hold the four runways securely in 
Parallel. A removable cover was constructed of .5 inch 
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hardware cloth and placed on top of the four-alley 
combination. 


Subjects 


The subjects were 24 male, albino rats of the 
Sprague-Dawley strain. These were randomly assigned 
to six groups of four and placed on a 24 hour food 
deprivation schedule 7 days prior to runway training. 
All subjects were fed 8 grams of Purina dog chow cubes 
daily, and a plentiful supply of water was available at 
all times. 


Procedure 


On the first day of training each of the 24 subjects 
received 25 successive reinforced approach trials in a 
white alley, hereafter designated as the original alley. 
A windowless wall was inserted between the original 
and adjacent alley to prevent the subject from entering 
other alleys during training trials, This procedure was 
repeated on the second day. On the third day subjects 
each received 10 warm-up trials, after which they re- 
ceived shock at the goal end of the original alley. The 
.75 milliampere shock was administered when the 
subject made a circuit between the metal food cup and 
the hardware-cloth floor. Shock trials were continued 
until the subject no longer ran beyond a point 18 inches 
from the start end of the runway. 

Testing trials commenced immediately following 
the shock trials. The windowless wall between the 
original and the adjacent alley was removed and a 
wall with windows inserted in the maze. 

Since the original alley was the same for all subjects, 
only the parallel position of the others was altered. 
This permitted six possible parallel arrangements of the 
four alleys, with one group of subjects assigned to each 
arrangement, Thus each group was tested under a 
different color/position combination. 

A test trial consisted of placing the subject at the 
start of the original alley, the experimenter tracing 
its path through the maze. A trial terminated with the 


l 


Fic. 1. Parallel alleys for testing displacement. 
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subject’s making the goal response in any one of the 
Tunways or until a period of 2 minutes had elapsed. A 
goal response was defined by the subject’s reaching up 
and placing its nose down into the food cup. Each 
subject received a maximum of 4 test trials on the first 
day and 10 test trials on each day thereafter until a 
criterion of two successive goal responses in the original 
alley had been reached. 


RESULTS 


A mean distance score was computed for 
each subject by summing the number of 
eighth’s of the distance traveled down the 
alley on each trial, and dividing by the total 
number of trials in which the subject entered 
that particular alley. Such scores were com- 
puted for each of the 24 subjects, the mean re- 
sult for the entire sample being given in Figure 
2. The broken vertical line in the figure identi- 
fies scores obtained in the original from scores 
in the interchangeable alleys. 

In order to test the effects of distance and/or 
color, a mixed analysis of variance on distance 
scores in the interchangeable alleys was used. 
As the summary in Table 1 shows, the distance 
variable produced the only significant effect, 
Variance due to groups (i.e., between-subjects, 
Distance X Color interaction) approached but 
did not reach the minimum significance level, 
and color appears to have had little or no effect, 
An unexpected result was that trials in the 
first half did not differ significantly from trials 
in the second half. Figure 3 shows that the 
only difference observed between early and 
later trials was in the original alley (p < .001 
by a one-tailed sign test), The range of trials 
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Fic. 2, Mean distances traveled down each alley. 
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TABLE 1 
SUMMARY OF ANALYSIS OF VARIANCE ON 
Supyects’ MEAN Distance TRAVELED 
Down Atteys 2, 3, AND 4 


Source df SS MS Fu 

Between-subjects 11 | 145.04 
Groups (G) 2 47.61 | 23.80 | 2.20 
Error (b) 9 | 97.43 | 10.82 
Within-subjects 60 | 419.23 
Distance (D) 2 96.23 | 48.11 | 9.56* 
Color (C) 2 6.31 | 3.16 | .68 
Trial blocks (T) 1 2.51] 2.51] .50 
DXT 2 5.90 | 2.95 | .59 
CT 2 11.74 | 5.87 | 1.17 
GXT 2 9.78 | 4.89 | .97 
DXCwXT 2 2.84) 1.42} .28 
Error tw) 47 | 236.31 | 5.03 

"Total 71 | 564.27 

foe Ole 


| 
required to reach the criterion of two succes 
sive goal responses in the original alley was 
from 6 to 21 with a mean of 10.2 
To test Murray and Berkun's (1955) pre- 
diction that subjects should tend to use wit 
dows increasingly farther down toward the 
goal of increasingly dissimilar alleys, exit 
scores for the alleys along a distance as well as 
a similarity dimension were derived in the 
manner described by Murray and Berkun. The 
means are given in Figure 4. A one-tailed 
sign test between scores of Alleys 1 and 2, ? 
and 3, and 3 and 4 for both distance and colot 
failed to yield a significant difference. 


DISTANCE (EARLY TRIALS) 
DISTANCE (LATER TRIALS) 
COLOR (EARLY TRIALS) 
COLOR (LATER TRIALS) 


DISTANCE DOWN ALLEY IN 1/8s 


on 


ORIGINAL 
(WHITE) 


NEAR 


“AR 
INTERMEDIATE Ly) 
(WHITE) (GRAY) [rx 


ALLEY 


Fic. 3. Mean distance scores on early trials co™ 
pared with later trials, 


DISTANCE VERSUS DISSIMILARITY IN THE EXTINCTION OF FEAR 


DISTANCE. 


o—o 
COLOR e——5 


INTERMEDIATE FAR 
(BLACK) 


ORIGINAL 
(WHITE) 


NEAR 
(WHITE) (GRAY) 


ALLEY 
Fic. 4. Mean exit scores for each alley. 


Discussion 


The hypothesis derived from the approach- 
avoidance-displacement model that subjects 
should run farther down more dissimilar 
alleys is rejected. The alternative hypothesis 
that rats trained to approach-avoidance con- 
flict run farther down adjacent alleys as func- 
tion of the spatial distance of the other alleys 
from the original is supported. However, the 
failure of the gradient to shift toward the goal 
ends of the neighboring alleys, and the 
dramatic forward shift of the point of inter- 
Secting approach-avoidance gradients in the 
Original alley raises some doubt as to the 
nature of the therapeutic effect produced by 
experience in the other alleys. The tendency 
for subjects to leave the original alley and 
enter some alleys may, perhaps, be most parsi- 
moniously viewed as escape behavior, rather 
than as a response to conflict generalized to 
other situations, 

_A possibility exists, however, that the dis- 
similar alleys did have a therapeutic effect, as 
4 function not of degree of dissimilarity but of 
dissimilarity per se. While the behavior of sub- 
Jects in the dissimilar-white alley differed from 


533 


that observed in the original-white alley, 
thereby seeming to cast doubt on this hypothe- 
sis, these two alleys, similar in color, may have 
been dissimilar in some uncontrolled respect 
such as odor. 

At this point, the conclusion drawn from 
this study is restricted to the inference that the 
three-dimensional model of Miller and others 
is open to question insofar as it utilizes a dis- 
similarity dimension to account for displace- 
ment. 


SUMMARY 


The influence of distance was compared with 
that of dissimilarity in producing displace- 
ment behavior in an approach-avoidance con- 
flict. Twenty-four white rats were used in a 
design that permitted separation of the effect 
of these two variables. Increasing reappear- 
ance of the approach response was significantly 
related to distance but not to dissimilarity. 
Tentative explanations are offered in terms of 
escape behavior, or of uncontrolled similarity 
cues such as odor. The Miller model of ap- 
proach-avoidance-displacement is not sup- 
ported by these data, 
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INCUBATION EFFECT: 


ROLE OF AWARENESS IN AN IMMEDIATE VERSUS DELAYED TEST OF i 
CONDITIONED EMOTIONALITY! 


SANFORD GOLIN? l 
State University of Iowa 


ARIOUS investigators (Bindra & 
Cameron, 1953; Diven, 1937; Hag- 
gard, 1943; Mednick, 1957) have 

reported that a delay between the acquisition 
of a conditioned emotional response and the 
subsequent presentation of the CS results in 
an increase in the strength of the conditioned 
Tesponse. It has been suggested that this 
increase in emotionality after a delay is 
related to the clinical phenomenon commonly 
called the “incubation” of anxiety (e.g., 
White, 1948), 

A study by Diven (1937) has frequently 
been cited as providing experimental evidence 
for the incubation effect. Diven tested the 
effects of delay on emotionality by comparing 
a conditioned GSR immediately after acquisi- 
tion with the GSR after periods of delay 
varying from 30 minutes to 48 hours. This 
experiment consisted of 10 subexperiments 
each differing with Tespect to some experi- 
mental variable, In making overall compari- 
sons the effects of various variables were 
inextricably confounded and, thus, the evi- 
dence for concluding that emotionality in- 
creased after a delay was not clear. Subsequent 
studies have reported that delay resulted in an 
increase in emotionality (Bindra & Cameron, 
1953), in a decrease (Haggard, 1943), and in 
an increase for responses to generalized 
stimuli but not for responses to the CS (Med- 
nick, 1957). But these studies are not strictly 
comparable since they differed with respect 
to one or more variables such as the length 
of the CS-UCS interval and the point in this 
interval at which the GSR was measured, 

Diven (1937) further reported that the 

! Based on a dissertation submitted in partial ful- 
fillment of the requirements for the PhD degree at the 
State University of Iowa. Acknowledgment is made to 

I. E. Farber, committee Chairman, for his advice 
throughout this study. 
? Now at the University of Wisconsin-Milwaukee. 


534 


| 
increase in emotionality after a delay was 
relatively greater when subjects were unaware 
of the relation between the CS and UCS than 
when they were aware of this relation, This 
finding was considered consistent with the 
clinical hypothesis that repressed traumatic 
experiences are capable of generating more 
affect than trauma accessible to conscious 
experience, For methodological reasons Diven's 
evidence is dubious and, furthermore, the 
results of other studies (Haggard, e 
Lacey, Smith, & Green, 1955; Mednick, 1957) 
imply that increases in emotionality after à 
delay are greater when the subject is aware j 
the CS-UCS relation. Mednick (1957), like 
others (e.g, Bitterman & Holtzman, 1952; 
Moeller, 1954) noted that GSR response 
strength declined after reaching a maximum 
early in acquisition—i.e., adaptation oc] 
curred—and suggested that the incubatio 
effect may have been due to recovery from 
adaptation occurring during delay. Bothi 
Haggard (1943) and Lacey et al. (1955) found] 
the adaptation of emotional responses to be 
greater for “aware” subjects than for “wni 
aware" subjects. Hence, if incubation is à 
result of recovery from adaptation, aware 
subjects would be expected to show a greater 
increase in emotionality than unaware subjects 
as a function of delay. 3 
This study was designed to test the view 
that emotionality, as measured by the GSR, 
increases after a delay and that this increas? 
is greater when the subject is aware of the 
relation between the CS and UCS. Unlike 
previous studies in this area awareness was 
determined by allowing the sub ject to ident 
from a checklist of all stimuli those whi dl 
were followed by the UCS. Hills (1959) foun 
that written rather than oral report increase 
the proportion of subjects who are conside 
aware when awareness is defined in terms 
ability to specify the CS-UCS relationship. 
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METHOD 


Subjecis, Apparatus, Preliminary Procedure, 
and Materials 


Subjects were 60 male undergraduate students from 
the State University of Iowa assigned in equal numbers 
to each of three List conditions described below. 

After each subject was comfortably seated, GSR 
pick-up electrodes and shock electrodes were attached. 
After a “ready” signal, the subject was given a mild (15 
volt) electric shock of 1 second duration, administered 
through copper electrodes attached to the palm and 
middle finger of the left hand, by means of a Variac 
connected to a 120 volt isolated alternating current. 
The GSR pick-up electrodes, attached to the middle 
|, and little fingers of the right hand, consisted of round 

lead plates 3g inches in diameter and 34 inch thick. A 
felt corn plaster filled with Sanborn electrode paste pro- 
vided the actual contact with the subject’s finger. The 
GSR was recorded as a change in conductance by an 
Esterline-Angus recorder through a Hunter GSR 
amplifier circuit. Time intervals and durations of 
stimuli were controlled by decade-type interval timers 
(Hunter & Brown, 1949). The shock apparatus, re- 
corder, amplifier, and timers were located behind a 5 
feet x 3 feet black masonite screen. After the subject 
Was given the 15-volt shock, he was informed that the 
shock in the experiment proper would be similar to the 
15-volt shock, but much stronger. 

The stimuli consisted of 15 words (Table 1) pre- 
sented for one second each on a 3 inch X 5 inch trans- 
lucent cardboard screen by means of a 35 millimeter 
Projector placed behind the masonite partition. Three 
of these words served as experimental words. The re- 
maining words served as distractor stimuli to make the 
task more dificult. For each subject one of the three 
ental words was a shock-aware (SA) word, i.e., 
. the subject was informed beforehand that this par- 

ticular word would be followed by shock. For each sub- 
HM another of the three experimental words was a 
5 do (SU) word, i.e., always followed by 
n but not mentioned in the instructions. The last 
uns three words was a nonshock (NS) word, i.e. 
er followed by shock nor mentioned in the instruc- 
Eon The instructions were a modification of those 
qus by Lacey et al. (1955) which were designed to 
od. the relation between the SU word and the 


Acquisition and Test Series 
E trials, each trial containing the three experi- 
3 ntal words and six of the nine filler words, were 
ER to each subject. Subjects were randomly as- 
E EM List groups of 20 subjects each and the 
dM erential effects of the experimental words 
Mor Ma were controlled by counterbalancing them 
mds e three List groups, as shown in Table 1. The 
Es ere presented to the three List groups in the 
Order and therefore the effect of order with respect 


3 t 
Mie igeested in a personal communication by Guy 
Tom a technique devised by David Lykken. 
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TABLE 1 
SrimwoLUs Words 
Experimental words 
Shock condition List I List II List III 

Shock-aware (SA) TIGER | ZINC COAT 
Shock-unaware (SU) | zwc | coaT TIGER 
Nonshock (NS) COAT | TIGER ZINC 

Filler words 


LEOPARD, WILDCAT, COPPER, TIN, SHIRT, DRESS, 
TRIANGLE, CIRCLE, RECTANGLE, SQUARE, HEXAGON, CONE 


to Awareness (SA, SU, and NS) was also counter- 
balanced among the List groups. 

When the subject’s GSR reached a base level the first 
stimulus word was presented. The subject was in- 
structed to chain-associate to each word and tap a 
telegraph key located in front of him with the left hand 
ata constant rate until either a 6-watt light in the upper 
right corner of the masonite partition went on or until 
he received an electric shock. The subject chain-associ- 
ated and tapped to each word for 15 seconds. If the 
stimulus word was an NS word, the end of the interval 
was indicated by the light. If the stimulus was an SA 
or SU word, the end of the interval was signaled, not 
by the light, but by a 100-volt electric shock, a stimulus 
shown by preliminary study to be safe, though painful. 
In order to allow the GSR sufficient time to reach a 
base level there was an interval of 30 seconds between 
the end of one trial and the beginning of the next. If at 
the end of 30 seconds the GSR did not reach a base 
level, the experimenter waited until a base level was 
reached before beginning the next trial. It is estimated 
that the GSR reached a base level by 30 seconds on 
95% of the trials and those subjects who tended to have 
longer intertrial intervals were quite evenly distributed 
among the experimental groups. 

After completion of acquisition one half of the sub- 
jects in each of the three List groups were randomly as- 
signed to the Delay condition. The remaining half were 
assigned to the Immediate condition. The electrodes 
were removed from the subjects in the Delay condition 
who were then taken to another part of the room where 
they spent 30 minutes in a task involving abstract 
thinking problems. At the end of this period, these 
subjects were seated in front of the masonite screen as 
before, the electrodes being attached, and the test 
series (extinction) was begun. Subjects in the immediate 
group proceeded directly from the acquisition series to 
the test series. 

All subjects were given two test trials with the 
electrodes attached, each trial containing the three ex- 
perimental words and six of the filler words. No shock 
was administered during the test series. The base level 
of each subject was recorded in raw ohms resistance and 
the GSR was recorded as a change in conductance for 
each presentation of an experimental word both during 
acquisition and the test series. 
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Fic. 1. Mean GSRs in conductance units under Im- 
mediate and Delay conditions. 


Determination of Awareness 


After the test series the electrodes were removed and 
each subject was presented with a written list of all the 
stimulus words and asked to check whether or not a 
word had been followed by shock during the acquisition 
series. The check list consisted of the following cate- 
Bories: (1) “am certain it was not,” (2) “think it was 


Resutts 

The GSR was measured in terms of the 
amplitude of change in conductance occurring 
between the onset of the CS and the onset of 
the signal, shock or light, terminating the end 
of the chain-association interval. Since the 
correlation between base level and GSR 
changes was found not to be significantly 
different from zero, the results were computed 
on the basis of the absolute magnitude of the 
GSR. 


SANFORD GOLIN | 


TABLE 2 
SUMMARY OF ANALYSIS OF VARIANCE oF TRANSFO! 
GSR Dara or TRIAL Six or ACQUISITION 


Source dí | us F " 
Between subjects 59 | 
Delay (D) 1] 12.59] 218] | 
Lists (L) 2|11.98]| 2.08 
DXL 2| 3.52| «1 
Error (b) 54| 5.77 ^ 
Within subjects 120 J 
Awareness (A) 2 | 54.16 | 33.13 | «i 
AXD 2 -51 | «1 | 
AXL 4 2.84 1.74 
AXDXL 4| 4.01 2.45 
Error (w) 108 1.63 i 
Total 179 
Acquisition 


Figure 1 shows the GSR acquisition curva 
for SA, SU, and NS words. The mean GSI 
and SDs for Acquisition Trial 1 were, 1% 
spectively, SA, 33.87, 26.04; SU, 20.68, 21.06) 
NS, 23.09, 23.03. The difference between tht 
mean SA and mean NS GSRs (within-subjects 
comparison), which a / test showed to be 
significant (p < .01), indicated that the 
instructions were effective in producing al 
emotional response to the SA word before the! 
Shock was administered. There was no sig} 
nificant difference between the mean SU 
GSR and the mean NS GSR on Trial 1 ($ > 
.10). 

A mean GSRs and SDs at the end d 
acquisition (Trial 6) were, respectively, SA) 
24.04, 23.64; SU, 12.25, 12.24; NS, 8.31, 10.02 
In order to determine the effect of Awareness 


-001) and the GSR to SU words was gus 
than that to NS words (p < .02). Since t 
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was no significant difference between the SU 
and NS words at the beginning of acquisition, 
the significant difference between them at the 
end of acquisition indicated that conditioning 
had occurred. The finding of no significant 
difference between Immediate and Delay 
subjects indicated there was no bias in the 
selection of subjects for these conditions since 
they received the same treatments during 
acquisition, while the finding of no significant 
effect of Lists showed that the GSR was 
neither a function of the words themselves 
nor of the interaction between words and the 
other factors, 

As can be seen from Figure 1, the GSR to 
all three experimental words declined during 
acquisition, i.e., adaptation occurred. In order 
to compare the amount of adaptation between 
SA and SU words, the difference between the 
GSR on Acquisition Trial 2 and Acquisition 
Trial 6 (A-2 minus A-6) was computed for 
each subject. Trial 2 was taken as the starting 
Point for assessing the relative effects of 
adaptation since it was the first trial following 
the first reinforced trial. The means and SDs 
of the difference scores were, respectively, 
SA, 8.84, 23.8; SU, 3.92, 12.5, the difference 
between the means (within-subjects com- 
Parison) being significant at between the .05 
and .10 level. 


Effects of Delay and Awareness 


As can be seen from Figure 1, Delay subjects 
showed an increase from the last trial of 
acquisition (A-6) to the first test trial (T-1) 
Bas Immediate subjects did not. The 
ifference between the GSR on T-1 and A-6 
(T4 minus A-6) was computed for each 
Subject and an analysis of variance of the 
effects of Awareness (within-subjects) and of 
elay and Lists (between-subjects) was 
carried out, The relevant means and SDs are 
Presented in Table 3 and the analysis is 
Dunmarized in Table 4. The analysis showed 
a the mean change in the GSR was sig- 
eee greater for Delay subjects (P < 
E Ja Furthermore, the significant Delay X 
Gr meness interaction (p < .005) showed 
4 D the Increase in the GSR after a delay was 
Eun of awareness; for Delay subjects, ¢ 
; $ showed that the increase was greater for 
€ SA words than for either the SU words 
< 01) or NS words (p < .001), and the 
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TABLE 3 


GSR DIFFERENCES BETWEEN TRIAL SIX OF 
Acquisition AND Test TRIAL ONE 


Shock-aware Shock-unaware  Nonshock 
Immediate 
M —1.68 14 1.50 
SD 14.57 8.26 915 
Delay 
M 39.27 28.67 18.62 
SD 46.41 42.30 28.58 
TABLE 4 


Summary or ANALYSIS OF VARIANCE OF GSR DIFFER- 
ENCES BETWEEN TRIAL SIX OF ACQUISITION AND 


Test TRIAL ONE 
Source df MS F ? 
Between subjects 59 
Delay (D) 1 |36994.33 | 22.01 | <.001 
Lists (L 2| 889.86 |«1 
DXL 2| 595.68 |«1 
Error (b) 54 | 1681.06 
Within subjects 120 
Awareness (A) 2 | 1147.78 | 3.64 | <.05 
AXD 2 | 2136.96 | 6.79 | <.005 
AXL 4 | 1599.01 | 5.08 | <.005 
AXDXL 4 | 4649.20 | 14.77 | <.001 
Error (w) 108 


"Total 


increase for SU words was greater than that 
for NS words (p < .01). There were no sig- 
nificant differences with respect to the in- 
crease from A-6 to T-1 for Immediate subjects. 

The results just cited showed that both 
adaptation and incubation were greater for 
SA than SU words. To further determine if 
there was any relation between adaptation and 
incubation, Delay subjects were ranked with 
respect to the amount of adaptation and 
divided at the median into two groups, High 
and Low Adapters. A comparison of the mean 
incubation scores of these two groups revealed 
a significantly (p < .05) greater increase after 
a delay for High Adapters. 

Further analysis of these significant Lists 
X Awareness (p < .005) and Lists X Aware- 
ness X Delay (p < .001) interactions showed 
that the results were related to word order on 
T-1, ie, Delay subjects not only showed a 
differential increase with respect to awareness 
(SA, SU, and NS words), but also showed a 
significantly greater mean GSR to the first 
word within the trial presented after a delay, 
irrespective of the nature of the word. It will 
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be recalled that the order of presentation of 
the stimulus words was the same for all 
subjects but the order of the SA, SU, and NS 
“treatments” was counterbalanced among the 
List groups. 

Since the SA, SU, and NS words differed 
not only with respect to the increase in the 
GSR from A-6 to T-1, but also with respect 
to their strength at the end of acquisition 
(A-6), a test was made of the hypothesis that 
the amount of increase was a function of the 
strength of the response at the end of acquisi- 
tion. A comparison of high and low GSR 
responders on A-6 showed no significant 
difference with respect to the change from 
A-6 to T-1 (p > .10). Thus, it was shown that 
observed differences after a delay could not 
reasonably be attributed to the level of re- 
sponding at the end of acquisition. 

Figure 1 shows a large drop in the mean 
GSR for Delay subjects from the first (T-1) 
to the second (T-2) test trial. A comparison 
of A-6 with T-2 showed that Delay subjects, 
despite their rapid drop from T-1 to T-2, had 
not declined to their level of responding at the 
end of acquisition (p < .025). 


Awareness Ratings 


In the determination of awareness, 23 
subjects checked Category 4 on the question- 
naire, ie, indicated that they thought the 
SU word was followed by shock but were not 
sure. The mean GSR to the SU word on the 
last trial of acquisition of these 23 subjects 
was compared with the comparable mean 
GSR of 23 subjects randomly selected from 
those subjects who checked Categories 1-3 for 
the SU word. A / test did not reveal any 
significant differences between those subjects 
who marked Category 4 and those who marked 
Categories 1-3. 


Discussion 


The results of this study showed that an 
emotional response was learned without the 
subject being aware of the relation between 
the CS and UCS. This result is in accord with 


* As Eriksen (1960) has pointed out, the definition of 
awareness is a major methodological problem in this 
area. A similar problem involving the concept “emotion” 
has been discussed by Brown and Farber (1951). They 
have noted that to regard emotion as an existential real 
only partially encompassed by its definition rather than 
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the findings of Diven (1937) and Lacey et al, 
(1955). As a consequence of a confounding of 
variables the basis of Diven’s conclusions 
were not clear and, as Chatterjee and Eriksen 
(1960) have pointed out, the findings of Lacey 
et al. could have been an artifact resulting 
from the method of computing the conditioning 
scores. The present results suggest that the’ 
conclusions of the Diven and Lacey et al 
studies with respect to "earning without | 
awareness" were not affected by their methodo- | 
logical shortcomings. l 
This study showed that emotionality as 
measured by the GSR to words associated | 
with shock increased after a delay. This | 
finding agrees with the results of Diven| 
(1937), disagrees with those of Haggard (1943) | 
and is only partially consistent with those of 
Mednick (1957). Mednick, it will be recalled, 
did not find an increase after a delay in| 
emotional reactions to the CS, but did report 
such an increase for stimuli meaningfully 
related to the CS. 1 
In the present experiment the increase in 
emotionality following a delay differed among 
the responses to SA, SU, and NS words with 
the increase to the SA words being the greatest. 
This result is in contradiction to Divens 
(1937) conclusion that the increase following 
a delay is greatest under SU conditions. On 
the assumption that “incubation” is a result 
of recovery from adaptation, it was hy- 
pothesized that any increases after a delay 
should be greater for those words whose GSR 
showed greater adaptation during acquisition. 
Adaptation from the second to the last t 
of acquisition was found to be greater fot 
the GSR to SA than SU words, though the 
difference was not highly significant (P < 
.10). 


Since both adaptation and the increase aftet 
a delay were found to be greater for SA 
words, these findings can be considered 4 
consistent with the interpretation of incuba 
tion as recovery from adaptation. Also, the 
finding that subjects who showed relatively 
greater adaptation also showed a greate 
increase after a delay further supports thi 
interpretation. 


as a scientific inference has lead to unanswerah 
questions such as, what is the “true” ber 
emotion. Their comments might also be consid 
applicable to the definition of awareness. 


| Tf one assumes that incubation is a result of 
recovery from adaptation, the question is 
raised as to what are the variables involved 
in adaptation. Figure 1 shows that despite 
the differential response strength among 
SA, SU, and NS words, response strength to 
all three words declined with increasing 
reinforcement. The shape of these curves 
suggests that the incubation effect may be 
considered as a special case of the Switzer 
phenomenon described by Hull (1943). Hull 
hypothesized that the initial rise in the 
extinction curve following massed reinforce- 
ment resulted from dissipation of some kind 
of inhibitory process developed during acquisi- 
tion (inhibition of reinforcement). Within this 
framework one could speculate as to what 
kinds of inhibitory processes might be involved 
in the adaptation of emotional responses. 


SUMMARY 


„The purpose of the study was to test the 
view that conditioned emotionality, as 
measured by the GSR, increases after a delay 
and that this increase is greater when the 
subject is aware of the relation between the 
CS and UCS, Certain words were associated 
with shock. The subject was aware of the 
relation between one word and shock but was 
unaware of the relation between a second word 
and shock. The GSR to these words was 
compared to the GSR to a word not associated 
with shock. The GSR to these words was 
recorded immediately after acquisition for 
half the subjects, and after 30 minutes delay 
following acquisition for the remaining sub- 
Jects. The major results were: (a) the GSR 
increased as a function of delay, (6) the 
increase after a delay was greater when the 
Subject was aware of the relation between the 
CS and UCS, (c) adaptation during acquisition 
Was greater when the subject was aware of 
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the CS-UCS relation. The results were in- 
terpreted as consistent with the interpretation 
of incubation as a result of recovery from 
adaptation. 
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ANY well known clinical writers have 
suggested that the disorders of 
thought displayed by schizophrenic 

or brain damaged patients represent in some 
sense a regression to modes of thought that 
are typical of young children. This statement 
has been made of schizophrenics by Arieti 
(1955), Kasanin (1945), Sullivan (1924), 
Von Domarus (1944), and White (1928); of 
brain damaged patients by Levin (1930); 
and of both schizophrenic and brain damaged 
patients by Hanfmann (1953), Lewin (1951), 
Vigotsky (1934), and Werner (1940). These 
writers were all concerned with the return to 
earlier patterns of interpreting or interrelat- 
ing words and ideas, the sense of regression 
that is pertinent to this paper. 

In psychoanalytic terminology, this is a 
variety of temporal ego regression. By ego 
regression, Freud (1953, 1955) referred to an 
abandonment by the ego of some of its func- 
tions. Although he mentioned that ego re- 
gression also entails a reversion to genetically 
earlier patterns of functioning, his examples 
of this were almost exclusively of the trans- 
formation of thoughts into images. When 
Freud wrote of temporal regression, he was 
usually concerned with its affective manifes- 
tations, ie, libido regression and object 
regression. However, later psychoanalytic 
writers, like Kris (1936, 1938), have discussed 
in some detail temporal regression in the 
sense of the present paper, ie., reversion to 
prelogical “primary process" patterns of 
interpreting and interrelating meaningful 
content. 

In Lewin’s (1951) terms, the phenomena 
under discussion are regarded as Tegression 
rather than retrogression because there is no 
evidence that any particular patient who 

1This study was supported by a research grant 
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shows these modes of thought previously 
exhibited them in his own childhood. 

Attempts at objective testing of the re 
gression hypothesis of pathological thinking 
have usually consisted of comparisons of the 
performance of children with that of psychi- | 
atric patients. The findings have been con- | 
flicting. Using several tests, Vigotsky (1934) | 
concluded that children, schizophrenics, and | 
brain damaged patients share a similar pre- 
conceptual kind of thinking. Bolles (1937) 
and Wegrocki (1940) each compared schizo- 
phrenics with children on a variety of tasks 
designed to measure the ability to think con- 
ceptually, Both investigators, contrary to 
Vigotsky, concluded from a qualitative analy- 
sis of performance that the differences be- 
tween groups were much more prominent 
than the commonalities. Rashkis, Cushman, 
and Landis (1946) did a similar study in which 
they compared both schizophrenics and pa- 
retics with children. They concluded that the 
paretics but not the schizophrenics were child- 
like in their thinking. b 

All four of these studies relied on qualitative 
analysis of the data in terms of descriptive 
categories of response which were subjectively 
determined after the data had been obtained. 
This methodologically hazardous procedure 
often results in findings that fail to be con- 
firmed by other investigators. 

Several other investigators have employed 
more objective measures of performance. 
Cameron (1938), comparing schizophrenics 
with children on the ways in which they 
finish incomplete causal sentences, concludi 
that the groups are very different. Howeve 
three other investigators, using different tasks, 
reported similarities between these two group 
Feifel (1949) found that the definitions © 
words offered by schizophrenics and diia 
showed a comparable pattern of der 
from the definitions offered by adult normal 
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Ellsworth (1951) reported a similar finding 
on the frequency of usage of different parts 
of speech. Burstein (1959, 1961) found that 
both schizophrenics and children treat anto- 
nyms as if they were synonyms more than do 
normals. 

Each of these three investigators chose his 
measure of deviation on the basis of its dis- 
tinguishing either schizophrenics or children 
from normal adults. In each case, the tasks 
were such that both children and schizo- 
phrenics were characterized by deviant per- 
formance relative to normal adults. However, 
as a result of restrictions either on the re- 
sponse alternatives available or on the in- 
vestigator’s categorization of the responses, 
each task provided relatively few avenues 
along which deviations could occur. The 
tasks tended to measure undifferentiated de- 
viation rather than different kinds of specific 
deviation. Therefore, the likelihood that 
schizophrenics and children would appear 
similar was very great. 

A much more stringent test of the regression 
hypothesis entails the use of tasks that measure 
errors that are sufficiently specific to dis- 
tinguish pathological groups. For example, 
one might choose measures on the basis of 
their ability to distinguish the thinking dis- 
order of schizophrenia from that of brain 
damage, as well as their ability to distinguish 
pathological from normal groups. One could 
then compare the performance of the clinical 
groups on several such measures with the 
Performance of children with more assurance 
that sufficiently distinct and varied oppor- 
tunities for deviation have been provided. 

Such an approach has an additional advan- 
tage. Despite the fact that differences between 
the thinking of schizophrenic and brain dam- 
aged patients appear well established on the 
basis of both everyday clinical observation 
and research evidence, the disorders of both 
groups have been claimed to represent a 
Tegression to childhood patterns of thought. 
This claim cannot be valid for both groups 
unless schizophrenics and brain damaged 
Patients regress to different age levels that 
differ on error preferences. It is especially 
appropriate, therefore, to apply a stringent 
test of the regression hypothesis to these two 
Particular pathological groups. 

The present investigation employs two 
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tasks, each of which measures two contrasting 
kinds of errors. Each task has previously 
been shown to distinguish, by kind of error 
preferred, the thinking disorders of schizo- 
phrenic and brain damaged patients. One is a 
measure of the confusion of figurative and 
literal meanings of common words and phrases 
(Chapman, 1960); the second is a measure of 
the tendencies to include or exclude inappro- 
priately the names of objects from common 
conceptual categories (Chapman, 1961). 

These tasks present subjects with pre- 
arranged opportunities to engage in the hypo- 
thesized kinds of thinking disorder. It may 
be objected that this is inappropriate for in- 
vestigating the hypothesis, since psycho- 
analysts write as if every occasion of regressive 
(primary process) thinking is motivated by 
affect specifically related to the material at 
hand. This emphasis may be in large part 
due to the fact that Freud’s discussion of 
primary process thinking in adults arose 
primarily from his study of dreams, the psy- 
chology of errors, and neurosis. In psychosis, 
however, the thinking disorder apparently 
becomes a more pervasive feature of thinking, 
Burstein (1961) has shown that some errors 
structurally identical to those described as 
primary process thinking are exhibited by 
schizophrenics in dealing with experimentally 
contrived material which, presumably, is 
emotionally neutral. Such a process appeared 
in the tendency to treat both homonyms and 
antonyms as synonyms. These findings do 
not imply that the thinking disorder is not 
motivated, but they do suggest that some 
thinking disorders of the sort described by 
psychoanalytic writers are sufficiently per- 
vasive to permit measurement through the use 
of materials that do not necessarily involve 
an individualistic investment of affect by the 
patient at hand. 


EXPERIMENT I 


Procedure 
The confusion of figurative and literal meanings of 
common phrases was measured by a paper-and-pencil 
task with 40 items like the following: 
30. The rich woman looks down on her servants. 
This means: 
A. The rich woman likes to work in her 
garden in the evening. 
B. The rich woman sees her servants from 
a great height. 
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C. Therich woman thinks she is better than 
her servants. 
16. The roof repairman looks down on the children 
playing. 
This means: 


A. The roof repairman likes to work in his 
garden in the evening. 

B. The roof repairman sees the children 
from a great height. 

C. The roof repairman thinks he is better 
than the children. 

In Item 30 above, the correct answer is, of course, 
“The rich woman thinks she is better than her sery- 
ants.” Choosing “The rich woman sees her servants 
from a great height” would reflect an inappropriately 
literal misinterpretation of “looks down on.” In Item 
16, the reverse is true: “The roof repairman sees the 
children from a great height” is the correct answer, and 
choosing ‘The roof repairman thinks he is better than 
the children” would be inappropriately figurative. The 
third alternative, which interprets “looks down on” to 
mean to work in a garden, is neither correct nor a pre- 
dicted error due to misinterpretation. This alternative 
was included as a measure of errors from such un- 
identified sources as inattention or inadequate under- 
standing of the task. Such unidentified errors ought to 
be equally distributed among the three alternatives. 
Thus, there should be, by chance, as many errors made 
for unidentified reasons hidden among the predicted 
misinterpretation markings as there are markings of 
these "unidentified" error alternatives themselves. 
"Therefore, we may obtain for each subject a measure of 
literal misinterpretation, relatively uncontaminated by 
errors from unidentified sources, if we subtract from the 
number of his literal error markings the number of his 
"unidentified" error markings. This will be called the 
“corrected literal misinterpretation score." The “cor- 
rected figurative misinterpretation score" is obtained 
by a similar principle. 

"The 40 items consisted of 20 which permitted literal 
misinterpretations and 20 which permitted figurative 
misinterpretations. For a more complete description of 
this task and its rationale, see Chapman (1960). 

Tn an earlier study, Schizophrenics were found to 
make literal misinterpretations of figurative usages 
more often than they made figurative misinterpretations 
of literal usages (p < .01). Organics, however, showed 
the opposite error perference, more often making 
figurative misinterpretations of literal usages (p < .05). 
Moreover, a preference score of figurative misinterpre- 
tations minus literal misinterpretations distinguished 
the two groups (5 < .01). 


Subjects 


Subjects were pupils in a fourth grade class and an 
eighth grade class at a suburban public school near 
Chicago, Illinois. The students in this school were pre- 
dominantly from upper middle class white families. 
There were 23 fourth graders, of whom 19 were 9 years 
old at their last birthway; 4 were 10 years old. There 
were 21 eighth graders, of whom 16 were 13 years old 
and 5 were 14. 


Dav, AND VERDONE 


TABLE 1 
MEAN FIGURATIVE AND LITERAL MISINTERPRETATION 
Scores or FOURTH GRADERS AND 
EIGHTH GRADERS 


Figurative Literal 
misinter- misinter- 
pretations pretations 


Fourth | Eighth | Fourth | Eighth 


Predicted misinterpre- |2.1 |2.0 | 5.3 | 1.5 
tation markings 

Unidentified error mark- | 1.3 2 1.6. | 8 
Ings 

Corrected misinterpre- :8 | 1.8) | 3.7 1i 
tation score 

Results 


The mean corrected literal misinterpretation 
scores and corrected figurative misinterpreta- 
tion scores of the two groups are shown in 
Table 1. As seen there, the fourth graders 
made literal misinterpretations of figurative 
usages more often than they made figurative 
misinterpretations of literal usages. The dif- 
ference was significant (p < .01) as deter- 
mined by a two-tailed Wilcoxon test. The 
eighth graders tended to show the opposite 
error preference, preferring figurative misin- 
terpretations more than literal misinterpre- 
tations, but this difference was not significant 
(p = .19). On a preference score of corrected 
figurative misinterpretation score minus coi- 
rected literal misinterpretation score, the two 
groups were again different (p < .02). Thus, 
the schizophrenics appeared to resemble the 
younger children in their performance 0n 
this test, whereas the brain damaged patients 
clearly did not. 


EXPERIMENT II 
Procedure 


The subjects described in Experiment I were also 
given, in paper-and-pencil form, a task that had previ- 
ously been employed to compare schizophrenics wi 
brain damaged patients on the breadth of the concept 
that they use. E. 

The task consisted of 12 subtests like the following: 
30 words were listed, of which 10 were names of trees 
10 names of flowering plants, and 10 names of music 
instruments. The printed instruction read, “Circle e 
word that names a tree. Cross out each word that do65 
not name a tree.” 

The 12 subtests varied in number of words to D 
categorized and in the proportion of words that be 
longed in the named category. Altogether the 12 st! 
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tests required the categorization of 480 names of 
objects of which 240 belonged in a named category 
and 240 did not. Thus, there were as many opportu- 
nities for incorrect inclusions in a named category as for 
incorrect exclusions. For a further description of this 


task, see Chapman (1961). 

In an earlier study, schizophrenics and brain dam- 
aged patients were presented this task in a card sorting 
format. The brain damaged patients made more ex- 
clusion errors than inclusion errors (? < .05), and the 
Schizophrenics showed the opposite error preference, 
making more inclusion than exclusion errors (p < .05). 
Moreover, a preference score of inclusion errors minus 
exclusion errors distinguished the two groups (p « .01). 

Tn the present study, this task was given to the fourth 
grade and eighth grade classes described above. It was 
completed by 20 subjects in each group. 


Results 


The mean exclusion and inclusion error 
Scores are shown in Table 2. Both groups ex- 
cluded more than they included. A score of 
inclusion errors minus exclusion errors was 
computed for each subject. Because random 
errors should result as often in an inclusion 
as an exclusion error, this difference score is 
independent of such random error. The dif- 
ference score was found to distinguish the two 
groups (p< .01) the younger children 
showing the more extreme negative scores. 
Moreover, the younger children tended to 
exceed the older on the proportion of total 
errors that were exclusion errors (p < .06). 
(Subjects who made fewer than a total of 
five errors were omitted from this analysis 
because of the unreliability of proportions 
With small denominators.) Thus, the brain 
damaged patients appeared like the children 
in their performance, but more like the younger 
children than the older. The schizophrenics 
exhibited the opposite error preference on 
this test. The results here are essentially con- 
trary to those obtained with the figurative- 
literal test, 


TABLE 2 
MEAN Errors ON A MEASURE OF 
CONCEPTUAL BREADTH 


Inclusion 
Inch Exch - 
ero. eror | gaming © 
Fourth 
ourth grade 8.6 37.0  —28.4 
Eighth grade 4.6 8.6 -4.0 
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DISCUSSION 


On two tasks that distinguish schizophrenics 
from brain damaged patients, younger children 
were more like schizophrenics in their error 
preference on one task; on the second, they 
were more like brain damaged patients. These 
results appear contradictory if one accepts 
the regression hypothesis for either or both 
clinical groups. This contradiction cannot þe 
resolved by hypothesizing different age levels 
of regression for schizophrenics and brain 
damaged patients because each group of pa- 
tients would appear regressed to an earlier 
level than the other in terms of one of the two 
tasks. Perhaps, then we should not look to 
the regression principle to explain the simi- 
larity of either clinical group to the younger 
children, 

One may speculate that the error preferences 
which occur as a function of age with the 
children on these tasks result from the smaller 
fund of knowledge possessed by younger chil- 
dren. This interpretation might hold for the 
figurative-literal task. Figurative meanings 
may be learned later in life than literal ones, so 
younger children misinterpret figurative ex- 
pressions as if they were literal more often 
than they make the opposite kind of misin- 
terpretation. On the breadth of concepts 
task, younger children may more often ex- 
clude objects from the named conceptual 
categories because they have not yet learned 
the names of the objects. But this explanation 
in terms of childish ignorance would not ex- 
plain the results for the clinical groups be- 
cause the results are in opposite directions 
for the two tasks. The data offer no justifiable 
basis for speculation as to the theoretical 
basis for the differing error patterns exhibited 
by the pathological groups. They do, however, 
demonstrate the inadequacy of the hypothesis 
that thinking disorders are completely ac- 
counted for by regression. j ; 

As previously pointed out, an investigator 
is more likely to conclude that the errors of 
two groups are qualitatively alike if his meas- 
ure is not highly specific as to the kind of 
error it taps. One may enhance the specificity 
of description of thinking disorder if he has 
available two or more measures on the same 
subjects and treats them concurrently and 
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interrelates them. Using this procedure, one 
is less likely to conclude that groups are similar 
than if the measures are examined one at a 
time. This is because many of the same kinds 
of deviant responses and failures appear in a 
variety of conditions. The difference between 
thinking disorders in diverse conditions partly 
consists of differing frequencies of various 
kinds of deviant response which are present 
in many of the conditions. For example, on 
the second task here, schizophrenics, unlike 
the younger children, excluded less than they 
included. However, considering exclusion 
errors alone, one finds that the schizophrenics 
made more exclusion errors than the normal 
adults, and the younger children made more 
than the older children. The difference be- 
tween the groups emerged only in rank order- 
ing the strengths of two differing error 
propensities, 

The use of rank ordering of different kinds 
of response in order to specify a difference 
between children and schizophrenics is also 
the principal evidence in Cameron’s (1938) 
well known study. He compared three groups 
of schizophrenics, varying in degree of pa- 
thology on their handling of two tasks which, 
according to Piaget, require abilities that 
mature at different ages. The more disor- 
ganized patients did not do worse on the tasks 
that required an ability which matures later. 
Cameron concluded, therefore, that the 
schizophrenic thinking disorder is not regres- 
Sive in nature, However, if Cameron had 
only been concerned with the presence or 
absence of the various kinds of deviant re- 
sponse, he might well have concluded that 
children and schizophrenics are alike, Indeed, 
he remarked that both show asyndesis and 
metonymic distortion. 

Even in the four rather subjectively in- 
terpreted studies of concept sorting that were 
mentioned earlier, the degree to which the 
investigator particularized his description of 
the thinking disorder seemed to influence his 
conclusions concerning regression, Vigotsky 
(1934) apparently lumped together into the 
single category of “complex thinking” a 
fairly large variety of failure on his test and 
concluded that his groups were alike, Bolles 
(1937), Wegrocki (1940), and Rashkis, Cush- 
man, and Landis (1946) described sortings in 
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terms of more varied categories, concludi 
that the differences between groups were 
important than the similarities, 

The basic question would seem to 
whether regression is to be considered a 
descriptive term or an explanatory princip 
One may justifiably use it descriptively as 
way of referring to whatever correspondences 
emerge between two sets of observations, ont 
of which is obtained on children. Very often, 
however, the term is used as an explanatory 
principle, meaning, for example, that patients 
behave as they do because they have b 
like children. This usage seems to require a 
fairly blanket similarity between the 
groups unless one is willing to say that re 
gression occurs in certain characteristics but 
not others. Such a formulation is rather 
unsatisfactory in the absence of a rationdlt 
for predicting which childlike characteristics 
are shown. | 


SUMMARY 


The hypothesis that the thinking disorders 
of schizophrenics and brain damaged patients 
are forms of regression was tested by adminis- 
tering to a fourth grade and an eighth grade 
class of children two tasks, each of which had 
previously been shown to distinguish schizo- 
phrenics from brain damaged patients on the 
variety of error most often made. The first 
task measured both the misinterpretation di 
literal usages of words as if they were figurai 
tive, and the misinterpretation of figurative 
usages as if they were literal. The youngd 
children exceeded the older in their differ 
ential preference for misinterpreting figurative 
expressions as if they were literal as against 
the opposite kind of misinterpretation. Since 
schizophrenics had previously been shown t0 
differ from brain damaged patients in i 
same manner, the finding indicated t? 
schizophrenics are more “childlike” than bra 
damaged patients on this task. 23 

The second task measured inappropr? 
inclusion and exclusion of objects from con 
conceptual categories. The younger chil 
exceeded the older on their preference 
exclusion errors as against inclusion E 
Brain damaged patients had previously b 
shown to differ from schizophrenics a 
Same manner, indicating that the b 
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damaged are more “childlike” than the 
schizophrenics on this task. 

There is no blanket similarity between the 
error patterns of children and of either schizo- 
phrenics or brain damaged patients, and the 
use of the term “regression” to imply such a 
blanket similarity is not justified. There are, 
however, some points of similarity, which 
would appear to justify the use of regression 
as a purely descriptive term, as distinct from 
an explanatory concept. Much of the lack of 
agreement among investigators in this area 
has occurred both because the various in- 
vestigators have differed as to how much 
similarity is necessary to conclude that re- 
gression exists, and because they have used 
instruments that differ as to sensitivity for 
distinguishing varieties of deviancy. 
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HE use of conditioning techniques in the 
study of the psychological effects of 
centrally acting drugs is important from 
three points of view. First, for the learning 
theorist, drive states may be manipulated 
experimentally by drugs and thus may be a 
useful tool for testing deductions about the 
effects of drive level on the rate of learning 
(Brown, DiMascio, & Klerman, 1958; Burn- 
stein & Dorfman, 1959). Second, for the 
physiologist, the effects of drugs on different 
kinds of conditioning make possible inferences 
about the site of drug action (Berger, 1957; 
Hunt, 1957) and may provide data on the 
manner in which different drugs affect specific 
brain mechanisms (Olds & Travis, 1960). 
Third, conditioning techniques provide useful 
information for the psychopharmacologist 
and the clinician on how different drugs alter 
drives (Schneider & Costiloe, 1957). 
Previous studies have generally found that 
depressant drugs retard the acquisition of 
simple conditioned responses and speed ex- 
tinction, while stimulants facilitate acquisition 
and slow extinction. (For a brief summary of 
previous work, see Schneider & Costiloe, 
1957.) The present study attempts to demon- 
strate that the drug effect is partially deter- 
mined by whether the affective state aroused 
by the unconditioned stimulus (UCS) used is 
positive or negative. In the basic Hullian 
paradigm, the tendency to respond (E or 
excitatory reaction potential) is a. function of 
the drive level (D) times habit strength (H, 
or amount of learning) minus the response 
threshold (D): E = (D x H) — L. Since the 
number of trials to acquisition or extinction 
is a response measure and not a direct measure 
of the amount of learning, different theoretical 
interpretations of the results are possible. 
Thus Eysenck (1960), for instance, seems to 
feel that drugs act directly on habit strength 
(H). Payne (1958) assumes that depressant 
drugs increase the excitation threshold of 
neural structures and thus raise the reaction 
threshold (Z). The most common interpreta- 


tion of these experimental results, however, 
assumes that the reaction potential is affected 
because drugs act directly on the drive level 
(D) of the organism. The experimental designs 
of most of these studies (like the present one) 
do not provide a crucial test between these 
alternative interpretations. But since there is 
abundant clinical evidence that tranquilizing 
drugs do decrease hyperexcitability and 
anxiety (Berger, Hendley, & Tynes, 1956; 
Osinski, 1957; Wikler, 1958) and some experi- 
mental data showing they also decrease the 
activation level as measured by the psycho- 
galvanometer (Carran, 1960; Kristofferson & 
Cormack, 1960), our predictions are based 
on the assumption that tranquilizing drugs 
possess drive-altering properties. 

This series of two experiments was de- 
signed to test the effects of two tranquilizers 
(meprobamate and prochlorperazine) on the 
conditioning of the galvanic skin response 
(GSR) when a noxious stimulus is used as the 
UCS (shock in Experiment I) and when an 
affectively positive stimulus is used as the 
UCS (sexually exciting pin-up pictures in 
Experiment II). The GSR was chosen as the 
unconditioned response (UCR) because it 
provides a sensitive index of the “activation 
level” of the organism (Woodworth & Schlos- 
berg, 1954) and can provide information on 
the effects of tranquilizers upon autonomic 
activity. The procedure used was adapted 
from Schneider and Costiloe (1957) with 
slight modifications. 


Exrerment I: SHOCK CONDITIONING 
Subjects 
The subjects were college men, 21 years of age and 
older, who volunteered for the experiment and who were 
paid for their participation. In Experiment I, 27 sub- 
Jects were tested, 9 each under meprobamate, prochlor- 
perazine, and placebo. 


Dosage 


On the second experimental day, one hour before 
testing began, subjects were administered capsules con- 
taining either 1200mg. meprobamate, 20 mg. prochlor- 
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perazine, or a placebo. All subjects received identical 
capsules; administration followed double-blind pro- 
cedures. 


Apparatus 


The GSR was recorded from the left palm on a 
Lafayette psychogalvanometer. The UCS was a shock 
administered to the right arm by a standard ac shocker. 
Shock intensity varied from 50 to 70 volts between sub- 
jects according to individual tolerance, but was main- 
tained at a constant level for each subject on both days 
of testing. The pulse frequency was 120 cycles per 
second. On the oscilloscope, the pulse was sharply 
peaked with a fast fall-off. The shocker did not have the 
ability to keep current constant over side changes in the 
subject’s resistance. The subjects were seated in a 
modified dental chair and completely screened from the 
experimenter. In order to cut down on distractions, the 
experimental room was kept at low illumination during 
testing. 


Procedure 


Following Schneider and Costiloe (1957), a partial 
reinforcement, short-trace conditioning procedure was 
used. The conditioned stimulus (CS) was a buzzer of 1 
second duration, followed in 0.5 second by an electric 
shock. The duration of the UCS was 2 seconds. The 
presentation of the stimuli was controlled by an elec- 
tronic timer. 

_Each subject was tested on two consecutive days, 
without drugs the first day and with drugs the second, 
On each day, a four-step procedure was followed: 


First Day 


1. The GSR to the CS was adapted out. The criterion 
used for adaptation and extinction was three consecu- 
tive trials without response to the CS or until the non- 
reinforced CS had been presented for 30 trials. 

2. The UCS was presented for five trials at 30- 
second intervals to test the subject’s reaction. 

3. The partial reinforcement conditioning procedure 
was begun. Presentation of the CS occurred every 30 
seconds for 20 trials. On 15 of the trials, the CS was 
followed by the UCS, and on 5 it was not. ‘The nonre- 
inforced trials were randomized according to the same 
schedule for all subjects within the series of 20 trials, 
and changed for each subject on the second day of 
testing. 

4. Then the extinction procedure was carried out. 
The CS was again presented every 30 seconds, but with- 
out reinforcement, until extinction or for 30 trials. The 
number of trials to extinction (E1) provides a rough 
measure of the strength of conditioning and, when pos- 
sible, was used in matching subjects assigned to the 
different experimental groups on the second day. 


Second Day 


5. The CS was again presented without reinforce- 
ment as in Step 4, until extinction occurred. The second 
extinction (E2) provides the measure of the amount of 
spontaneous recovery under each drug condition. 

6. Presentation of UCS for five trials as in Step 2. 
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7. Reconditioning as in Step 3. 

8. Extinction as in Step 2. The third extinction (E3) 
provides an index of the amount of reconditioning on 
the second day. 

In order to control for individual differences in the 
amount of original conditioning, a measure of spon- 
taneous recovery was obtained for each subject by 
subtracting the number of trials for E2 from the num- 
ber for Ei (E1 — E2). Subtracting the number of 
trials for E3 from E1 provided the measure of the 
amount of reconditioning (E1 — E3). The statistical 
analysis compares the means of these difference scores 
for the two drug groups with the difference scores for 
the control group. 


Predictions 

Although the two drugs tested are both 
ataraxics, their mode of operation and site 
of action are completely different. Prochlor- 
perazine belongs to the group of tranquilizers 
called the “autonomic suppressants,” which 
affect the internal regulating systems by 
stimulating the posterior hypothalamus. 
Meprobamate, in contrast, is a central re- 
laxant and produces its tranquilizing effect 


by a selective blocking of interneuronal 
circuits, resulting in a reduction in muscle 


tension (Berger, 1957). 

Different behavioral effects of these drugs 
parallel these neurophysiological differences. 
Sikorski (1958) found that both meprobamate 
and prochlorperazine facilitate the expression 
of sexual and aggressive impulses, apparently 
by reducing the anxiety generally associated 
with their expression. Olds and Travis (1960), 
however, found quite striking differences 
between the effects of meprobamate and 
chlorpromazine (a phenothiazine) on self- 
stimulation behavior. While the former 
inhibited escape and avoidance reactions in 
doses that did not affect approach tendencies, 
the latter inhibited both approach and avoid- 
ance responses such that self-stimulation was 
markedly reduced. 

The predictions for the shock conditioning 
are based on the assumption that both mepro- 
bamate and prochlorperazine relieve anxiety 
and thus lower the over-all drive level of the 
subject. Since reaction potential is a multiplica- 
tive function of habit strength and drive 
(E = D X H), and since the habit strengths 
of the drug and placebo groups are assumed 
equal because of the identical criterion for 
extinction on the first day, the two drug 
groups should show less spontaneous recovery 
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than the comparable control group. The dif- 
ferences, however, should be slight because 
the habit strength of a previously extinguished 
response is small. 

The same reasoning leads to the prediction 
that the drug groups should extinguish faster 
than the control group after relearning, and 
that the difference should be larger than for the 
extinction of the spontaneous recovery because 
of greater habit strength after conditioning 
than after extinction. 


Results 

Table 1 presents the mean number of trials 
needed for each extinction as well as the 
measures for the amount of spontaneous 
recovery (E1 — E2) and the amount of re- 
conditioning (E1 — E3). Table 2 presents the 
statistical tests of significance for these 
differences. 

On the first day, when no drug was adminis- 
tered, the three groups showed no significant 


TABLE 1 


Mean NUMBER OF TRIALS TO EXTINCTION UNDER 
SHOCK CONDITIONING 


inal 
First ex-| taneous | E1 — | ertas | E1 — 
Grow tinction i 
T SES regen mm dm 
Placebo 18.3 | 15.0 | 3.3 | 12.2 | 6.1 
Meprobamate| 17.6 | 13.0 | 4.6 9.3 | 8.3 
Prochlorpera-| 21.9 | 20.2 | 1.7 | 12.9 | 9.0 
zine 
TABLE 2 
Tests or Druc EFFECTS ON SHOCK CONDITIONING 
Baseline Response measure i 
Meprobamate vs. Placebo | Amount of condi- u 
groups (First day: tioni 
without drugs) (E1) 
lorperazine vs. Amount of condi- | .65 
Placebo groups (First tioning 
day: without drugs) (E1) 
Drug Group 
Meprobamate vs. Spontaneous re- 
Placebo covery 
(E1 — E2) 
Meprobamate vs. Reconditioning .46 
Placebo (E1 — E2) 
Prochlorperazine vs. Spontaneous re- .33 
Placebo covery 
(E1 — E2) 
Prochlorperazine vs. Reconditioning .59 
Placebo (E1 — E3) 


* With 16 df a £ of 2.12 is needed for significance at the s% 
level. 
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difference in the number of trials needed to 
reach extinction. It was hypothesized that 
the administration of meprobamate and | 
prochlorperazine on the second day would | 
result in faster extinction of the spontaneous | 
recovery in the two drug groups as compared 

to the placebo group. The meprobamate | 
group was in the predicted direction; the _ 
prochlorperazine group was not. Neither 
difference was significant. The prediction 
that the drug group would show faster ex- 
tinction at the end of the second testing also 
was not confirmed, although both drug groups 
were in the predicted direction, Experiment 
I, then, gives no evidence that tranquilizing 
drugs affect classical conditioning or spon- 
taneous recovery when a noxious UCS is used 
as the reinforcer, 


EXPERIMENT II: PIN-UP PICTURE 
CONDITIONING 


Since Olds (Olds & Travis, 1960) has 
presented evidence that drugs may aflect the 
reward areas of the brain differently from the 
regions mediating escape tendencies, it was 
decided to run a second experiment, using an 
affectively positive stimulus as the UCS. 
On the basis of their work with self-stimulation 
techniques, Olds and Travis concluded that 
dosages of meprobamate sufficiently large 
to inhibit escape and avoidance tendencies 
had little or no effect on approach tendencies. 
We tried to capitalize on a “built-in” approach- 
avoidance conflict situation by using sexually 
arousing pictures as the unconditioned stimuli 
for the affective arousal measured by the 
GSR. It was hypothesized that meprobamate 
would lower the avoidance aspect of the con- 
flict associated with the arousal of sexual 
impulses in social situations. This would 
allow greater expression of sexual affect (and 
thus greater drive level) and, therefore, 
greater spontaneous recovery and higher 
reaction potential built up during relearning. 
Two experiments lend support to the con- 
tention that the degree of sexual arousal may 
be accentuated by lessening inhibition. Clark 
(1952) found that sexual imagery in responses 
made to TAT cards was both more frequent 
and more direct after subjects had taken 
alcohol. Sikorski (1958) similarly found that 
both need for aggression and need for sex were 
higher, as measured by objectively scored 
Tesponses to TAT cards, for subjects under 
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either meprobamate or prochlorperazine than 
for a comparable placebo group. 

Tt was felt that the sex of the experimenter 
would be an important determinant of the 
amount of sexual arousal, and it was pre- 
dicted that under meprobamate, male subjects 
run by a female experimenter would show slower 
extinction than male subjects run by,a male 
experimenter. 


Subjects 


Fifty subjects were used in all. The female experi- 
menter ran 10 subjects in each of the three groups: 
meprobamate, prochlorperazine, and placebo. The 


> male experimenter ran 10 subjects in both the meproba- 


4 


1 


mate and placebo groups. 


Apparatus 


The apparatus was that used in Experiment Iwith 
the exception that the UCS were mounted pin-up 
pictures which were viewed at the end of a large box 
through a 4 X S-inch opening. The pictures were 
mounted approximately 17 inches from the aperture. A 
small light was placed on the roof of the box near the 
front end. The UCS was illuminated by a flash of light 
lasting 2 seconds. At the end of each presentation, the 


f picture was changed for the next exposure. The experi- 


t 


TABLE 3 
Mean NUMBER or TRIALS TO EXTINCTION UNDER 
SEXUAL AROUSAL CONDITIONING 
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mental room was kept dark during the conditioning pro- 
cedure so that subjects were unable to see the picture 
until the light was flashed. 


Procedure 


A 2-day testing procedure identical to that described 
for Experiment I was used. 


Results 


Table 3 presents the mean number of ex- 
tinction trials required for each of the three 
extinction periods. The significance of these 
differences is presented in Tables 3, 4, and 5. 

The prochlorperazine group as compared 
to the control group showed faster extinction 
of the spontaneous recovery and slower ex- 
tinction of the second day’s relearning, but 
neither difference approached significance. 
In the comparison of the meprobamate 
groups with the placebo groups, the amount 
of conditioning the first day, as measured by 
the number of trials to extinction, was not 
significantly related to either the sex of the 
experimenter or the subsequently assigned 
drug group. The difference between E1 and E2 
is smaller for the total meprobamate group 
than for the control group, and smaller for 
the subjects run by the female experimenter 
than for those run by the male experimenter. 


G E- However, these predicted differences are not 
B et | e2 (Phy | ES F1-E* significant. The E1 — E3 difference is signifi- 
erai cant in the predicted direction at the .05 
robamate o 

A enter. [129] 9.2 e g Gaa both drug and experimenter effects. 

Female experimenter | 5.6| 6.0| —.4110.8| —5.2 There is no drug-experimenter interaction. 
ES ed SE The meprobamate group shows comparatively 

X] — t H 

Female P nennen is ud i 9.5| 5.8 Slower extinction at the end of the second day, 

Brno enne and those drug subjects run by the female 
pedes pad e posae der menter were extinguished more slowly 
Femal i experimen! : 
ale experimenter 16.4 8.6 7.8/1.8) 3-6 fhan those run by the male experimenter. 
TABLE 4 
ANALYSIS OF VARIANCE OF MEPROBAMATE EFFECTS UNDER SEXUAL AROUSAL CONDITIONING 
Baseline Response measure B 

Meprobamate vs. Placebo Amount of conditioning 1. Drug group effect: 3.27 

(First day: without drugs) (E1) 2. Experimenter: 1.13 

Re VE Female experimenter 3, Interaction: 1.48 

rug effect 
A. ense vs. Placebo Spon RM » pag aie of 

ale vs. Fi l i El — E . Experimenter: Š 
ode em f ) 3. Interaction: 2.00 
B. Meprobamate vs. Placebo Reconditioning 1. Drug effect: 6.33% 
Male vs. Female experimenter (E1 — 2. Experimenter: 4.66 
3. Interaction: .58 


* Significant beyond the 5% level, df = 1/36. 


550 


TABLE 5 


t TESTS OF PROCHLORPERAZINE EFFECTS UNDER SEXUAL 
AROUSAL CONDITIONING 


Drug group Response measure e 
Baseline 
Prochlorperazine vs. Amount of condi- | .06 
Placebo (First day: tioni! 
without drugs) (E1) 
Drug effect 
Prochlorperazine vs. Spontaneous re- +23 
Placebo covery 
(Ei — E2) 
Prochlorperazine vs. Reconditioning 41 
Placebo (E1 — E3) 


, “df = 18. A t of 2.10 is needed for significance at the .05 
level. 


TABLE 6 


Comparison OF SPEED OF EXTINCTION WITH POSITIVE 
AND NEGATIVE UNCONDITIONED STIMULI 


Group Main effects F 
Placebo* Spontaneous recovery vs. | .15 
Reconditioning 
Shock vs. Pin-up pictures | .12 
Interaction :22 
Prochlorperazine* Spontaneous recovery vs. | .07 
Reconditioning 
Shock vs. Pin-up pictures | .01 
Interaction 1.57 
Meprobamate* Spontaneous recovery vs. | .02 
Reconditioning 
Shock vs. Pin-up pictures |7.73* 
Interaction 1.59 


^ Includes only subjects run by the female experimenter. 
* Significant beyond the .01 level, df = 1/32. 


Since it was predicted that the drugs would 
facilitate extinction if an aversive drive were 
aroused but delay extinction if the aroused 
drive were affectively positive, the extinction 
rates under positive and negative conditioning 
were compared within each drug group (Table 
6). The placebo groups showed no difference in 
extinction rate attributable to the UCS, nor 
did the prochlorperazine groups. The mepro- 
bamate groups, however, were significantly 
different beyond the .01 level, with the group 
conditioned to pin-up pictures taking longer 
to extinguish than the group conditioned with 
shock. The expected faster extinction of the 
spontaneous recovery did not occur, perhaps 
because the second day’s procedure was identi- 
cal to that used the first day and thus alerted 
or allowed the subjects to become set for the 
extinction procedure. 
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The data of Experiment II are consistent 
with the hypothesis that meprobamate allows 
greater expression of sexual impulses, possibly 
by reducing internal inhibiting responses such 
as fear. The two experiments taken together 
are consistent with Olds’ (Olds & Travis, 
1960) finding that doses of meprobamai 
which inhibit avoidance reactions will not 
affect approach reactions, and with Sikorski’s 
(1958) finding that meprobamate facilitates 
the expression of sexual effect, The failure of 
the E1 — E2 difference to be statistically 
significant can perhaps be accounted for in 
terms of Hullian theory. In both experiments, « 
the predicted differences in the extinctio 
rates between the drug and placebo group 
was predicted on the assumption that the 
drugs affected the drive level of the organis 
and thus indirectly affected the reaction po- 
tential by combining multiplicatively with 
habit strength. However, after the first ex 
tinction session, the remaining habit strength 
was low, and, therefore the manipulation of 
drive on the second day had little effect. 
After reconditioning the second day, habit 
strength was high, and different drive levels: 
would, by multiplying with habit strength, 
produce a relatively greater difference in the 
reaction potential between the two groups 
than after the first day. Inspection of Tables 
1 and 3 shows this to be indeed the case: In 
all instances there is a greater difference be 
tween the drug and placebo groups for E1 — 
E3 than for E1 — E2, as predicted, though © 
the difference is not significant within 
groups taken separately. 


a 


prochlorperazine) on the retention and re s , 
learning (by means of conditioning procedures) 
of an emotional response to noxious an 


guished. On the second day, one hour after 
double-blind administration of either a drug 
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With a negative UCS, neither drug affected 
at a statistically significant level the number 
of trials to extinction after spontaneous re- 
covery, or the number required for recon- 
ditioning. With a positive UCS, (a) 
prochlorperazine affected neither spontaneous 
recovery nor reconditioning; (6) meprobamate 
did not affect spontaneous recovery; (c) the 
number of trials to extinction of the recon- 
ditioning was significantly higher under 
meprobamate than under placebo; (d) among 
subjects who had taken meprobamate, those 
run by a female experimenter took signifi- 
cantly more trials to extinguish after recondi- 
tioning than did those run by a male 
experimenter; (e) subjects under meprobamate 
but not under prochlorperazine took longer to 
extinguish a response conditioned with a 
positive UCS than with a negative UCS. 

, Of the four predictions for the prochlorpera- 

zine vs. placebo groups, only two were in the 
predicted direction, and none was significant. 
All six predictions for the meprobamate vs. 
placebo groups were in the predicted direction, 
but only two comparisons were statistically 
significant: drug effect and experimenter dif- 
ferences after reconditioning to sexually 
arousing pictures. Subjects under meproba- 
mate extinguished faster than the control 
group when the UCS was noxious, and more 
slowly than the control group when a positive 
UCS was used. The data do not indicate that 
the extinction of the spontaneous recovery 
was faster than the extinction of the recondi- 
tioning. The meprobamate groups, but not the 
placebo or prochlorperazine groups, extin- 
guished the conditioned response to shock 
significantly faster than the response condi- 
tioned to pin-up pictures. 
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THE RELATIONSHIP OF PERSISTENCE AT A TASK TO EXPECTATION 
OF SUCCESS AND ACHIEVEMENT RELATED MOTIVES: 


N. T. FEATHER? 
University of Michigan 


IHE present study investigates the rela- 

tionship of persistence at a task both to 

its apparent difficulty and to the relative 
strength within an individual of the motives 
to achieve success and to avoid failure 
(Feather, 1960). The situation to be con- 
sidered is one in which a subject comes to an 
objectively insoluble puzzle either believing 
it to be easy or very difficult. He works at this 
achievement task under test conditions and 
suffers repeated and consistent failure in his 
attempts to get the solution. Persistence is 
measured in terms of the total time or total 
trials which the subject works at the task 
before he turns to an alternative achievement 
activity. The former measure is sometimes 
referred to in the literature as temporal per- 
sistence; the latter is analogous to resistance 
to extinction. 

The hypotheses of the present investigation 
are developed from the theory of achievement 
motivation (Atkinson, 1957, 1960) together 
with certain additional assumptions. This 
theory relates characteristics of motivated 
behavior to the interaction of relatively stable 
personality dispositions (motives) and more 
transient situational influences (expectations 
and incentive values). As such, this approach 
belongs to a class of “expectancy value” 
theories which all involve somewhat similar 
concepts (Feather, 1959a). Within this con- 
text, the present research attempts to clarify 
the relationship of persistence at a task to 
expectations and motives by providing answers 
to the following two questions: 

1. How is persistence at a task affected by 


! This paper is based upon a doctoral dissertation sub- 
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versity of Michigan. 
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the initial subjective probability of success 
among subjects in whom the motive to achieve 
success is stronger than the motive to avoid 
failure (M, > May)? Do such subjects tend to 
persist longer at the task when it initially 
appears easy to them than when it initially 
appears very difficult? 

2. Conversely, how is persistence at a task 
affected by the initial subjective probability 
of success among subjects in whom the motive 
to avoid failure is stronger than the motive 
to achieve success (May > M,)? 

The present investigation goes beyond the 
earlier studies of French and Thomas (1958) 
and Atkinson and Litwin (1960) in its at- 
tempt to vary both motive and expectation 
simultaneously, Furthermore, it provides 
assumptions about the dynamics of changing 
motivation as expectations of success and 
failure change, and an explicit analysis of 
why the subject stops performing the task. 


THEORETICAL ASSUMPTIONS 


Consider a subject who is performing the 
initial achievement task. It is assumed that 
he will continue to persist at this task as long 
as total motivation to perform it is stronger 
than total motivation to perform the alterna- 
tive available to him. 

Total motivation to perform the initial 
achievement task is attributable to the follow- 
ing component motivations*: (a) achievement 
related motivation to perform the task, and 
(b) extrinsic motivation to perform the task. 
Similarly, since the alternative is also an 
achievement task, total motivation to perform 
it is attributable to analogous component 
motivations. 

Achievement related motivation to perform 
a task refers to the resultant of motivation to 


*It should be noted that a distinction is made be- 
tween motivation, which refers to a particular act or 
response, and motive. Differences in strength of motive 
(a latent disposition of personality) are assumed to be 
only one factor influencing the strength of motivation, 
the other influences being level of expectation and 
incentive value. 
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ACHIEVEMENT MOTIVATION AND TASK PERSISTENCE 


achieve success at the task and motivation 
to avoid failure at the task. According to the 
theory of achievement motivation (Atkinson, 
1957, 1960) these two motivations summate 
algebraically to give positive achievement 
related motivation (approach) for subjects in 
whom the motive to achieve success is stronger 


I than the motive to avoid failure (ie., when 


M, > Mas), and negative achievement related 
motivation (avoidance) for subjects in whom 
Ma; > M, Further, achievement related 
motivation whether positive or negative is at 
its maximum when subjective probability of 
success at the task is intermediate (i.e., when 
P, = .50) and decreases monotonically as 
P, becomes either smaller or larger than .50. 
If achievement related motivation were the 
only motivation elicited in the situation it is 
apparent that a subject with stronger motive 
to avoid failure should not even undertake 
performance of an achievement task. Instead 
he should avoid the task and choose activities 
which do not arouse anxiety about failure. In 
contrast, a subject with stronger motive to 
achieve success should show some positive 
interest in performing an achievement task. 

The concept of extrinsic motivation to 
perform a task is introduced to account for 
the fact that the subject is in a social situation 
in which he has the role of a subject in an 
experiment and knows he is expected to make 
Some attempt at the task. In any test situa- 
tion there are certain extrinsic constraints 
that influence the subject to perform the task 
irrespective of the nature of his achievement 
related motivation. Studies by French (1955), 
and by Atkinson and Raphelson (1956) have 
shown, for example, that other motives like 
n Affiliation are sometimes systematically 
related to task performance in a situation 
where no achievement orientation is given but 
cooperation is requested. The usual social 
constraints (ie., desire for approval, fear of 
disapproval) provide an important source of 
Motivation for subjects in whom Ma; > M.. 
If these subjects are to perform the task at all, 
Some positive motivation must exist to oppose 
their tendency to avoid an achievement task. 
For subjects in whom M, > Ma, the ex 
trinsic motivation to perform an assigned task 
enhances their normally positive motivation 
to perform achievement related tasks. In 
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both cases, task performance may be con- 
sidered overdetermined, that is, the result of 
two or more different kinds of motivation to 
perform or not to perform the task (cf. Atkin- 
son & Reitman, 1956). 

In the light of this analysis of the com- 
ponents of total motivation, the basic condi- 
tion for performance of the initial task rather 
than the alternative task is as follows: 


INITIAL TASK 
Achievement related Extrinsic 
+ 
motivation motivation 
ALTERNATIVE TASK 
Achievement related Extrinsic 
> 
motivation motivation 


It is assumed that the subject will turn to 
the alternative achievement task when: 


INITIAL TASK 
Achievement related Extrinsic 
+ 
motivation motivation 
ALTERNATIVE TASK 
Achievement related Extrinsic 
< 
motivation motivation 


How then does total motivation to perform 
the initial achievement task become weaker 
than total motivation to perform the alterna- 
tive, so that the subject gives up persisting 
at the initial task and turns to the alternative? 
To permit derivation of hypotheses the follow- 
ing three assumptions are made: 

1. Both extrinsic motivation to perform 
the initial task and extrinsic motivation to 
perform the alternative task are assumed to be 
constant across experimental conditions. 

2. Extrinsic motivation to perform the 
initial task is assumed to be stronger than 
extrinsic motivation to perform the alternative 
task. This is a strong assumption that at least 
acknowledges the fact that the subject is 
asked by the experimenter to begin the initial 
task first. 

3. The subjective probability of success 
for the alternative achievement task is as- 
sumed to be constant across experimental con- 
ditions. Hence, achievement related motiva- 
tion to perform the alternative task is à 
positive constant for subjects in whom the 
motive to achieve success is stronger (ie 
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when M, > May), and has the same value (but 
negative) for subjects in whom May > M. 

It follows that any decrease in total motiva- 
tion to perform the initial task must be related 
to change in achievement related motivation 
to perform the initial task. Since, in the theory 
of achievement motivation, the motives to 
achieve success (M;) and to avoid failure 
(Mas) are considered relatively stable disposi- 
tions of the personality, change in achieve- 
ment related motivation to perform the 
initial task must be mediated by variation in 
the subject's expectation of success at the 
task as he works at it unsuccessfully. Re- 
peated unsuccessful attempts at the initial 
task are assumed to produce successive de- 
creases in subjective probability of success 
at the task (P,). Decrease in P, is thus the 
basic dynamic principle assumed to mediate 
decrease in total motivation to perform the 
initial task. 

The following two specific assumptions are 
made about this decrease in expectation of 
success: 

1. When the task is presented as a test of 
skill, reduction in P, to a particular value is 
assumed to require more unsuccessful at- 
tempts at the task when P, is initially high 
than when P, is initially low. 

2. The rate at which decrease in P, occurs 
is assumed not to be systematically related to 
the strength of either the motive to achieve 
success (M,) or the motive to avoid failure 
(Maj). 

HYPOTHESES 


The following four hypotheses are tested 
in the present investigation: 

Hypothesis 1 states that when the motive 
to achieve success is stronger than the motive 
to avoid failure (M, > Mar), persistence at the 
initial achievement task should be greater when 
initial subjective probability of success is 
high (ie, some P, > .50) than when initial 
P, is low (i.e., some P, < .50). 

Hypothesis 2 states that when Mas > M, 
persistence at the initial achievement task 
should be greater when initial P, is low (i.e., 
some P, « .50) than when initial P, is high 
(i.e., some P, > .50). 


^ This assumption is consistent with the distinction 
between tasks involving skill and tasks involving a 
degree of external control (e.g., chance) presented by 
James and Rotter (1958). 
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: 

Hypolhesis 3 states that when initial P, 
is high (i.e., some P, > .50), subjects in whom 
M, > Ma; should persist longer at the initial 
achievement task than subjects in whom 
May > M. 

Hypothesis 4 states that when initial P, is 
low (ie, some P, < .50), subjects in whom 
Ma; > M, should persist longer at the initial ph 
achievement task than subjects in whom ` 
M,» Ma. 

The detailed derivation is presented for 
Hypotheses 1 and 2 only. The derivation of 
‘Hypotheses 3 and 4 follows the same line of, 


argument. 
Derivation of Hypothesis 1 


Consider a subject in whom the motive to 
achieve success is stronger than the motive 
to avoid failure (Le. M, > May), and to 
whom the initial achievement task is presented 
as easy (i.e., some P, > .50). For this subject, 
achievement related motivation to perform 
the task is positive (approach) and becomes 
increasingly positive as his subjective prob- 4| 
ability of success (P,) at the task falls to .50 
with successive failures. Hence, total motiva- 
tion to perform the initial task should increase 
as the task appears more difficult to the subject 
up to the point at which his P, = .50. At this 
point positive achievement related motivation 
and, hence, total motivation to perform the D 
task would be maximum. Thereafter, as P, 
gets lower and the task appears more and 1 
more difficult to the subject, positive achieve- 
ment related motivation decreases until at .. 
some low level of P, total motivation to | 
perform the initial task becomes weaker than 
total motivation to perform the alternative. 
The subject should then quit the initial task —' 
and turn to the alternative. ; F, 

In contrast, if the initial task were presented i 
to this subject as very difficult (i.e, some — 
P, « .50), there would be an immediate de- 
crease in the positive achievemént related — 
motivation to perform the task as his P, drops . 
with repeated failure. Hence, total motivation 
to perform the initial task would decrease . 
immediately. In this condition it should take & 
less unsuccessful attempts at the task to reduce 
P, to the low level at which total motivation 
to perform the initial task becomes less tha : 
total motivation to perform the alternative. 
Hence, we would expect a subject in whom l 
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M, > Mas to persist longer at the initial 
achievement task when his P, is initially high 
than when his P, is initially low. 


Derivation of Hypothesis 2 


Consider a subject in whom the motive to 
avoid failure is stronger than the motive to 
achieve success (i.e., May > M,), and to whom 
the initial achievement task is presented as 
easy (ie., some P, > .50). For this subject, 
achievement related motivation to perform 
the task is negative (avoidance) and becomes 
increasingly negative as his P, at the task 
drops to .50 with repeated failure. Hence, 
total motivation to perform the initial task 
should decrease as the task appears more 
difficult to the subject up to the point at 
which his P, = .50, where negative achieve- 
ment related motivation would be maximum. 
At this point total motivation to perform the 
initial task should therefore be minimum. It 
follows that, if the subject is to quit the task 
at all, he should do so as his P, at the task 
falls to .50 since it is during this stage of task 
performance that total motivation to perform 
the task is decreasing. 

In contrast, if the initial task were pre- 
sented to this subject as very difficult (i.e., 
some P, < .50), there would be an immediate 
decrease in negative achievement related 
motivation as his P, drops with repeated 
failure. Hence, total motivation to perform 
the initial task would increase immediately, 
and the subject should continue to perform 
the task indefinitely. We would therefore 
expect a subject in whom Ma; > M, to persist 
longer at the initial achievement task when 
his P, is initially low than when his P, is 
initially high. 

In the preceding analysis, differences in 
persistence have been related to differences in 
the way in which total motivation to perform 
the initial achievement task changes with 
decrease in P, as a subject experiences re- 
peated failure at the task. The different types 
of change in total motivation are summarized 
in Table 1 in relation to differences in motive 
and differences in initial P,. 

Adequate tests of the hypotheses of the 
study clearly require development of pro- 
cedures which communicate to the subjects 
that they are expected to perform the initial 
achievement task first, but that they may 
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TABLE 1 


CHANGES IN TOTAL MOTIVATION TO PERFORM THE 
INITIAL ACHIEVEMENT Task AS P, DECREASES IN 
RELATION TO DIFFERENCES IN MOTIVE (M, > 
Mes and May > M.) AND DIFFERENCES 
IN [ĪNITIAL SUBJECTIVE PROBABILITY 
or Success (P, > .50 and P, < .50) 


Motive 


[o in total motivatit 
rx Bund hange in 5 motivation as P, 


Increase followed by decrease 
Immediate decrease 

Decrease followed by increase 
Immediate increase 


feel free to move on to the alternative achieve- 
ment task whenever they choose to do so. If 
extrinsic motivation to perform the initial 
task were very strong in relation to extrinsic 
motivation to perform the alternative task, 
none of the differences in persistence predicted 
here would be expected to occur. 


METHOD 


Subjects 

Thematic apperceptive stories under neutral con- 
ditions and Mandler-Sarason Test Anxiety Question- 
naire provided measures of strength of n Achievement 
(M,) and Test Anxiety (May) for 89 male college stu- 
dents from an introductory psychology course at the 
University of Michigan in 1959.5 The TAT was ad- 
ministered according to the standard procedure (Mc- 
Clelland, Atkinson, Clark, & Lowell, 1953). Six pictures 
were presented in the following order (using numbers 
assigned by Atkinson, 1958): 2, 48, 1, 7, 100, 24. In- 
terscorer reliability for scoring n Achievement was .93.* 

In subjects classified High n Achievement-Low Test 
Anxiety (in terms of median splits) it was assumed 
M. > Moy ; in subjects classified Low n Achievement- 
High Test Anxiety it was assumed May > M,. 


Task 


A task consisting of four items and labeled the 
Perceptual Reasoning test was presented to 34 of these 
preselected subjects in individual test sessions of one 
hour duration. Subjects were randomly assigned so that 
approximately half of each of the High n Achievement- 
Low Test Anxiety and Low n Achievement-High Test 
Anxiety groups came to the first. item of the Perceptual 
Reasoning test with a high initial subjective probability 
of success (P); the other half came to the first item 
with a low initial P, . The procedure adopted in assign- 
ing subjects guaranteed that the experimenter would be 
unaware when testing a subject for persistence of his n 
Achievement and Test Anxiety scores. The initial P, for 


5 The author wishes to thank Charles P. Smith for 
administering the TAT and Test Anxiety Question- 


naire. 
6 The author wishes to thank William Larkin for his 
assistance in scoring the TAT protocols. 
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each item of the test was experimentally induced by 
using fictitious group norms and the subject could leave 
each item as he wished and turn to the next one in the 
series. 

Each item of the Perceptual Reasoning test involved 
a line diagram approximately 1.5 inches square printed 
on a white card 6” X 4”. Copies of a particular item 
were stacked in a pile 2” high and the four piles of cards 
were placed in line on the desk immediately in front 
of the subject. The piles were designated 1-4 from left 
to right and the subject could not see the content of any 
item until he began to work at it, i.e., the precise content 
of the alternatives to which he could turn if he wished 
to quit an item was unknown to him. 

For each item the subject’s task was to trace over all 
the lines of the diagram with a red pencil according to 
two rules: he was not permitted to lift his pencil from 
the figure; he was not permitted to trace over any line 
twice. It is a simple matter to construct figures for 
which the problem is insoluble but which are sufficiently 
complex so that the subject is unable to see this. The 
subject was allowed to take as many trials at an item 
as he wished. If he wanted another trial at the item, he 
simply took another copy of the same item from the top 
of its pile and worked at it. The following instructions 
to the subject summarize the restrictions placed on task 
performance: 

Naturally there are certain restrictions involved in 
this test. Youcan only work at an item for 40 seconds 
at a time, and I shall be timing you on that. But you 
can have as many of these 40 second trials as you 
want. If you fail on a trial, that is if you don’t suc- 
ceed in tracing over all the lines in the figure, you 
will then have the choice of continuing or going on 
to the next item. If you want to go on to the next 
item you should let me know at once. Once you've 
stopped working at an item you can't go back to it 
again. If you want to continue with the item you 
should turn the failed copy face downwards. You can 
then take another copy of the same item from the 
par and you will again have 40 seconds to work at 

t. 

Obviously, for an insoluble item, each trial the subject 
took resulted in failure. Using this procedure, one can 
obtain two measures of persistence for an insoluble item: 
total time spent in working at the item; and number of 
trials taken at the item before turning to the next one in 
the series. The former measure is akin to the typical 
temporal measure used in studies of persistence; the 
latter is analogous to resistance to extinction. 

It will be noted that the alternative activity to which 
the subject could turn was another achievement task 
similar in content to the one at which he had been 
working. 

Of the four items involved in the Perceptual Reason- 
ing test the first and third were insoluble, the second and 
fourth items were soluble. 


Experimental Induction of Subjective Prob- 
ability of Success 


The following instructions were given: 

Now the four items vary in difficulty. Some are 
harder than others, and you're not expected to be 
able to solve all of them, but do the best you can. 
Before I present each item I’m going to let you know 
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the percentage of college students who are able to 

pass that item. . . . Well, let's look at the norms for 

the first item. These go by age. How old are you? 

The tables show that at your age level—per cent, 

that is approximately— per cent of college students 

are able to get the solution. 

Table 2 presents the fictitious norms for each of the 
four items of the Perceptual Reasoning test that were 
reported to Groups A, B, C, and D. These norms were 
assumed to be sufüciently different to determine | 
differences in initial P, for each of the four items. It 
should be noted that initial P, on items following the 
first may be influenced not only by the norms reported 
to the subject but also by the subject's actual experience 
with the preceding items. 

The following final instructions were given before 
Ttem 1 was presented to the subject: 

Try to get the solution if you can. It's quite OK for 
you to take as many 40 second trials as you want at 
the item. But remember that the four items do vary 
in difficulty, some are harder than others and you 
may not be able to solve all of them. So if you should 
feel that you're not getting anywhere with the item 
you should let me know at once so that we can move 
on to the next one. Here is Item 1. You should find 
it fairly easy/difficult. 

For each insoluble item the subject was timed from 
when he started the item until when he nominated to 
try the next item in the set. When he quit an insoluble 
item, the copies of the item he had attempted were 
Stacked to one side out of sight and their number was 
later recorded. 


Effectiveness of Reported Norms 


Norm procedure: After reporting the fictitious group 
norm but before presenting the item, the experimenter 
gave the following instruction: *...I'm also going to 
ask you to let me know what you think your own 
chances are of solving the problem. You can do this very 
simply by marking a number along this scale. I'll get 
you to do this before I present each item and I'll also 
get you to check here how confident you are about that 
estimate" To obtain the probability estimates a 20- 
point rating scale was used, numbered from 0 to 100 


TABLE 2 
Fictitious PERCENTAGES^ Or Success REPORTED TO 
SUBJECTS IN THE EXPERIMENTAL INDUCTION: 
OF INITIAL DIFFICULTY FOR EACH 
or Four Test ITEMS 


Reported percentages 


Movadon | 1 
Group 
ieee *- |Test Anxiety| 1 | 2 See 
A High Low | 70| 50} 5 | 50 
B High | Low | 5| 50| 70| 50 
S Eo High | 70] 50| 5 | 50 
D Low High | 5|50|70 


^ Actual percentages reported deviated slightly from Or 
ulated values in that they were given to one decimal plac 
add to authenticity (e.g., 69.4%). 
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insteps of five. The subject made his rating of confidence 
in his probability estimate on a five-category Likert- 
type scale ranging from “Not certain at all” to “No 
doubt about it at all." 


Time Limits for Insoluble Items 1 and 3 


Data concerning persistence at Item 1 were from the 
very outset considered more crucial for the test of the 
hypotheses since results for other items would be com- 
plicated by differential sequence effects. For this reason 
each subject was allowed to persist at Item 1 for 20 
minutes (if he so desired) so as to increase the possi- 
bility of obtaining a wide variance in persistence scores. 
If the subject was still persisting at Item 1 after 20 
minutes it was suggested that he move on to Item 2. 
This interruption procedure for Item 1 necessarily intro- 
duced a time differential for Item 3, some subjects hav- 
ing more time to work at this item than others, depend- 
ing on how long they had worked at Item 1, Subjects 
still working at Item 3 were interrupted at approxi- 
mately 15 minutes before the hour to allow sufficient 
time for administration of a postperformance question- 
naire. 


Postperformance Questionnaire 


Finally, the experimenter administered a question- 
naire to each subject specifically designed to provide 
information about the assumptions involved in the hy- 
potheses. The majority of questions were presented in 
Likert-type form and were related to Item 1, When the 
subject had completed the questionnaire he was thanked 
for his participation in the experiment, commended on 
his performance at a fairly difficult task, and requested 
to keep details of the experiment in confidence. A re- 
port of the experiment was sent to each subject at a 
later date. 


RESULTS 


The basic data relevant to the hypotheses 
of the present investigation are the persistence 
scores of the four groups in terms of time and 
trials for the first insoluble item (Item 1). 
The results of analysis of persistence scores 
for the second insoluble item (Item 3) are 
Presented in less detail since the necessity 
for unanticipated interruption of a sizable 
number of extremely persistent subjects on 
the first and third items, and the possibility 
of uncontrolled sequence effects, make in- 
terpretation of results for the third item 
equivocal. 


Analysis of Persistence at Item 1 


Since time and trials scores are perfectly 
correlated for Item 1, only the analysis of 
persistence trials is presented. By persistence 
trials is meant the number of trials the subject 
took at Item 1 before he quit to move on to 
the second item or before he was interrupted. ` 
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The median of these persistence trials at 
Item 1 for all subjects is 20 with a range from 
2 to 41+.7 Persistence trials for subjects of 
each of Groups A, B, C, and D are classified 
as high and low in terms of whether they are 
above or below this median, and the resulting 
frequencies are presented in Table 3. Below 
Table 3 are presented the results of a parti- 
tioned x? analysis (Sutcliffe, 1957) applied 
to these data. 

Table 3 shows that only the x? representing 
the interactive effect of motivation and ex- 
pectation on persistence is significant (p < 
.01). While the proportion of subjects classi- 
fied high in persistence increases as the reported 
norm varies from 5% to 70% (i.e., from diffi- 
cult to easy) for subjects high in n Achieve- 
ment and low in Test Anxiety, this proportion 
decreases for subjects low in m Achievement 
and high in Test Anxiety with the same varia- 
tion in the reported norm. This implies a 
basic difference in the way persistence is 
related to the initial difficulty of the item 
among subjects in whom M, > Mas versus 
subjects in whom Mas > Ms. 

Relationships in Table 3 are clearly con- 
sistent with Hypotheses 1, 2, 3, and 4. Sub- 
jects in whom it is assumed that M: > My 
(the High-Low subjects) show greater per- 
sistence at the task when the reported norm 
defines it as easy (70% norm) than when the 
norm defines it as difficult (5% norm). This 
result supports Hypothesis 1 in implying that 
subjects in whom M, > May tend to persist 
longer at the item when initial expectation of 
success is high rather than low. In contrast, 
subjects in whom it is assumed that Ma; > 
M, (the Low-High subjects) show greater 
persistence at the task when the reported 
norm defines it as difficult (5% norm) than 
when the norm defines it as easy (7076 norm). 
This result supports Hypothesis 2 in implying 
that subjects in whom Ma; > M, tend to 
persist longer at the item when initial ex- 
pectation of success is low rather than high. 
Furthermore, among subjects for whom the 
reported norm defines the item as easy, the 
proportion of subjects showing high per- 
sistence is greater for those classified as high 

7 The + indicates that the subject had to be inter- 
rupted. Due to the interruption procedure some scores 
are open-ended. Hence, results are classified on the 
basis of median splits and relationships in the resulting 
frequency tables analyzed using x? as the basic statistic. 
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TABLE 3 


NuusER or SuBjEecrs Wao Were HIGH AND Low IN PERSISTENCE ON ITEM 1 IN RELATION TO 
STATED DIFFICULTY OF THE Task AND NATURE OF MOTIVATION 


Motivation Persistence trials 
Group Stated difficulty of task High Lon 
n Achievement Test Anxiety obre Be 
A High Low 70% (easy) 6 2 
B High Low 5% (difficult) 2 7i 
C Low High 7095 (easy) K] 6 
D Low High 5% (difficult) 6 2 
A and B High Low Mo, 8 9 
Cand D Low fuel: h oy 5% 2 : 
Aand C Both conditions % (easy, 
Band D Both conditions 5% (difficult) 8 9 
Partition of x? 
Source Value af b 
Motivation X Persistence 212, 1 2.05 
Expectation X Persistence 712 1 2.05 
Motivation X Expectation X Persistence 7.65 1 <.01 
Total 7.89 3 <.05 


in n Achievement and low in Test Anxiety. 
This result is consistent with Hypothesis 3 in 
implying that, when the initial expectation of 
success is high, subjects in whom M, > Mas 
tend to persist longer at the item than subjects 
in whom Mas > M.. Finally, among subjects 
for whom the reported norm defines the item 
as difficult, the proportion of subjects showing 
high persistence is greater for those classified 
as low in n Achievement and high in Test 
Anxiety. The result is consistent with Hy- 
pothesis 4 in implying that, when the initial 
expectation of success is low, subjects in 
whom Mas > M, tend to persist longer at the 
item than subjects in whom M, > Maş. 


Analysis of Persistence at Item 3 


Persistence data for Item 3 compared across 
Groups A, B, C, and D show no significant 
trends. However, when comparisons of per- 
sistence scores for Items 1 and 3 are made 
within the same group, subjects of Group B 
(High-Low subjects) persist longer at Item 3 
which for them is defined by the reported 
norm as easy than at Item 1 which is defined 
by the norm as difficult. This change is con- 
sistent with Hypothesis 1. Subjects of Group 
C (Low-High subjects) persist longer at 
Item 3 which for them is defined by the re- 
ported norm as difficult than at Item 1 which 


is defined by the norm as easy. This change is 
consistent with Hypothesis 2. However, when 
ambiguous comparisons of persistence scores 
are excluded (due to the interruption proce- 
dure), there is a tendency for all subjects to 
persist longer at Item 3 than at Item 1. This 
suggests that an influence common to all 
subjects might have operated to increase 
persistence at Item 3. 

When such an influence is assumed and 
ambiguous differences in persistence scores 
(due to the interruption procedure) are ex- 
cluded, the data support the qualified predic- 
tion that Groups À and D should at least 
tend to increase relatively less in persistence 
from Item 1 to Item 3 than do Groups B and 
C. However, this result is equivocal since, as 
a result of limitations of time, subjects of 
Group A and D who persisted relatively longer 
at Item 1 may not have had the same op- 
portunity to increase in persistence at Item 
3 as did subjects of Groups B and C. 
Analysis of Supplementary Data 

Analysis of data concerned with effectiveness 
of experimental procedures supports the as- 
sumption that the different reported norms 
determined widely different initial subjective 
probabilities of success. The probability 
estimates given by subjects in all four groups 
tend to follow the reported norms very closely- 
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Data obtained from the postperformance 
questionnaire are generally consistent with 
assumptions involved in hypotheses. For 
example, subjects in all four groups report 
decreases in estimates of probability of suc- 
cess as they work at Item 1 with repeated 
failure. This decrease is consistent with the 
basic dynamic assumption that expectation 
of success should diminish with repeated failure 
at the task. In addition, there is no evidence 
that the rate of decrease in probability esti- 
mates with failure at Item 1 differs between 
the two motivation conditions (High-Low 
subjects versus Low-High subjects). This 
implies that rate of decrease in P, is inde- 
pendent of the relative strength of M, and 
Mas within the individual. 


Discussion 


Where the present investigation permits 
unequivocal test of the four hypotheses (i.e., 
for Item 1), results are consistent with pre- 
dictions. Furthermore, postperformance ques- 
tionnaire data generally support the basic 
assumptions involved in these predictions. 

The investigation stands in marked contrast 
to the typical trait studies of persistence and 
extinction studies with human subjects since 
hypotheses are based on a consideration of 
personality and situational factors in interac- 
tion. Trait studies of persistence typically 
Concentrate on the relatively stable aspects 
of persons by looking for consistencies in 
behavior and they tend to neglect the role of 
the situation. In contrast, extinction studies 
usually attempt to relate persistence of be- 
havior to characteristics of the acquisition 
Stage and they tend to ignore the influence on 
resistance to extinction of relatively stable 
personality characteristics. The contribution of 
the present study is to investigate persistence 
as a phenomenon determined by the interac- 
tion of both personality and situational in- 
fluences. 

The present investigation also differs from 
the usual extinction investigation by using 
a task that could be seen by the subject as 
involving personal skill (cf. James & Rotter, 
1958) and that provided a relatively inex- 
haustible range of alternative responses 
following failure. In contrast, extinction 
Studies with humans have often used tasks in 
which the subject may see success and failure 
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as largely beyond his control (e.g., the Hum- 
phrey’s apparatus) and where the response is 
restricted to a particular action (e.g., pulling 
down a lever). Furthermore, in the present 
investigation initial expectation of success 
was varied by using fictitious norms. In con- 
trast, in extinction studies the expectation may 
be considered as based mainly on frequency 
and pattern of reinforcement and nonreinforce- 
ment in the acquisition series. 

The theoretical analysis may be extended 
to a consideration of persistence under condi- 
tions that differ from those of the present 
investigation in important respects. The 
general paradigm of the persistence situation 
is that in which the subject has some expecta- 
tion of attaining a goal, is unrestricted in the 
number of attempts he can have at attaining 
it, and has an alternative activity to turn to 
as he wishes. Within this general framework 
the following differences may be noted which 
in turn imply specific types of persistence 
situation worthy of investigation: 

1, Performance of the alternative to which 
the subject may turn may involve motives 
that are the same or different from those in- 
volved in the performance of the activity at 
which he is presently engaged. The persistence 
situation studied in the present investigation 
is one in which initial task and alternative 
activity belong to the same class of activity. 
Total motivation to perform the initial task 
and total motivation to perform the alterna- 
tive task are both attributable to the same 
motivational components. In contrast, we can 
consider a type of persistence situation in 
which the initial task and the alternative task 
do not belong to the same class of activity, 
ie, where the alternative is motivationally 
different in the sense that total motivation to 
perform the alternative is influenced by 
motives which are not involved in performance 
of the initial activity. Investigations of per- 
sistence by French and Thomas (1958) and 
by Atkinson and Litwin (1960) appear to fall 
in this latter class since in both of these 
studies the alternative activity to which the 
subject can turn does not necessarily involve 
achievement related motivation. 

2. The situation does or does not involve 
types of motivational tendency in which in- 
centive values and expectations are assumed 
to be dependent. In the achievement context 
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dependencies are assumed between the positive 
incentive value of success and the subjective 
probability of success, and between the nega- 
tive incentive value of failure and the sub- 
jective probability of failure (cf. Feather, 
1959a, 1959b). But, in the more general case 
(e.g., efforts to get food when hungry), there 
would be no dependency between incentive 
value and expectation, and, hence, no increase 
to a maximum motivation followed by a de- 
crease as expectation of goal attainment 
(P,) changes from P, > .50 to P, < .50 (as 
occurs for achievement related motivation). 
Instead we would expect motivation to per- 
form the activity to decrease continuously 
with decrease in the strength of expectation 
of goal attainment, providing strength of 
motive and incentive remained constant. 

3. Persistence situations may differ ac- 
cording to whether or not the incentive is 
objectively present. One can consider a 
persistence situation in which the incentive 
is actually in view, e.g., young children at- 
tempting to overcome an obstacle in order to 
get a candy. In the present investigation and 
in the human extinction type of study, the 
incentive is not objectively present. In the 
Perceptual Reasoning test, the subject has to 
solve the puzzle; he cannot see the solution. 
In the extinction situation the reinforcement 
does not appear in the extinction series. 
Hence, it is possible for the subject to develop 
the expectation that there is in fact no in- 
centive, e.g., that the puzzle is insoluble or 
that the experimenter has “fixed” the apparatus 
so that no more reinforcements will occur. 
In other words, the subject may conceive of 
these situations as beyond his control, as not 
involving skill. Consequently, special care is 
required to ensure that the subject retains 
the belief that only his efforts will influence 
the outcome. 

These three differentiating characteristics, 
in combination, yield the eight specific types 
of persistence situation summarized in Table 
4. The persistence situation investigated in 
the present study corresponds to 5, i.e., 
performance of the alternative involves the 
same kind of motivation as the activity at 
which the subject is presently engaged, de- 
pendencies are assumed between incentive 
values and expectations, and the incentive 
is not objectively present. In contrast, the 
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TABLE 4 


TYPES OF PERSISTENCE SITUATION ACCORDING TO 
WHETHER OR NOT THE ALTERNATIVE Is Morva- 
TIONALLY THE SAME OR DIFFERENT FROM THE 
PRESENT ACTIVITY, DEPENDENCIES BETWEEN 
INCENTIVE VALUES AND EXPECTATIONS 
Exist or po Nor Exist, AND THE 
INCENTIVE IS OBJECTIVELY 
PRESENT OR ABSENT 


Dependency, 
tive paan Incentive 
Persistence (motivatonally afrpectation (objectively 
situation same or value present or 
different) (presentior absent) 
absent) 
1 same present present 
2 different present present 
3 same absent present 
4 different absent present 
5 same present absent 
6 different present absent 
7 same absent absent 
8 different absent absent 


persistence situation studied by French and 
Thomas (1958) and by Atkinson and Litwin 
(1960) corresponds to 6 since performance of 
the alternative in these studies probably 
involves different motivation from that in- 
volved in the activity at which the subject 
is engaged. It should be possible to construct 
experimental situations corresponding to the 
other six cases and to analyze persistence 
behavior in each of these in terms of the in- 
teraction of motives, expectations, and in- 
centives, Such an approach would require (a) 
specification of the nature of the motives, 
expectations, and incentives involved in the 
situation; (b) measures of the strength of these 
motives, expectations, and incentives; (c) 
assumptions concerning how these basic 
variables combine to determine the various 
component motivations; (d) specification of 
the manner in which expectation changes 
with experience at the activity; and (e) as- 
sumptions concerning the resolution of the 
set of component motivations into a resultant 
motivation. h 

The present study has shown that this 
strategy does work for the type of persistence 
situation investigated here. Detailed specifi- 
cation of the components of total motivation 
to perform a task in relation to total motiva- 
tion to perform the available alternative, !n 
conjunction with a dynamic principle of 
expectation change, has permitted differ- 
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ential predictions about persistence. This 
type of analysis, which takes account of both 
personality dispositions and situational in- 
fluences in interaction according to an “ex- 
pectancy value” model, may also be required 
to explain persistence in the other types of 
situation. 
SUMMARY 


This investigation examines the relation- 
ship of persistence at a task both to its ap- 
parent difficulty and to the relative strength 
within an individual of the motives to achieve 
success and to avoid failure. A subject per- 
forms a task presented to him as part of an 
important test, undergoes repeated failure at 
it (because it is in fact insoluble), but may 
turn to an alternative achievement task 
whenever he desires. Strength of motive to 
achieve success is assessed by the TAT n 
Achievement procedure; strength of motive 
to avoid failure is assessed by Mandler- 
Sarason Test Anxiety Questionnaire. Apparent 
difficulty of the task (i.e., initial expectation 
0f success) is varied by use of fictitious group 
norms. Persistence is measured by total 
trials or total time the subject works at the 
task before turning to the alternative. 

Hypotheses are derived from Atkinson’s 
theory of achievement motivation with addi- 
tional assumptions. The following hypotheses 
are tested: 

Hypothesis 1 states that when motive to 
achieve success (M,) is stronger in the subject 
than motive to avoid failure (Mas) he should 
be more persistent at a task when his initial 
subjective probability of success (P,) is high 
(easy task, P, > .50) than when his initial 
P, is low (difficult task, P, < .50). 

Hypothesis 2 states that when Ma; > M, 
the subject should be more persistent when 
us oe P, is low than when his initial P, is 

gh. 

„Hypothesis 3 states that when initial P, is 
high, a subject in whom M, > Mas should be 
more persistent than one in whom Mas > Ms. 

Hypothesis 4 states that when initial P, is 
low, a subject in whom May > M, should be 
more persistent than one in whom M, > Mos. 

Results are generally consistent with hypoth- 
€ses and with assumptions involved in the 
Predictions. 

As a guide to future research a theoretical 
scheme is presented in which different classes 
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of persistence situations are identified. The 
present investigation indicates the importance 
of specifying the components of total motiva- 
tion to perform a task in relation to the com- 
ponents of total motivation to perform the 
alternative when attempting to predict degree 
of persistence. The theoretical analysis sug- 
gests the possibility of conceptualizing each 
component motivation in “expectancy value" 
terms, and the importance of change in ex- 
pectation as a dynamic principle mediating 
change in motivation. The factual evidence 
clearly provides a demonstration that per- 
sistence can be conceptualized as an interac- 
tion of personality dispositions and situa- 
tional influences. 
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IMPULSIVE CLOSURE AS REACTION TO FAILURE-INDUCED 
THREAT! 


JAMES E. DITTES 
Yale University 


ONFRONTING diffuse, disorganized, and 

ambiguous pieces of information in 

the environment, persons vary in the 
way in which they impose conclusions, pat- 
terns, or meaning. They vary not only in the 
nature of the closure they impose, but also in 
their readiness to impose any closure at all. 
Given a particular series of stimuli, some per- 
sons form a closure readily as best they can 
with the available information, and others tend 
to defer judgment, leaving the stimuli am- 
biguous until more information is available. 
In most instances, a moderate delay in closure 
is more adaptive and produces a higher quality 
closure than does an impulsive judgment. 

Such variability in the tendency to form a 
closure has not been widely recognized and 
studied, except in special cases such as those 
labeled as rigidity, conformity, persuasibility, 
suggestibility, ethnocentrism, prejudice, and 
resistance to free association. These special 
cases attract interest either because the be- 
havior is maladaptive or because particular 
identifiable sources are used by the person to 
help him form the closure. Examples of such 
identifiable sources are the judgments of others 
in the case of conformity, or one’s own prior 
judgments, in the case of rigidity. 

So far as individual differences in the tend- 
ency to closure have been studied, they have 
generally been attributed to cognitive impair- 
ment due to distraction or emotional inter- 
ference (e.g., Asch, 1952; Moffitt & Stagner, 
1956; Shipley, 1956), or else to different 
"styles" (Frenkel-Brunswik, 1949; Gardner, 
Holzman, Klein, Linton, & Spence, 1959; 
Holzman, 1954; Klein, 1958; Messick & Hills, 
1960). The latter line of investigation has 
looked for correlated patterns in various 
aspects of cognitive and related behavior. 

A motivational basis for differences in the 
strength of tendency to impose closure has 
been proposed (Dittes, 1959) with the theory 
that closure, regardless of its contents, ac- 


1 This experiment was supported by Grant M-2590 
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quires a specific reward value (in most sectors 
of our culture at least), as a means of enhancing 
self-esteem and self-confidence. The reward 
value of closure is the result of previous ex- 
perience in a wide variety of circumstances 
in which important need satisfactions are de- 
pendent upon or closely associated with cog- 
nitive closure. Examples would include pa- 
rental approval of a child’s learning to name 
objects or the relief following recognition of a 
pattern of job promotion which makes it clear 
that being passed over is not due to personal 
inadequacies. On the basis of such acquired 
reward value, the prediction follows that per- 
sons experiencing a greater need to maintain 
or enhance their sense of self-esteem are more 
likely to impose closure. 

A previous experiment has given support 
to this general theory (Dittes, 1959), Self- 
esteem was manipulated via apparent ac- 
ceptance by peers, social approval presumably 
being one important source of self-esteem. It 
was found that persons made to experience 
low approval tended subsequently to demon- 
strate more impulsive closure than those who 
experienced higher approval. ; 

However, the results of this single experi- 
ment could be attributed not to the mediation 
of threatened self-esteem but more directly to 
the variable of social acceptance, persons seek- 
ing to impose closure as a kind of “show-off” 
because they believed this would win them re- 
gained acceptance. This interpretation of the 
previous results remained possible, although it 
was made unlikely by several controls that 
were used. 

The self-esteem theory predicts more gen- 
erally that similar results could be found when 
self-esteem is manipulated by some quite 
different variable. The present experiment 
undertook to affect self-esteem by manipulat- 
ing the subjects’ sense of success or failure 
on an ego involved task. Care was taken to 
rule out any possible effects of actual or antici- 
pated social approval. Performance was anony- 
mous so far as the subjects knew and the sub- 
jects could not anticipate any social approval 
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or disapproval because of their performance. 
A sample of graduate students was used to 
provide subjects who were likely to be par- 
ticularly vulnerable to success and failure 
manipulations. 

An experimental group was made to ex- 
perience failure on an ego involved task. Two 
control groups were used: one that was made 
to experience success, and one that experienced 
failure to the same degree as the experimental 
group but was led to regard the task as un- 
related to crucial self-esteem factors. The two 
control groups were required to isolate the 
effects of esteem-threatening failure. 

The same measures of impulsive closure 
were used as in the preceding experiment and 
the same measures of personality characteris- 
tics were also used with two additions. The 
specific hypotheses were as follows: 


Effect of Failure-Induced Threat 


Failure produces more impulsive closure 
than either success or failure on a noninvolving 
task. 


Effects of Personality Characteristics 


Self-esteem. In a given situation without im- 
mediate strong situational support of self- 
esteem, persons with lower general self-esteem 
show more impulsive closure than those with 
higher general self-esteem. Aside from situ- 
ational influences on self-esteem that may 
affect closure, persons with lower general self- 
esteem are presumably constantly seeking 
whatever defensive supports may be available 
and are more likely to resort to closure, when 
circumstances provide opportunity than per- 
Sons with higher general self-esteem. Dif- 
ferences attributed to general personality 
characteristics are expected only in the ex- 
perimental condition and not in the control 
Conditions, because in the latter, self-esteem 
for all persons is at least temporarily elevated. 
Differences attributable to personality charac- 
teristics that persons bring with them into the 
experiment are likely to show up only when 
the situation is depriving of immediate support. 

Valuation of achievement. In a situation in 
which achievement motivation is frustrated, 
Persons who attach strong value to achieve- 
ment show more impulsive closure than per- 
Sons weak in achievement valuation. Persons 
with stronger achievement motives presumably 
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are more threatened by such frustration and 
therefore more motivated to seek compensating 
satisfaction, such as closure. 


Interaction of Threat and Personality 


Self-esteem. The predicted effects of failure 
are likely to occur only among persons with 
low general self-esteem. These are the persons 
who are vulnerable to such manipulation. 
Persons with high self-esteem presumably have 
other resources and are less threatened, if at 
all, by a single situation of failure. 

Valuation of achievement. The predicted 
effects of failure are likely to occur only among 
persons with high value on achievement. The 
same reasoning applies. These are the persons 
who are vulnerable to such manipulation. 
Persons who place less value on achievement 
presumably depend upon other resources for 
their self-esteem and would be vulnerable to 
the manipulation of them but not to the manip- 
ulation of achievement. 


MxrrHOD 
Subjects 
The experiment was conducted, without prior 
announcement, during a regular class meeting of a 
required course at the Vale Divinity School. Fifty- 
seven students completed the experimental materials. 
To minimize any effects of interpersonal interaction 
and anticipated opinion of others, all work was 
anonymous, so far as the subjects knew, and all 
materials were self-administered, after a brief general 
introduction by the instructor. The experimental 
materials, which were distributed at one time in a 
folder, were represented as a series of unrelated tasks 
at which subjects were asked to work privately and 
anonymously. Assignment to experimental conditions 
was determined by the way in which the shuffled set of 
folders chanced to be distributed. 


Experimental Manipulations 

Subjects took Items 1-10, 31-40 of the Space 
Relations Test published by the Psychological Corpora- 
tion. This test is difficult enough to prevent subjects 
from having much subjective certainty about the 
quality of their performance and it is therefore open to 
manipulation of reported performance results, and 
thus of success or failure. The test is also abstract 
enough to allow it to be represented as measuring 
anything; this provided opportunity for the ego involy- 
ing manipulation. A special answer sheet was devised, 
so that answers were recorded, by carbon, onto a keyed 
sheet so as to permit self-scoring after the seal was 
broken. Anonymity was emphasized by repeated 
written instructions during the procedures that subjects 
should not write names on their materials and that 
the experimenter’s interest was only in group averages 
not individual results. 
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Ego involving manipulation. Subjects assigned to the 
success and the ego involved failure conditions found 
written instructions describing the Space Relations 
Test as one measuring, “skills of abstraction ... of 
central importance in almost all occupations, especially 
in the professions . . . highly related to personal effec- 
tiveness and professional success.” Subjects who 
chanced to be assigned to the noninvolved failure 
condition found written instructions describing the test 
as “valid in predicting professional success with only 
two occupations—stagehands and certain kinds of 
architects” but indicating that for exploratory purposes 
it was desired to discover the average level of per- 
formance among the present sample. 

The effectiveness of these manipulations is clearly 
indicated by the first two lines of Table 1. On a ques- 
tionnaire administered at the end of the experiment 
subjects in the noninvolving condition reported trying 
significantly less hard to do well on the Space Relations 
Test than subjects in the ego involving conditions 
reported. Interestingly enough, their actual per- 
formance on the test was significantly superior, in- 
dicating presumably the fruits of their more relaxed 
approach. The second line of Table 1 reports the actual 
average performance on the Space Relations Test—as 
subsequently scored by the experimenter—to be dis- 
tinguished from the results that subjects believed that 
they had attained. 

Success-Failure manipulation. Fictitious keys and 
norms were used to affect the subjects’ feelings of having 
performed well or poorly on the Space Relations Test. 
For subjects assigned to the success condition, a special 
answer key was devised, to maximize the number of 
apparently correct responses. A fictitious set of norms 
was provided so that all subjects would find their scores 
ranging among the high percentiles of graduate stu- 
dents, For subjects in the failure conditions, a false key 
was provided indicating that many of the most common 
responses were incorrect. A set of norms was also 


James E. DrrrES 


devised in such a way as to make all subjects range in 
the lower percentiles as compared with candidates for 
bachelor degrees. 

To guarantee that subjects would inspect the norms 
carefully, a form was provided on which they were 
required to record their own score and percentile. 

The middle section of Table 1 indicates that these 
manipulations were highly effective. On three items on 
the final questionnaire, success subjects reported 
that their results were above average, higher than they 
had expected, and pleasing, while subjects in both 
failure conditions reported their results were below 
average, lower than they expected, and disturbing. 

Evidence of net effect on self-esteem. The pair of 
manipulations just described was intended to produce 
one condition (ego involving failure) producing a 
threat to self-esteem and two control conditions, Two 
items on the final questionnaire asked for the subjects’ 
reports on this net effect, one a direct query and one 
indirect. The results, reported at the bottom of Table 
1, seem to indicate that a satisfactory difference was 
achieved, Both nonthreat conditions reported that 
their results had been less on their minds through the 
subsequent part of the experiment than did the ego 
involved failure group. On the direct question, success 
subjects reported self-esteem mildly enhanced by 
performance on the Space Relations Test and ego 
involved failure subjects reported self-esteem mildly 
weakened. The noninvolved failure group reported an 
average score near the ego involved failure group but 
still substantially different (p = .06). 


Measures of Impulsive Closure 


Parable task. Subjects were presented with a one- 
paragraph story, written in Biblical idiom and repre- 
sented as a portion of a recently discovered scroll, and 
asked to write their opinions concerning the meaning of 
the parable. The account was essentially incoherent 


TABLE 1 
EVIDENCE OF EFFECTIVENESS OF EXPERIMENTAL MANIPULATIONS 
Average score in each condition® 
TI 
Manipulation and questionnaire item Control Sra, P5 ? 
Success Failure Failure 
Boe) UC ad, inte 

Ego involvement i 

How hard did you try on test? 5.0 4.6 4.8 Noninvolved | <.05 

Score on Space Relations Test 28.9 33.2 27.2 ys <.05 

y Ego involved 

Success-Failure 

Test results above or below average? 2.4 5.8 <.001 

Test ee higher or lower than ex- 2.5 5 ey Success <.001 

pecte MER 

Pleased or disturbed by test results? 2.4 4.8 4.9 Failure «.001 
Net threat 

How much did test results stay in mind? 2.8 P 3.5 Control .05 

"Test results affect self-esteem? 25 4.3 4.7 Thre i <.001 

eal 


a Questionnaire responses based on six-point scales. Maximum possible score on this form of Space Relations Test was 54. 
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and unstructured. However, it contained a large number 
of familiar religious symbols that could, with some 
selection and distortion, be forced to yield a coherent 
meaning, though no two subjects ever found the same 
meaning. Responses were later scored on a three-point 
scale, according to whether each was a positive state- 
ment of meaning or indicated uncertainty as to mean- 
ing. The higher score represented more impulsive 
closure. 

As for both tasks, the scoring categories were defined 
in advance of the experimental sessions, and the scoring 
was done by two judges working independently without 
knowledge of the subjects! performance on other tasks 
or of the experimental conditions to which the subjects 
had been assigned. The judges showed absolute agree- 
ment in 91% of these judgments. 

"This task was presented as entirely unrelated to the 
Space Relations Test. The rationale was offered that 
performance on this task would help discussion leaders 
in another course better prepare their sessions. 

Roommate task. Subjects were given two lists of 
traits that were represented as words used by close 
friends to describe two students; they were instructed 
to write their impression of each man. Each list con- 
tained a dominant set of traits and a subordinate set 
that was incompatible with the dominant set. In one 
instance, for example, the list was: individualist, 
suspicious, belligerent, considerate, self-concerned 
thoughtful, stubborn, tolerant. The two lists were 
balanced so that dominant traits in one were favorable, 
in the other unfavorable. 

Responses were judged on a three-point scale 
according to the extent to which the subordinate traits 
were integrated into the impression. A statement 
based exclusively on the dominant traits was regarded 
as the most impulsive. There was absolute agreement 
between the judges on 71% of the judgments. Differ- 
ences between judges were averaged. 

This task was represented as unrelated to other 
parts of the experiment, and as exploring one part of a 
procedure by which roommates might be selected. 

Combined index? Since both tasks were generated 
from the same rationale to measure a single behavioral 
tendency, the scores for each task were combined to 
form a more reliable over-all index of closure strength 
for each subject by summing the scores after weighting 
them by factors to equalize their variance. 


Personality Measures 

Several weeks before the experiment, subjects com- 
Pleted a self-rating questionnaire that included items 
comprising two different scales. 


The third task used in the earlier experiment 
(Dittes, 1959) proved to be unsuitable for large group 
Administration, at least in the acoustics of the available 
Toom. This is the “jammed message" task which 
Tequires subjects to listen to a recorded tape. Per- 
formance with a large group was so highly related to 
position in the room as to swamp any possible differ- 
ences due to experimental conditions. 

Acoustics were probably most homogeneous for 
subjects seated in the back two-thirds of the room. 
Considering just this group, subjects in the ego involved 
failure condition performed more impulsively than 
Subjects in the success condition (p < .05) 
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Self-esteem. This scale was comprised of 12 items 
such as: 

I feel nervous if I have to meet lots of people. 

I am usually not sure that others understand what 

Isay. 

Valuation om achievement. Six items assessed the 
general importance that subjects attached to personal 
achievement. Typical items were: 

Successful accomplishment of a task is important 
whether others know of it or not. 

Once I begin a task, I can't give up until I have 
mastered it. 

Space relations aptitude. The subjects’ actual per- 
formance on the Space Relations Test provided an 
estimate of general intellectual ability. This test is 
described by its publishers as measuring the "'ability to 
manipulate things mentally,” and correlates in the 
.50s with indices of general intelligence such as the 
ACE Psychological Examination. 

Because of the false scoring key and norms used by 
the subjects, the actual test results are, of course, much 
higher and imperfectly correlated (r = .82) with the 
scores the subjects in the failure groups believed they 
had obtained. There is nevertheless the possibility that 
this measure is contaminated with the degree of failure 
experienced by subjects, those scoring lower perhaps 
experiencing more failure. However, with the false 
norms used by the subjects, all fell between 10th and 
30th percentiles of college undergraduates so that there 
was relatively little room for individual differences in 
experienced failure, compared with the massive over-all 
effect. The subjectively experienced failure of a person 
falling at the 30th percentile should be at least as great 
as that of a subject scoring at the 10th percentile, 
because the former is presumably accustomed to higher 
scores. 

Dichotomisalion of distributions. In order to assess 
the interaction between the experimental conditions and 
the personality characteristics, the scores on the dis- 
tribution of each of these measures were dichotomized. 
As in the previous experiment, scores on the self-esteem 
distribution were divided into thirds and the higher 
third compared with the lower two-thirds. The other in- 
dices were divided at the median and subjects in the 
upper and lower halves of the distribution compared. 

Table 2 presents the split-half reliabilities of these 
scales and the correlation among them. None of the 
three measures is significantly related with either of the 


others. 
RESULTS 


Effects of Failure-I: nduced Threat 


Table 3 presents the impulsive closure scores 
on each of the tasks and for the combined 
index. The threatened subjects (ego involved 
failure) were more impulsive on each task 
than the subjects experiencing success. The 
noninvolved failure control group tended to 
be impulsive on the parable task but not on 
the roommate task, an inconsistent perform- 
ance to be discussed later. 
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TABLE 2 


INTERCORRELATIONS AND RELIABILITIES OF 
PERSONALITY MEASURES 


Intercorrelations 


Relia- 


Valuation Space 


on achieve- Relations _Dility 
ment Test 
Self-esteem +13 .01 .16 
Valuation on -—.20 .46 
achievement 
TABLE 3 
AVERAGE IMPULSIVE CLOSURE SCORES 
Ri - " 
Parable task mate Sen 
"Threat condition: 
Failure ego in- 1.9 2.0 2.0 
volved 
Control conditions: 
Success, ego in- 1.6 1.9 1.8 
volved 
Failure, nonin- 1.9 1.8 1.8 
volved 
p Threat vs. Con- «.05 «.05 «.05 
trols* 


A Mann-Whitney U test (two-tailed) used throughout this 
report. 


Effects of Personality Characteristics 

Self-esteem. As predicted and as shown in 
the third column of Table 4, for the experi- 
mentally threatened condition low self-esteem 
subjects tended to show more impulsive closure 
than subjects with higher self-esteem. In the 
control condition no effects of differing self- 
esteem were predicted, because subjects were 
experiencing strong enough temporary support 
to make all subjects’ self-esteem homogene- 
ously high. 

Valuation on achievement. Subjects with 
stronger valuation on achievement reacted to 
the experimental failure condition with more 
impulsive closure than did subjects with less 
achievement value. These results appear in the 
third column of Table 5. The correlation be- 
tween valuation on achievement and impulsive 
closure scores in this condition was .42 
(p < .10). 

Space relations aptitude. Subjects with less 
of the kind of intellectual ability measured by 
the Space Relations Test also reacted more 
strongly to the experimental failure manipula- 
tion and produced more impulsive closure than 
subjects scoring higher on the Space Relations 
Test. These results appear in the third column 
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TABLE 4 
AVERAGE SCORES ON COMBINED INDEX OF 
CLOSURE FOR SUBJECTS WITH HIGH 
AND Low SELF-ESTEEM 


Control conditions | „Threat | Control vs. 
Self-esteem " x. "E 
uccess | Failure | Failure | piger. 
inn Gea) innere) niea | ence | # 
Low TUIS | 158 2.2 4 | .02 
High 1.8 | 1.8 1.9 1 | nsj 
ifierence —-4|0 3 
p value of <.10 
difference 
TABLE 5 


AVERAGE ScORES ON COMBINED INDEX OF 
CLOSURE For SUBJECTS WITH HIGH 
AND Low VALUATION OF 


ACHIEVEMENT 
Control conditions RENTEN CODES TE 
eras oe 
ievemen! ees Nn" Gee Dilfer 
involved) | involved | involved) 
High 1.8 1.8 2.3 5 | .02 
Low 17 1.9 | 2.0 2 n 
Difference Iv —-.1 3 
value of .05 
difference 
TABLE 6 


AVERAGE ScORES ON COMBINED INDEX OF 
CLOSURE For SUBJECTS WITH Low 
AND HIGH PERFORMANCE 
ON SPACE RELATIONS 


TEST i 
1 vs. 
Control conditions | condition | ^ iret 
Space relations 
peronmance |. reos | Failure | Failure | Diger 
n- go 
involved) Savaro) involved) | °°° 
E ENEE ZTA, 3| .02 - 
igh 1.9 1.7 1.7 —.1| ns 
Difference E ‘4 5 4 
p value of «05 
difference 


of Table 6. Correlation between Space Rela- 
tions Test scores and the index of impulsive 
closure was .41 (p < .10) within this exper- - 
mental condition. 


Interaction of Threat and Personality 


Persons with low self-esteem, strong valu- 
ation on achievement, and low space relations 
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ability were by far the more vulnerable and 
more responsive to the experimental manipula- 
tions of failure. The effects of the experimental 
manipulation of threat occurred almost ex- 
clusively among these subjects, as seen in the 
top lines of Tables 4, 5, and 6. Subjects with 
high self-esteem, or low valuation on achieve- 
ment, or high space relations ability tended to 
be immune to the experimental manipulation 
of failure. 
DISCUSSION 


Two different experimental manipulations 
have thus produced highly similar patterns of 
results. Low regard by peers (Dittes, 1959) 
and, in the present experiment, an experience 
of failure, even though anonymous, both re- 
sulted in the tendency for subjects to form 
positive and easy conclusions on varied types 
of conceptual tasks. The pair of results seems 
most easily accounted for by the assumptions 
(a) that low regard by peers and experience of 
failure both damage self-esteem, and (b) that 
cognitive closure has acquired the capacity 
to enhance self-esteem. 

To seek out closure impulsively as a com- 
pensating source of self-esteem, then, becomes 
à defensive maneuver induced by the threat 
or damage experience. Finding certainty and 
clarity is reassuring, and when self-esteem is 
threatened, this defensive device may be called 
upon, if circumstances provide opportunity. 

This interpretation is supported by the 
pattern of interaction between the experi- 
mental manipulations and the personality 
characteristics that people brought with them 
to the experiment. In each experiment, sub- 
Jects in the lowest two-thirds of the self-esteem 
distribution were the ones who were vulnerable 
to the manipulation of threat. Persons scoring 
high on a self-esteem questionnaire presumably 
have other resources of self-esteem that they 
can readily recall or rehearse under conditions 
of threat or danger—this may be all that is 
Tepresented by a high score on such a question- 
naire. They are less dependent for self-esteem 
on the esteem of peers, on specific experiences 
of success, or on such defensive devices as 
impulsive closure. Good performance on the 
Space Relations Test seems best interpreted 
as providing another index of general abilities 
and other resources that can nourish a gen- 
erally high self-esteem. Therefore it is con- 
Sistent to find the identical pattern of inter- 
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action with the Space Relations Test as with 
the measure of self-esteem. 

The index of valuation of achievement pre- 
sumably represents a measure of the degree 
to which persons depend for their esteem on 
experiences of personal success and achieve- 
ment of the kind involved in the experimental 
manipulation. Subjects scoring low on this 
measure are presumably more reliant on other 
sources for their self-esteem and less vulnerable 
to withdrawal of this source. 


Possible Limitations of Sample 

The sample of graduate students for this 
experiment was chosen as one which was ex- 
pected to be particularly vulnerable to manipu- 
lations of failure, that is, a group with a gen- 
erally high achievement value. Although there 
is no decisive independent evidence confirming 
this assumption, subjects achieved an average 
score of 4.2 on the six-point scale measuring 
valuation of achievement. All subjects rated 
themselves as above average on this scale; no 
subject achieved an average score of 3 or less. 
The generality of the results is therefore limited 
to subjects with correspondingly high achieve- 
ment value. But this limitation is precisely 
what was predicted in the first instance and 
demonstrated in the interaction results in 
Table 5. 

The argument might be advanced that the 
employment of impulsive closure as a defensive 
response to threat may be limited to such a 
sample of graduate students. This seems un- 
likely for the following reasons. The pressures 
in the academic environment generally tend to 
punish impulsive closure and to reward de- 
layed closure of higher quality. Witness the 
stereotype of the egghead who can never make 
up his mind about anything. It seems more 
likely that the esteem enhancing properties of 
closure have been learned much earlier than 
college days and that the defensive use of im- 
pulsive closure takes place in spite of, rather 
than because of, the fact that these subjects 
are graduate students. 


Low Correlation between Tasks 


The fact that the two tasks used to measure 
impulsive closure have such a low correlation 
(r = .10) presents a serious problem, even 
though considering group averages they tend 
to show similar consequences of the experi- 
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mental procedures. The low correlation may be 
attributed to the fact that for each of the speci- 
fic tasks, peculiar skills, interests, or back- 
ground (in judging prose passages or in judging 
prospective roommates) are elicited and tend 
to override the experimental effects for a few 
individual subjects. 

It is more likely, however, that closure has 
not been clearly enough defined conceptually 
so as to generate equivalent operations. The 
two operational measures may represent some- 
what different variables. For example there 
may be an important distinction between 
what may be called perceptual closure and 
achievement closure—between the relatively 
passive recognition and assurance that there 
is order and system and clarity in the environ- 
ment around one and the experience of pro- 
ducing for oneself clearcut cognitions and solu- 
tions out of obvious initial ambiguity. The 
roommate task may largely represent the 
former and the parable task the latter. The 
roommate task may be approached and com- 
pleted without ever feeling that it is a “prob- 
lem” to be resolved. The parable task tends to 
present a challenge to the subject to see 
whether he can discover and fashion clear 
meaning. 

In this light, it is significant that only the 
parable task, not the roommate task, interacts 
with the measure of achievement valuation. 
There is a strong interaction between the 
measure of valuation of achievement and the 
experimental conditions for this task, as seen 
in Table 7. The roommate task shows no such 
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trend. Table 7 also shows that this difference 
does not hold for the other important inter- 
actions. In the case of self-esteem and the 
Space Relations Test, both tasks tend to show 
similar trends, although in no case do they 
achieve the same level of significance as when 
they are combined, and the parable task tends 
to show slightly stronger trends. 

The noninvolved failure control group per- 
formed inconsistently. They reported, on the 
direct measures, that they were disturbed and 
felt threatened by the manipulations but that 
this was not continually on their minds, as it 
was for the ego involved failure group (Table 
1). On the parable task this control group 
showed as much impulsive behavior as the 
threat group. But on the roommate task they 
showed the least impulsive behavior (Table 3). 

This greater variability of the group per- 
formance very likely simply reflects the greater 
variability of individuals within this group: 
on every measure obtained during the experi- 
mental session, it showed more variability than 
either of the others. This is presumably due to 
the greater ambiguity of the manipulations, 
in which some individuals would feel the threat 
of failure keenly and tend to ignore the non- 
involving instructions, whereas other indi- 
viduals would recall eagerly the noninvolving 
instructions and feel no threat. 

There is a possibility, however, that the re- 
assurance of the noninvolving instructions may 
have a delayed effect. The parable task was 
for all subjects the first one performed after 
the experimental manipulation, so that there 


TABLE 7 
CLOSURE SCORES ON PARABLE AND ROOMMATE TASKS 
Parable task Roommate task 
SP M Control Threat ^ Control Threat ; 

ù 3 Control y R Control 
duree | Gale | Falae ree Saee | Failure | aitare | ya Prent 
volved) | volved) | involved) involved) | volved) | volved) 

Self-Esteem Low | 1.6 | 1.7 22 .10 1.9 1.9 2.4 | «10 
£ High 1.6 1.6 1.7 2.0 1.6 24 
„p (Low vs. High) — .10 
Valuation of achieve- High 1.8 1.6 2.6 <.02 1.9 1.9 1:7 
ment 1 Low 1:5 2.0 1.8 2.0 1.8 2.0 
p (High vs. Low) .07 
Space Relations Test Low 1.8 2:3 2.2 <.15 1.6 2.1 2.0 «45 
3 igh 1.6 1.8 1.5 2:2 1:5 2.0 
p (Low vs. High) «.05 <.02 <.02 


ar. 
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is a confounding of task and order. The ex- 
perience of failure may have produced an im- 
mediate sense of disturbance and threat and 
the comforting thoughts that this test was 
valid only for stagehands and architects may 
have occurred only after an interval of time. 

Notice, in Table 1, that these subjects re- 
port a strong sense of immediate threat (Lines 
5 and 7) but that the threat did not stay on 
their minds throughout the experiment (Line 
6). The parable task may have been performed 
before the noninvolving instructions took 
effect psychologically. 


SUMMARY 


Some persons impose meaning and pattern— 
closure—on ambiguous stimuli more readily 
than other persons. To account for such dif- 
ferences, the hypothesis is proposed that 
closure generally acquires a self-esteem en- 
hancing reward value and that persons ex- 
periencing greater need to maintain or enhance 
self-esteem are therefore likely to impose more 
impulsive closure. 

Self-esteem was manipulated by causing 
some subjects to experience failure on an ego 
involving task and others (two control groups) 
to experience failure on a noninvolving task 
or to experience success. Ego involving failure 
resulted in more impulsive closure as repre- 
sented by tendencies (a) to find a positive 
meaning in an essentially incoherent prose 
Passage rather than to acknowledge its in- 
coherence, and (b) to base an impression of 
another person exclusively on more prominent 
traits and to ignore inconsistent traits. 

Consistent with the self-esteem hypothesis, 
the success-failure manipulation did not affect 
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closure equally for all subjects. Only the sub- 
jects low in self-esteem, low in a type of in- 
tellectual ability, or high in valuation of 
achievement were vulnerable to the success- 
failure manipulation and responded to it with 
the predicted differences in closure behavior. 

These findings correspond exactly to those 
of a previous experiment in which self-esteem 
was manipulated by affecting the subjects’ 
sense of acceptance by peers. 
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Ing work of McClelland, Atkinson, and 

their collaborators (Atkinson, 1958; 
McClelland, Atkinson, Clark, & Lowell, 

1953) in developing motive measures and in 
identifying correlates of these measures would 
seem to have some practical implications for 
improving predictions of certain aspects of 
individual performance in the industrial 
setting. This study was undertaken as the 
first phase of a research program designed to 
test a number of hypotheses regarding ex- 
pected relationships between measures of 
motivation and certain attitudinal or be- 
havioral variables in an industrial population. 
The measurement of “need for achieve- 
ment" (n Achievement), upon which a good 
deal of attention has been focused recently, 
would seem to be particularly pertinent in the 
competitive environment that characterizes 
many industrial roles. The fact that previous 
research has demonstrated a relationship be- 
tween achievement motivation and risk taking 
behavior increases the possible contribution 
of this kind of research to the understanding 
and prediction of industrial role performance. 
Atkinson (1957) presents a theoretical model 
showing that individual behavior in a situation 
of competitive achievement involving risk 
can be accounted for by the interaction of the 
“motive to achieve success" and the “motive 
to avoid failure." By assuming that motiva- 
tion to take a particular course of action is a 
multiplicative function of (a) the strength of 
the motive, (b) the expectancy (subjective 
probability) of success, and (c) the value of the 
incentive, and by making the further assump- 
tion that in a situation involving risk the 
amount of satisfaction inherent in success is 
an inverse function of the strength of the ex- 
pectancy (probability) of success, Atkinson 
demonstrates that a risk situation where 
chances of success are approximately 50-50 
is most attractive to the person who is strongly 
motivated to achieve. The individual who is 


primarily motivated by the need to avoid 
failure, on the other hand, finds a risk situation 
where odds of success are approximately 50-50 
most unattractive. 

Several research studies provide empirical 
support for the hypotheses based on Atkinson's 
(1957) theoretical model. McClelland (1958) 
found in a study of school children that the 
tendency to prefer moderate rather than ex- 
treme risks on a ring toss game was significantly 
related to scores on a graphic measure of n 
Achievement. Atkinson and Litwin (1960) 
found in a study of college students that pref- 
erence for intermediate level risks on a ring 
toss game was related to high n Achievement 
scores on the French Test of Insight, and 
preference for extreme risks was related to 
high scores on a Test Anxiety questionnaire. 
Atkinson, Bastian, Earl, and Litwin (1960), 
in a similar study of college students, found 
high n Achievement scores on the French Test 
of Insight were significantly related to the 
preference for intermediate risks on a shuffle- 


‘board game, and the preference for inter- 


mediate odds of winning over extreme odds 
of winning in a paper-and-pencil imaginary 
betting questionnaire. Litwin (1958) found in 
an experimental study of college students that 
high n Achievement and low test anxiety were 
related to preference for intermediate risks on 
a series of tasks including ring toss, penny 
pitch, a horse race game, and a series of pent 
maze puzzles. 

McClelland (1957) has argued that high 
achievement motivation should be associated 
with successful entrepreneurship. He has 
shown, for example, that the rate of economic 
growth of a number of different countries over 
the period from 1925 to 1950 was significantly 
correlated with the degree to which need for 
achievement themes were found in stories in 
widespread use in children's books in the re- 
spective cultures. He has also demonstrated 
that certain personality characteristics usual y 


570 


ENTREPRENEURSHIP: MOTIVE PATTERNS AND RISK PREFERENCES 


associated with the entrepreneurial role, such 
as willingness to take risks and sense of re- 
sponsibility, characterize the person who is 
high in achievement motivation (McClelland, 
1961). 

More recently, McClelland initiated a series 
of studies to compare the measured n Achieve- 
ment of persons who have demonstrated suc- 
cess in jobs that are definitely entrepreneurial 
in character with that of persons in definitely 
nonentrepreneurial jobs. One of the reasons 
for undertaking the investigation reported here 
was to provide additional data for that series 
of studies, 


METHOD 
Selection of Groups 


The study design required that measures of certain 
motives and risk preferences be obtained from persons 
in jobs with definite entrepreneurial characteristics 
and from a comparable sample of persons in jobs that 
are definitely nonentrepreneurial. An entrepreneurial 
job was defined as follows: 

The entrepreneur should have more responsibility 
for initiating decisions, rather than merely having to 
make decisions when presented with problems. 

5 He should also more often have individual responsi- 
bility for decisions and their effects—that is, without 
the review or assistance of a supervisor or a committee. 

The entrepreneur should get objective feedback of 
accurate data indicating the success of his decisions, 
such as in sales volume, profitability, reduction in 
errors, complaints, etc., rather than just a general 
subjective evaluation of his success by the group or by 
superiors. 

This type of feedback also means that the job entails 
more risk or challenge in that there is more chance ofa 
serious wrong decision being observed. 

The entrepreneurial and nonentrepreneurial groups 
were to be matched on the following variables: salary 
or status, age, years of education, and length of com- 
pany service. If these factors were not held constant, 
differences that might be found on the experimental 
measures could be attributable to differences in age, 
education, or status. 

Managers of shop operations in manufacturing 
plants were selected to represent the “entrepreneurial 
role.’ ‘These managers had complete responsibility for 
administering specific parts of the manufacturing 
Operation. In addition to the production employees in 
their groups, these managers had supporting personnel 
for $ such functions as scheduling, manufacturing 
eee and quality control reporting directly to 
them. Thus, in effect, each managed a fairly complete 
little business. 

For each Manager in the sample a staff “Specialist” 
Was chosen whose job seemed to have a minimum 
amount of entrepreneurial characteristics, and whose 
home level, age, educational background, and length 
of service was approximately the same as the respective 
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TABLE 1 


CHARACTERISTICS OF THE MANAGERS AND 
SPECIALISTS PARTICIPATING IN THE STUDY 


Years 


Education | Position 
service 


Age (Years) | level 


ue | nance | ar | Range| ar | Ẹ | ar rane 


Managers  |42.1| 28-60 |12.6| 8-17 | 8.5 |6-10)18.9) 3-34 
(WN = 31) 

Specialists |42.7| 27-59 |13.2 10-17 | 7.7 |5-10]18.1| 3-42 
(N = 31) 


® The position level refers to a company salary plan system. 
The difference shown would indicate that the salaries for the 
Managers were, on the average, about 6 or 7% higher than those 
of the Specialists. 


Manager. Table 1 presents the means and ranges on 
these variables. 

On the average, the two groups were almost. identical 
in age and years of service, but the Specialists were 
slightly higher in education and slightly lower in 
position level! 


Experimental Measures 

Each participant completed a short Risk Preference 
Questionnaire and wrote brief stories to six thematic 
apperception pictures. The pictures were selected by 
McClelland specifically for a study of businessmen. 
They are described in ‘Atkinson (1958) (Recommended 
Multiple Purpose Set A, Appendix IIT). The stories 
were scored for n Achievement, n Affiliation, and n 
Power by experienced scorers who had established rank- 
order scoring reliabilities of .90 or better prior to scoring 
the protocols for this study.* 

The Risk Preference Questionnaire consisted merely 
of a list of pairs of betting choices with instructions to 
choose the preferred alternative. The odds ranged from 
one chance in six to six chances out of six of winning, 
with hypothetical rewards presented for each alterna- 
tive at an amount related directly to the size of the 
odds. The test was scored by comparing the number of 
times the extreme risk options (1/6 and 5/6) were 
chosen to the number of times the intermediate risk 
options (2/6, 3/6, and 4/6) were chosen. 


1The mean scores in Table 1 do not, of course, 
indicate the accuracy with which individual matches 
were made on the variables. Since there was a rather 
limited number of individuals available in any com- 
ponent for matching, compromises were necessary to 
get reasonably close matches on all four variables. The 
mean discrepancies in the variables for the individual 
matches were 4.68 years for age, .90 years for education, 
1.06 in position level, and 5.42 in years of service. The 
distributions of discrepancies, however, were a little 
skewed, with most of the differences below the means. 
In each case a few very large discrepancies which were 
unavoidable inflated the averages. 

2We wish to acknowledge the assistance of Ann 
Litwin who scored the protocols for n Achievement and 
n Power. The protocols for n Affiliation were scored by 


George Litwin. 
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Interviews 

Each participant was subsequently interviewed to 
obtain an indication of his attitudes toward the entre- 
preneurial aspects of the Manager position. For the 
Specialists, the interviewer also attempted to determine 
to what degree the man’s own job had entrepreneurial 
characteristics. Entrepreneurial characteristics con- 
sidered were: the degree of objective risk the position 
entailed—that is, the chance to make a serious wrong 
decision, the result of which would be observable from 
objective indicators; responsibility for initiating 
decisions; and the objectivity of results measures fed 
back to the position incumbent. At the termination of 
the interview, the interviewer rated the degree to which 
that participant’s attitudes were positive or negative in 
relation to these characteristics, and for the Specialists, 
he also rated the degree to which the man’s job seemed 
to have these entrepreneurial characteristics. In addi- 
tion, the interviewer also questioned each participant 
regarding his “level of aspiration"—the ultimate 
position or status for which he was striving in his career. 


Management Ratings of “Achievement Drive” 


Managers from two levels above the subjects in this 
study were asked to rank the participating Managers 
and Specialists on the degree to which they had demon- 
strated "Achievement Drive" on their jobs. Specifically, 
they were asked to indicate: “How strongly does the 
man seem to be motivated to do a good job? Does he 
identify with the job, work hard at it, take pride in 
doing the work, strive to improve, etc.?” 

"The ranks assigned to each man were translated into 
standard scores on a 10-point scale, The ranks given to 
any one man by the various raters were then averaged 
and rounded off to the nearest whole number to obtain 
the summary rating statistic used in the data analysis. 
Since the Managers and Specialists were in four different 
departments, each rater considered approximately eight 
participants in each of the two groups. Each participant 
was rated by at least two higher level managers, and 
some by more. The split-half estimate of reliability 
(half the judges vs. the other half on the same partici- 
pants) was .77. 


RESULTS 


Motive and Risk Scores for Managers and 
Specialists 

Table 2 presents differences found on the 
motive scores and the Risk Preference Ques- 
tionnaire for the Managers and Specialists. 
Significant differences (by Wilcoxon matched- 
pairs signed-ranks test) were found between 
the groups on n Achievement and in the 
tendency to choose intermediate rather than 
extreme odds on the Risk Preference Ques- 
tionnaire.* 

*One-talled tests were used in computing the 


significance of these differences since the direction of the 
differences was predicted. A two-tailed test was used in 
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TABLE 2 
MEAN SCORES OF MANAGERS AND SPECIALISTS 
ON THE MOTIVE MEASURES AND THE 
RISK PREFERENCE QUESTIONNAIRE 


Managers Specialists 

Motive measures 

n Achievement 6.74 4. TT** 

n Affiliation 3.42 2.45 (ns 

n Power 6.39 6.45 n) 
Risk Preference Ques- 

tionnaire 
Percentage preferring 52 32* 


intermediate odds: 


* Difference between matched pairs significant at .05 level, 
** Difference between matched pairs significant at .025 level, 


Motivation and Entrepreneurial Attitudes 


None of the correlations between the inter- 
viewers’ ratings of the Managers’ and Special- 
ists’ attitudes toward the entrepreneurial as- 
pects of the Manager position and motive 
scores were found to be statistically significant. 
A significant correlation was found, however, 
between a composite “entrepreneurial atti- 
tudes” score and observed “achievement drive” 
of the managers, as rated by their superiors 
(.42).4 

For the Specialists’ group only, correlations 
were also computed between the interviewers’ 
ratings of the degree to which the Specialist’s 
job seemed to have entrepreneurial charac- 
teristics and scores on n Achievement and the 
Risk Preference Questionnaire. These corre- 
lations (.01 and .12, respectively) were not 
significantly greater than zero. 


Other Relationships 


Tntercorrelations of the predictor and control 
variables for the entire participant groups re- 
vealed few significant correlations, The n 
Affiliation measure was found to correlate 
negatively with age (—.35) and positively 
with education (.28), management ratings of 
achievement correlated negatively with age 
(—.33), and, as expected, age and education 


considering the differences in n Affiliation since 4 
difference was not predicted. Although a difference was 
expected in n Power, it is obvious that none was found. 

* Scores of participants on the motive measures and 
Risk Preference Questionnaire were withheld from the 
interviewers, and management ratings of the partici- 
pants were not obtained at a plant until the interviewing 
had been completed, in order to insure that correlations 
between data gathered from different sources would not 
be contaminated. 
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were found to be negatively correlated (—.64). 
Probably of greater interest is the fact that 
n Achievement and risk preferences were not 
found to be significantly correlated with each 
other (—.01). 

Analysis was also made of the relationships 
between the interviewers’ ratings of each 
participant’s “level of aspiration” and some 
of the other variables. Rated level of aspira- 
tion was found to be significantly correlated 
in a negative direction with age (— .48) and 
positively with education (.59), as might be 
expected. The correlation with n Achievement 
was also significant (.33), but if the effects of 
age and education are partialed out, this cor- 
rdation is attenuated. Actually, level of as- 
piration ratings resulted in a skewed distribu- 
tion, since almost half the participants (14 of 
the Managers and 14 of the Specialists) ex- 
pressed no aspiration for a position beyond 
the level of the one they now held. 


DISCUSSION 


While the results of this exploratory study 
are encouraging, they still leave some ques- 
tions unanswered. Two of the major hypotheses 
were verified, but other differences or relation- 
ships that had been expected were not found. 
Differences were found between the Manager 
group, selected to represent an entrepreneurial 
role, and the Specialists, whose jobs had a 
minimum of entrepreneurial characteristics, 
on n Achievement and preference for inter- 
mediate risks. These differences, however, 
were not at as high a level of significance as 
might be desirable. Furthermore, these vari- 
ables did not correlate with attitudes of the 
subjects toward the entrepreneurial charac- 
teristics of a job. This negative finding may 
well be due, however, to the fact that these 
were relationships within groups where sample 
variances were too small to yield substantial 
correlations. The small samples and possible 
shortcomings in the measures could, of course, 
also have contributed to the lack of significant 
correlations. 

Shortcomings in the two measures of achieve- 
ment motivation could also account for the 
fact that they were found to be uncorrelated. 
Atkinson and his associates (Atkinson et al., 
1960) in a study employing the same risk 
measure as was used here, also found that 
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preference for intermediate risks was not sig- 
nificantly correlated with n Achievement. In 
the same study, however, they did find a sig- 
nificant correlation between n Achievement 
scores and preference for intermediate risks on 
a game of skill. 

Risk taking behavior in a pure gambling 
situation may be a better indicator of fear of 
failure motivation, moreover, than of achieve- 
ment or success oriented motivation. In this 
study the risk questionnaire was scored by 
combining preferences for extreme risks (1/6 or 
5/6) and comparing this countwith preferences 
for the intermediate level risks (2/6, 3/6, or 
4/6). The Specialists, as a group, were found to 
prefer the extreme odds significantly more 
often than were the Managers. A close in- 
spection of the data revealed, however, that 
the two groups actually did not differ in pref- 
erence for the high risk (1/6) alternative. The 
difference in their scores was accounted for 
entirely by differences in their preference for 
the relatively low risk alternative (5/6), a find- 
ing that could indicate a fear of failure or “need 
for security” motivation. 

The research evidence suggests that high 
fear of failure is not consistently associated 
with low n Achievement. It could very well be 
that these two factors are, at least in part, 
independent variables. Some research has sug- 
gested that high fear of failure may be asso- 
ciated with a moderate level of achievement 
motivation (McClelland et al., 1953). The 
man who is very low in achievement motiva- 
tion may not have a strong enough need for 
success in a competitive situation to be greatly 
concerned about the consequences of failure. 

Tf preference for low risk does indicate fear 
of failure motivation, and if this is charac- 
teristic of persons with a moderate level of 
need for achievement, we would expect the 
n Achievement scores for subjects in this study 
who showed preference forthe 5/6 risk tocluster 
near the middle of the n Achievement distribu- 
tion. A check of the data indicates a definite 
trend in this direction. The average n Achieve- 
ment score for the men who preferred the 5 /6 
bet was a little lower, but not significantly dif- 
ferent from the average score for the remainder 
of the sample. An analysis of the distribution 
of the scores, however, revealed that the 
variance of the n Achievement scores for the 
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men who preferred the 5/6 bet was significantly 
smaller than the variance of these scores for 
the remainder of the distribution.’ 

The fact that Managers’ ratings of “achieve- 
ment drive” did not correlate with either n 
Achievement or preference for intermediate 
risks is not surprising. Other researchers have 
reported similar findings. In one study, for 
example, self-ratings of achievement motiva- 
tion were not found to be correlated with n 
Achievement (McClelland et al., 1953). In 
another study clinicians’ ratings of need for 
achievement were not found to be related to 
n Achievement (McClelland et al., 1953), 
and in a third study a similar finding was re- 
ported in relation to teachers’ ratings of the 
achievement drive of students (Winterbottom, 
1958). The kind of achievement motivation 
that is tapped by a thematic apperceptive 
measure of n Achievement is not a personality 
characteristic which is easy to observe and 
evaluate. 


SUMMARY 


This exploratory study represents one of 
the first attempts in an industrial setting to 
determine the relationship between measured 
motive patterns of individuals and their ad- 
justment to particular job roles. Persons in 
jobs with definite entrepreneurial characteris- 
tics were expected by hypothesis to score 
higher on a measure of "need for achievement” 
than persons whose jobs are nonentrepre- 
neurial Moreover, in a situation requiring 
decisions involving risk, entrepreneurs were 
expected to show preference for risks with 
odds of success near 50-50, since previous re- 
Search had shown this type of risk behavior to 
be associated with achievement motivation. 

Managers selected to represent the entre- 
preneurial role did score higher than Specialists 


5 F test significant at .01 level. 
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preference for intermediate level risks oi 
Risk Preference Questionnaire. Howevi 
attempt to establish more positively the; 
that these differences could be attribui 
the entrepreneurial characteristics of the 
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previous study, designed to account 

for the phenomenon of identification 

in terms of incidental learning, demon- 

strated that children readily imitated behavior 

exhibited by an adult model in the presence 

of the model (Bandura & Huston, 1961). A 

series of experiments by Blake (1958) and 

others (Grosser, Polansky, & Lippitt, 1951; 

Rosenblith, 1959; Schachter & Hall, 1952) 

have likewise shown that mere observation 

of responses of a model has a facilitating effect 

on subjects’ reactions in the immediate social 
influence setting. 

While these studies provide convincing evi- 
dence for the influence and control exerted on 
others by the behavior of a model, a more 
crucial test of imitative learning involves the 
generalization of imitative response patterns 
to new settings in which the model is absent. 
In the experiment reported in this paper 
ildren were exposed to aggressive and non- 
gressive adult models and were then tested 
mount of imitative learning in a new situ- 
in the absence of the model. According 


reinforcement, and these tendencies would 
generalize to some extent to adult experi- 
menters (Miller & Dollard, 1941). 

It was further predicted that observation 
of subdued nonaggressive models would have 
à generalized inhibiting effect on the subjects’ 
subsequent behavior, and this effect would be 
reflected in a difference between the non- 
aggressive and the control groups, with sub- 
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jects in the latter group displaying significantly 
more aggression. 

Hypotheses were also advanced concerning 
the influence of the sex of model and sex of 
subjects on imitation. Fauls and Smith (1956) 
have shown that preschool children perceive 
their parents as having distinct preferences 
regarding sex appropriate modes of behavior 
for their children. Their findings, as well as 
informal observation, suggest that parents re- 
ward imitation of sex appropriate behavior 
and discourage or punish sex inappropriate 
imitative responses, e.g, a male child is un- 
likely to receive much reward for performing 
female appropriate activities, such as cooking, 
or for adopting other aspects of the maternal 
role, but these same behaviors are typically 
welcomed if performed by females. As a result 
of differing reinforcement histories, tendencies 
to imitate male and female models thus acquire 
differential habit strength. One would expect, 
on this basis, subjects to imitate the behavior 
of à same-sex model to a greater degree than 
a model of the opposite sex. 

Since aggression, however, is a highly mascu- 
line-typed behavior, boys should be more pre- 
disposed than girls toward imitating aggres- 
sion, the difference being most marked for 
subjects exposed to the male aggressive model. 


METHOD 


Subjects 

The subjects were 36 boys and 36 girls enrolled in 
the Stanford University Nursery School. They ranged 
in age from 37 to 69 months, with a mean age of 52 
months. Š 

Two adults, a male and a female, served in the role 
of model, and one female experimenter conducted the 
study for all 72 children. 


Experimental Design 

Subjects were divided into eight experimental groups 
of six subjects each and a control group consisting of 
24 subjects. Half the experimental subjects were ex- 
posed to aggressive models and half were exposed to 
models that were subdued and nonaggressive in their 
behavior. These groups were further subdivided into 
male and female subjects. Half the subjects in the 
aggressive and nonaggressive conditions observed 
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same-sex models, while the remaining subjects in each 
group viewed models of the opposite sex. The control 
group had no prior exposure to the adult models and 
was tested only in the generalization situation. 

It seemed reasonable to expect that the subjects’ 
level of aggressiveness would be positively related to the 
readiness with which they imitated aggressive modes of 
behavior. Therefore, in order to increase the precision 
of treatment comparisons, subjects in the experimental 
and control groups were matched individually on the 
basis of ratings of their aggressive behavior in social 
interactions in the nursery school. 

The subjects were rated on four five-point tating 
scales by the experimenter and a nursery school teacher, 
both of whom were well acquainted with the children. 
These scales measured the extent to which subjects 
displayed physical aggression, verbal aggression, 
aggression toward inanimate objects, and aggressive 
inhibition. The latter scale, which dealt with the 
subjects’ tendency to inhibit aggressive reactions in the 
face of high instigation, provided a measure of aggres- 
sion anxiety. 

Fifty-one subjects were rated independently by 
both judges so as to permit an assessment of interrater 
agreement. The reliability of the composite aggression 
score, estimated by means of the Pearson product- 
moment correlation, was .89. 

The composite score was obtained by summing the 
ratings on the four aggression scales; on the basis of 
these scores, subjects were arranged in triplets and 
assigned at random to one of two treatment conditions 
or to the control group. 


Experimental Conditions 


In the first step in the procedure subjects were 
brought individually by the experimenter to the 
experimental room and the model who was in the 
hallway outside the room, was invited by the experi- 
menter to come and join in the game. The experimenter 
then escorted the subject to one corner of the room, 
which was structured as the subject’s play area. After 
seating the child at a small table, the experimenter 
demonstrated how the subject could design pictures 
with potato prints and picture stickers provided. The 
potato prints included a variety of geometrical forms; 
the stickers were attractive multicolor pictures of 
animals, flowers, and western figures to be pasted on a 
pastoral scene. These activities were selected since they 
had been established, by previous studies in the nursery 
school, as having high interest value for the children. 

After having settled the subject in his corner, the 
experimenter escorted the model to the opposite corner 
of the room which contained a small table and chair, 
a tinker toy set, a mallet, and a 5-foot inflated Bobo 
doll. The experimenter explained that these were the 
materials provided for the model to play with and, 
after the model was seated, the experimenter left the 
experimental room. 

With subjects in the nonageressive condition, the 
model assembled the tinker toys in a quiet subdued 
manner totally ignoring the Bobo doll. 

In contrast, with subjects in the aggressive condition, 

' the model began by assembling the tinker toys but 
after approximately a minute had elapsed, the model 
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turned to the Bobo doll and spent the remainder of 
the period aggressing toward it. 

Imitative learning can be clearly demonstrated ifa 
model performs sufficiently novel patterns of responses 
which are unlikely to occur independently of the ob- 
servation of the behavior of a model and if a subject 
reproduces these behaviors in substantially identical 
form. For this reason, in addition to punching the 
Bobo doll, a response that is likely to be performed by 
children independently of a demonstration, the model 
exhibited distinctive aggressive acts which were to be 
Scored as imitative responses. The model laid Bobo 
on its side, sat on it and punched it repeatedly in the 
nose. The model then raised the Bobo doll, picked up 
the mallet and struck the doll on the head. Following 
the mallet aggression, the model tossed the doll up in 
the air aggressively and kicked it about the room. This 
Sequence of physically aggressive acts was repeated 
approximately three times, interspersed with verbally 
aggressive responses such as, “Sock him in the nose 

++)” “Hit him down... ,” “Throw him in the air 

++)” “Kick him... ,” “Pow.,.,’ and two non- 
aggressive comments, “He keeps coming back for more” 
and “He sure is a tough fella.” 

Thus in the exposure situation, subjects were pro- 
vided with a diverting task which occupied their 
attention while at the same time insured observation 
of the model’s behavior in the absence of any instruc- 
tions to observe or to learn the responses in question, 
Since subjects could not perform the model’s aggressive 
behavior, any learning that occurred was purely on an 
observational or covert basis. 

At the end of 10 minutes, the experimenter entered 
the room, informed the subject that he would now 
go to another game room, and bid the model goodbye. 


Aggression Arousal 


Subjects were tested for the amount of imitative 
learning in a different experimental room that was set 
off from the main nursery school building. The two 
experimental situations were thus clearly differentiated; 
in fact, many subjects were under the impression that 
they were no longer on the nursery school grounds. 

Prior to the test for imitation, however, all subjects, 
experimental and control, were subjected to mild 
aggression arousal to insure that they were under some 
degree of instigation to aggression. The arousal experi- 
ence was included for two main reasons. In the first 
Place, observation of aggressive behavior exhibited 
by others tends to reduce the probability of aggression 
on the part of the observer (Rosenbaum & deCharms, 
1960). Consequently, subjects in the aggressive con- 
dition, in relation both to the nonaggressive and con- 
trol groups, would be under weaker instigation following 
exposure to the models. Second, if subjects in. the non- 
aggressive condition expressed little aggression in the 
face of appropriate instigation, the presence of an 
inhibitory process would seem to be indicated. 

Following the exposure experience, therefore, the 
experimenter brought the subject to an anteroom 
that contained these relatively attractive toys: a fire 
engine, a locomotive, a jet fighter plane, a cable cat, 
a colorful spinning top, and a doll set complete with 
wardrobe, doll carriage, and baby crib. The experi- 
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menter explained that the toys were for the subject 
to play with but, as soon as the subject became suffi- 
ciently involved with the play material (usually in about. 
2 minutes), the experimenter remarked that these 
were her very best toys, that she did not let just anyone 
play with them, and that she had decided to reserve 
these toys for the other children. However, the subject 
could play with any of the toys that were in the next 
room, The experimenter and the subject then entered 
the adjoining experimental room. 

Tt was necessary for the experimenter to remain in 
the room during the experimental session; otherwise a 


| number of the children would either refuse to remain 


alone or would leave before the termination of the 


| session. However, in order to minimize any influence 
| her presence might have on the subject’s behavior, 


the experimenter remained as inconspicuous as possible 
by busying herself with paper work at a desk in the 
far corner of the room and avoiding any interaction 
with the child. 


Test for Delayed Imitation 


: The experimental room contained a variety of toys 
including some that could be used in imitative or non- 
imitative aggression, and others that tended to elicit 
predominantly nonaggressive forms of behavior. The 
aggressive toys included a 3-foot Bobo doll, a mallet 
and peg board, two dart guns, and a tether ball with 
a face painted on it which hung from the ceiling. The 
nonaggressive toys, on the other hand, included a tea 
set, crayons and coloring paper, a ball, two dolls, three 
bears, cars and trucks, and plastic farm animals. 

Tn order to eliminate any variation in behavior due 

to mere placement of the toys in the room, the play 
material was arranged in a fixed order for each of the 
sessions. 
The subject spent 20 minutes in this experimental 
toom during which time his behavior was rated in terms 
of predetermined response categories by judges who 
observed the session through a one-way mirror in an 
adjoining observation room. The 20-minute session 
was divided into 5-second intervals by means of an 
electric interval timer, thus yielding a total number of 
240 response units for each subject. 

The male model scored the experimental sessions 
for all 72 children. Except for the cases in which he 
served as model, he did not have knowledge of the 
subjects? group assignments. In order to provide an 
estimate of interscorer agreement, the performances 
of half the subjects were also scored independently bya 
second observer. Thus one or the other of the two ob- 
servers usually had no knowledge of the conditions to 
which the subjects were assigned. Since, however, 
but two of the subjects in the aggressive condition 
performed the models’ novel aggressive responses while 
subjects in the other conditions only rarely exhibited 
A reactions, subjects who were exposed to the aggres- 
dcs models could be readily identified through their 

istinctive behavior. 
The responses scored involved highly specific con- 


» crete classes of behavior and yielded high interscorer 


teliabilities, the product-moment coefficients being 
in the .90s. 
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Response Measures 


Three measures of imitation were obtained: 

Imitation of physical aggression: This category in- 
cluded acts of striking the Bobo doll with the mallet, 
sitting on the doll and punching it in the nose; kicking 
the doll, and tossing it in the air. 

Imitative verbal aggression: Subject repeats the 
phrases, “Sock him,” “Hit him down,” “Kick him,” 
“Throw him in the air,” or “Pow.” 

Imitative monaggressive verbal responses: Subject 
repeats, “He keeps coming back for more,” or “He sure 
is a tough fella.” 

During the pretest, a number of the subjects imi- 
tated the essential components of the model’s behavior 
but did not perform the complete act, or they directed 
the imitative aggressive response to some object other 
than the Bobo doll. Two responses of this type were 
therefore scored and were interpreted as partially 
imitative behavior. 

Mallet aggression: Subject strikes objects other than 
the Bobo doll aggressively with the mallet. 

‘Sits on Bobo doll: Subject lays the Bobo doll on its 
side and sits on it, but does not aggress toward it. 

The following additional nonimitative aggressive 
responses were scored: 

Punches Bobo doll: Subject strikes, slaps, or pushes 
the doll aggressively. 

Nonimitative physical and verbal aggression: This 
category included physically aggressive acts directed 
toward objects other than the Bobo doll and any hostile 
remarks except for those in the verbal imitation cate- 
gory; eg, “Shoot the Bobo,” “Cut him,” “Stupid 
ball,” “Knock over people,” “Horses fighting, biting” 

Aggressive gun play: Subject shoots darts or aims the 
guns and fires imaginary shots at objects in the room. 

Ratings were also made of the number of behavior 
units in which subjects played nonaggressively or sat 
quietly and did not play with any of the material at all. 


RESULTS 
Complete Imitation of Mi odels’ Behavior 


Subjects in the aggression condition repro- 
duced a good deal of physical and verbal ag- 
gressive behavior resembling that of the 
models, and their mean scores differed mark- 
edly from those of subjects in the nonaggressive 
and control groups who exhibited virtually no 
imitative aggression (see Table 1). 

Since there were only à few scores for sub- 
jects in the nonaggressive and control condi- 
tions (approximately 70% of the subjects had 
zero scores), and the assumption of homo- 
geneity of variance could not be made, the 
Friedman two-way analysis of variance by 
ranks was employed to test the significance of 
the obtained differences. 

The prediction that exposure of subjects to 
aggressive models increases the probability 
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of aggressive behavior is clearly confirmed 
(see Table 2). The main effect of treatment 
conditions is highly significant both for physi- 
cal and verbal imitative aggression. Compari- 
son of pairs of scores by the sign test shows that 
the obtained over-all differences were due 
almost entirely to the aggression displayed by 
subjects who had been exposed to the aggres- 
sive models. Their scores were significantly 
higher than those of either the nonaggressive 


TABLE 1 


MEAN AGGRESSION ScoRES ror EXPERIMENTAL 
AND CONTROL SUBJECTS 


Experimental groups imitation. 
Analysis of variance of scores based on the 
Response category Aggressive | Nonaggres- [Control subjects? use of the mallet aggressively toward 
soups objects other than the Bobo doll reveals that 
F F treatment conditions are a statistically sig- 
hans neater nificant source of variation (Table 2). In ad- 
Imitative physical aggres- dition, individual sign tests show that both the 
sion aggressive and the control groups, relative to 

MR alae] o2| coj i2 subjects in the nonaggressive condition, pro: 
Imitative verbal aggression | | | | ^| "^ duced significantly more mallet aggression, 

eae subjecta EI yo) %3j 0.0] 0.7 the difference being particularly marked with 
ED doe anneal Met de 8101 47 regard to female subjects. Girls who observed 

Female subjects 17.2 | 18.7} €5| o.s 1.1 — nonaggressive models performed a mean num- 
M 15.5 | 28.8 | 18.7 | 6.7 | 13. ber of 0.5 mallet aggression responses as com- 

Female subjects 6.3] 16.5] s8] 4s] a pared to mean values of 18.0 and 13.1 for girls | 

Male subjects 18.9 | 11.9 | 15.6 | 14.8 7 in the aggressive and control groups, re- 
Erste iban 2.3 | &4| 7. spectively. 

Male subjects 16.2 | 36.7 it mais RE Although subjects who observed aggressive 
Aggressive gun play models performed more mallet aggression 
A cu 73|159| &»|ses| ui (f= 20.0) than their controls (M = 133), 

: the difference was not statistically significant. 
TABLE 2 


SIGNIFICANCE OF THE DIFFERENCES BETWEEN 
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or control groups, which did not differ from 
each other (Table 2). 

Imitation was not confined to the model’s 
aggressive responses. Approximately one-third 
of the subjects in the aggressive condition also 
repeated the model’s nonaggressive verbal 
responses while none of the subjects in either 
the nonaggressive or control groups made such 
remarks. This difference, tested by means of 
the Cochran Q test, was significant well beyond 
the .001 level (Table 2). 


Partial Imitation of Models’ Behavior 


Differences in the predicted direction were 
also obtained on the two measures of partial 


EXPERIMENTAL AND CONTROL GROUPS IN THE 


EXPRESSION OF AGGRESSION 


Comparison of pairs of treatment conditions | 


Res a : Nonag- 
OMA $a > * p 
? 
NAA responses 
sical aggression 27.17 4 .09 
MS aggression A 9.17 Sk Stone s .09 
lonaggressive verbal responses i 
pari ene 'esp 17.50 <.001 .004 .004 m 
Mallet aggression 11.06 <.01 02 +005 
Sits on Bobo — 13.44 <.01 “O18 "059 ns 
Nonimitative aggression 
pos e Hee i 2.87 ns 
ysical and verbal j 8.96 «.02 n 
Aggressive gun play 2-15. ns jas rd 
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With respect to the partially imitative re- 
sponse of sitting on the Bobo doll, the over-all 
group differences were significant beyond the 
‘01 level (Table 2). Comparison of pairs of 
scores by the sign test procedure reveals that 
subjects in the aggressive group reproduced 
this aspect of the models’ behavior to a greater 
extent than did the nonaggressive (p = .018) 
or the control (p = .059) subjects. The latter 
two groups, on the other hand, did not differ 
from each other. 


Nonimitative Aggression 


Analyses of variance of the remaining ag- 
gression measures (Table 2) show that treat- 
ment conditions did not influence the extent 
to which subjects engaged in aggressive gun 
play or punched the Bobo doll. The effect of 
conditions is highly significant (xê, = 8.96, 
p < 02), however, in the case of the subjects’ 
expression of nonimitative physical and verbal 
aggression. Further comparison of treatment 
pairs reveals that the main source of the over- 
all difference was the aggressive and non- 
aggressive groups which differed significantly 
from each other (Table 2), with subjects ex- 
posed to the aggressive models displaying the 
greater amount of aggression. 


Influence of Sex of Model and Sex of Subjects on 
Imitation 


The hypothesis that boys are more prone 
than girls to imitate aggression exhibited by 
a model was only partially confirmed. / tests 
computed for subjects in the aggressive con- 
dition reveal that boys reproduced more imita- 
tive physical aggression than girls (¢ = 2.50, 
pus 01). The groups do not differ, however, 
in their imitation of verbal aggression. 

The use of nonparametric tests, necessitated 
by the extremely skewed distributions of 


' scores for subjects in the nonaggressive and 


control conditions, preclude an over-all test 
of the influence of sex of model per se, and of 
the various interactions between the main 
effects. Inspection of the means presented in 
Table 1 for subjects in the aggression condi- 
tion, however, clearly suggests the possibility 
of a Sex X Model interaction. This interaction 
effect is much more consistent and pronounced 


. for the male model than for the female model. 


Male subjects, for example, exhibited more 
physical (¢ = 2.07, p < .05) and verbal imita- 
tive aggression (¢ = 2.51, p < .05), more non- 


579 


imitative aggression (/ = 3.15, p < .025), and 
engaged in significantly more aggressive gun 
play (¢ = 2.12, p < .05) following exposure 
to the aggressive male model than the female 
subjects. In contrast, girls exposed to the 
female model performed considerably more 
imitative verbal aggression and more non- 
imitative aggression than did the boys (Table 
1). The variances, however, were equally large 
and with only a small V in each cell the mean 
differences did not reach statistical significance. 

Data for the nonaggressive and control 
subjects provide additional suggestive evi- 
dence that the behavior of the male model 
exerted a greater influence than the female 
model on the subjects’ behavior in the gener- 
alization situation. 

It will be recalled that, except for the greater 
amount of mallet aggression exhibited by the 
control subjects, no significant differences were 
obtained between the nonaggressive and con- 
trol groups. The data indicate, however, that 
the absence of significant differences between 
these two groups was due primarily to the 
fact that subjects exposed to the nonaggressive 
female model did not differ from the controls 
on any of the measures of aggression. With 
respect to the male model, on the other hand, 
the differences between the groups are striking, 
Comparison of the sets of scores by means of 
the sign test reveals that, in relation to the 
control group, subjects exposed to the non- 
aggressive male model performed significantly 
less imitative physical aggression (p = .06), 
less imitative verbal aggression (p = .002), 
Jess mallet aggression (p = .003), less non- 
imitative physical and verbal aggression 
(p = .03), and they were less inclined to punch 
the Bobo doll (p = .07). 

While the comparison of subgroups, when 
some of the over-all tests do not reach sta- 
tistical significance, is likely to capitalize on 
chance differences, nevertheless the con- 
sistency of the findings adds support to the 
interpretation in terms of influence by the 
model. 


Nonaggressive Behavior 


With the exception of expected sex differ- 
ences, Lindquist (1956) Type III analyses of 
variance of the nonaggressive response scores 
yielded few significant differences. 

Female subjects spent more time than boys 
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playing with dolls (p < .001), with the tea set 
(p < .001), and coloring (p < .05). The boys, 
on the other hand, devoted significantly more 
time than the girls to exploratory play with 
the guns (p < .01). No sex differences were 
found in respect to the subjects use of the 
other stimulus objects, ie, farm animals, 
cars, or tether ball. 

Treatment conditions did produce signifi- 
cant differences on two measures of nonag- 
gressive behavior that are worth mentioning. 
Subjects in the  nonaggressive condition 
engaged in significantly more nonaggressive 
play with dolls than either subjects in the 
aggressive group (¢ = 2.67, p < .02), or in 
the control group (¢ = 2.57, p < .02). 

Even more noteworthy is the finding that 
subjects who observed nonaggressive models 
spent more than twice as much time as sub- 
jects in aggressive condition (¢ = 3.07, p < .01) 
in simply sitting quietly without handling 
any of the play material, 


DISCUSSION 


Much current research on social learning is 
focused on the shaping of new behavior 
through rewarding and punishing conse- 
quences, Unless responses are emitted, how- 
ever, they cannot be influenced. The results 
of this study provide strong evidence that 
observation of cues produced by the behavior 
of others is one effective means of eliciting 
certain forms of responses for which the origi- 
nal probability is very low or zero. Indeed, 
social imitation may hasten or short-cut the 
acquisition of new behaviors without the 
necessity of reinforcing successive approxima- 
tions as suggested by Skinner (1953). 

Thus subjects given an opportunity to 
observe aggressive models later reproduced a 
good deal of physical and verbal aggression 
(as well as nonaggressive responses) sub- 
stantially identical with that of the model, 
In contrast, subjects who were exposed to 
nonaggressive models and those who had no 
previous exposure to any models only rarely 
performed such responses. 

To the extent that observation of adult 
models displaying aggression communicates 
permissiveness for aggressive behavior, such 
exposure may serve to weaken inhibitory 
responses and thereby to increase the prob- 
ability of aggressive reactions to subsequent 
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frustrations. The fact, however, that subjects 
expressed their aggression in ways that clearly 
resembled the novel patterns exhibited by the 
models provides striking evidence for the 
occurrence of learning by imitation, 

In the procedure employed by Miller and 
Dollard (1941) for establishing imitative be- 
havior, adult or peer models performed dis- 
crimination responses following which they 
were consistently rewarded, and the subjects 
were similarly reinforced whenever they 
matched the leaders’ choice responses. While 
these experiments have been widely accepted 
as demonstrations of learning by means of 
imitation, in fact, they simply involve a spe- 
cial case of discrimination learning in which 
the behavior of others serves as discriminative 
stimuli for responses that are already part of 
the subject's repertoire. Auditory or visual 
environmental cues could easily have been 
substituted for the social stimuli to facilitate 
the discrimination learning. In contrast, the 
process of imitation studied in the present 
experiment differed in several important re- 
spects from the one investigated by Miller 
and Dollard in that subjects learned to com- 
bine fractional responses into relatively com- 
plex novel patterns solely by observing the 
performance of social models without any 
opportunity to perform the models’ behavior in 
the exposure setting, and without any rein- 
forcers delivered either to the models or to the 
observers. 

An adequate theory of the mechanisms 


underlying imitative learning is lacking. The - 


explanations that have been offered (Logan, 
Olmsted, Rosner, Schwartz, & Stevens, 1955; 
Maccoby, 1959) assume that the imitator 
performs the models responses covertly. 
If it can be assumed additionally that rewards 
and punishments are self-administered in 


conjunction with the covert responses, the ' 


process of imitative learning could be ac- 
counted for in terms of the same principles 
that govern instrumental trial-and-error learn- 
ing. In the early stages of the developmental 
Process, however, the range of component re- 
Sponses in the organism's repertoire is prob- 
ably increased through a process of classical 
conditioning (Bandura & Huston, 1961; 
Mowrer, 1950). 

The data provide some evidence that the 
male model influenced the subjects’ behavior 
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outside the exposure setting to a greater 
extent than was true for the female model. 
In the analyses of the Sex X Model interac- 
tions, for example, only the comparisons 
involving the male model yielded significant 
differences. Similarly, subjects exposed to 
the nonaggressive male model performed less 
aggressive behavior than the controls, whereas 
comparisons involving the female model 
were consistently nonsignificant. 

In a study of learning by imitation, Rosen- 
blith (1959) has likewise found male experi- 
menters more effective than females in in- 
fluencing childrens’ behavior. Rosenblith 
advanced the tentative explanation that the 
school setting may involve some social dep- 
tivation in respect to adult males which, 
in turn, enhances the male’s reward value. 

The trends in the data yielded by the 
present study suggest an alternative explana- 
tion. In the case of a highly masculine-typed 
behavior such as physical aggression, there is 
a tendency for both male and female subjects 
to imitate the male model to a greater degree 
than the female model. On the other hand, 
in the case of verbal aggression, which is less 
clearly sex linked, the greatest amount of 
imitation occurs in relation to the same-sex 
model. These trends together with the finding 
that boys in relation to girls are in general 
more imitative of physical aggression but do 
not differ in imitation of verbal aggression, 
suggest that subjects may be differentially 
affected by the sex of the model but that 
predictions must take into account the degree 
to which the behavior in question is sex-typed. 

The preceding discussion has assumed 
that maleness-femaleness rather than some 
other personal characteristics of the particular 
models involved, is the significant variable— 
an assumption that cannot be tested directly 
with the data at hand. It was clearly evident, 
however, particularly from boys’ spontaneous 
remarks about the display of aggression by 
the female model, that some subjects at least 
Were responding in terms of a sex discrimina- 
tion and their prior learning about what is 
Sex appropriate behavior (e.g., “Who is that 
lady. That’s not the way for a lady to behave. 


k Ladies are supposed to act like ladies...” 


tt 

You should have seen what that girl did in 
there. She was just acting like a man. I never 
Saw a girl act like that before. She was punch- 
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ing and fighting but no swearing.”). Aggression 
by the male model, on the other hand, was 
more likely to be seen as appropriate and 
approved by both the boys (“Al’s a good 
socker, he beat up Bobo. I want to sock like 
AL") and the girls ("That man is a strong 
fighter, he punched and punched and he 
could hit Bobo right down to the floor and 
if Bobo got up he said, ‘Punch your nose.’ 
He’s a good fighter like Daddy.”). 

The finding that subjects exposed to the 
quiet models were more inhibited and un- 
responsive than subjects in the aggressive 
condition, together with the obtained differ- 
ence on the aggression measures, suggests 
that exposure to inhibited models not only 
decreases the probability of occurrence of 
aggressive behavior but also generally re- 
stricts the range of behavior emitted by the 
subjects. 

“Identification with aggressor” (Freud, 
1946) or “defensive identification” (Mowrer, 
1950), whereby a person presumably trans- 
forms himself from object to agent of ag- 
gression by adopting the attributes of an 
aggressive threatening model so as to allay 
anxiety, is widely accepted as an explana- 
tion of the imitative learning of aggression. 

The development of aggressive modes of 
response by children of aggressively punitive 
adults, however, may simply reflect object 
displacement without involving any such 
mechanism of defensive identification. In 
studies of child training antecedents of ag- 
gressively antisocial adolescents (Bandura & 
Walters, 1959) and of young hyperaggressive 
boys (Bandura, 1960), the parents were 
found to be nonpermissive and punitive of 
aggression directed toward themselves. On 
the other hand, they actively encouraged and 
reinforced their sons’ aggression toward 
persons outside the home. This pattern of 
differential reinforcement of aggressive be- 
havior served to inhibit the boys’ aggression 
toward the original instigators and fostered 
the displacement of aggression toward objects 
and situations eliciting much weaker in- 
hibitory responses. 

Moreover, the findings from an earlier 
study (Bandura & Huston, 1961), in which 
children imitated to an equal degree aggres- 
sion exhibited by a nurturant and a non- 
nurturant model, together with the results 
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of the present experiment in which subjects 
readily imitated aggressive models who were 
more or less neutral figures suggest that 
mere observation of aggression, regardless 
of the quality of the model-subject relation- 
ship, is a sufficient condition for producing 
imitative aggression in children, A compara- 
tive study of the subjects’ imitation of ag- 
gressive models who are feared, who are linked 
and esteemed, or who are essentially neutral 
figures would throw some light on whether 
or not a more parsimonious theory than the 
one involved in “identification with the ag- 
gressor" can explain the modeling process. 


SUMMARY 


Twenty-four preschool children were as- 
signed to each of three conditions. One experi- 
mental group observed aggressive adult 
models; a second observed inhibited non- 
aggressive models; while subjects in a control 
group had no prior exposure to the models. 
Half the subjects in the experimental con- 
ditions observed same-sex models and half 
viewed models of the opposite sex. Subjects 
were then tested for the amount of imitative 
as well as nonimitative aggression performed 
in a new situation in the absence of the models. 

Comparison of the subjects’ behavior in 
the generalization situation revealed that 
subjects exposed to aggressive models re- 
produced a good deal of aggression resembling 
that of the models, and that their mean 
scores differed markedly from those of sub- 
jects in the nonaggressive and control groups. 
Subjects in the aggressive condition also 
exhibited significantly more partially imita- 
tive and nonimitative aggressive behavior 
and were generally less inhibited in their be- 
havior than subjects in the nonaggressive 
condition. 

Imitation was found to be differentially 
influenced by the sex of the model with boys 
Showing more aggression than girls following 
exposure to the male model, the difference 
being particularly marked on highly mascu- 
line-typed behavior. 
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Subjects who observed the nonaggressiy 
models, especially the subdued male model 
were generally less aggressive than 
controls. E 

The implications of the findings based on 
this experiment and related studies for t] 
psychoanalytic theory of identification wit 
the aggressor were discussed. 
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MPRESSIONS formed of others are only 

partially determined by their stimulus 

qualities. In addition, the judge's over- 
all attitude toward the person being judged 
(Thorndike, 1920), his “theory” of how traits 
are related to each other (Hays, 1958; New- 
comb, 1931), his emotional state (Feshbach & 
Singer, 1957; Murray, 1933), the underlying 
structure of perception itself (Asch, 1946; 
Levy & Dugan, 1960; Wishner, 1960), and 
the context in which the stimulus person is 
presented for judgment (Levy, 1960, 1961) 
have all been shown to enter into the process 
of impression formation. Yet to be investi- 
gated, however, is the role played by learning 
in impression formation, and it is to this prob- 
lem that the present investigation is ad- 
dressed. 

Using what was called a “constant error 
approach" Levy and Dugan (1960) obtained 
correlations between trait judgments which, 
by the nature of the method used, they claimed 
could not be attributed to either the opera- 
tion of the halo effect or the judges’ implicit 
theories of personality, but which must in- 
stead represent certain underlying dimensions 
of social perception. In essence, the constant 
error approach involves obtaining scores for 
each judge representing the frequency with 
which he makes a particular judgment on 
each of several bipolar dimensions where 
each judgment involves a different facial 
Photograph, and then correlating these scores 
over judges. Considering the scores as rep- 
resenting tendencies to make particular 
Judgments, or perceptual dispositions, the cor- 
relations between them may be viewed as re- 
flecting the extent to which these dispositions 
ru related to each other, and factors derived 
rom a matrix of such correlations may be 
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conceived of as representing certain basic 
dimensions of social perception or judgment. 

Two questions immediately suggest them- 
selves for investigation in regard to Levy and 
Dugan’s (1960) findings. The first concerns 
the extent to which perceptual dispositions 
can be modified by differential reinforce- 
ment, and the second is that of whether changes 
in dispositions with regard to one dimension 
resulting from differential reinforcement re- 
stricted to that dimension will generalize 
to other dimensions in a manner consistent 
with their correlations with that dimension. 
The answer to the first question bears upon 
the applicability of the traditional principles 
of learning to the realm variously referred to 
as impression formation, person perception, 
or social perception. The answer to the second 
question, while sharing with the first this same 
general implication, goes further in having a 
direct bearing upon the validity and interpre- 
tation of the correlations between dimensions 
reported by Levy and Dugan. 

The correlation between judgments for 
two traits obtained by the constant error 
method has been interpreted as indicating 
that they are not entirely independent of 
each other from the standpoint of perceptual 
functioning. That is, judgments for Trait A 
are believed to correlate to a greater extent 
with those for Trait B than they do with 
those for Trait C, because of those determi- 
nants governing each of these judgments, 
judgments for Traits A and B share a larger 
number in common than do judgments for 
Traits A and C. If this is so, then it should 
follow that modifications in judgmental 
tendencies in connection with one trait should 
be accompanied by modifications in judg- 
mental tendencies in connection with other 
correlated traits. Further, it is reasonable to 
expect that the extent to which changes occur 
in judgments on these other traits should be 
directly related to the magnitude of their 
correlation with the initial trait for which 
judgmental tendencies have been modified. 
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It should thus become possible, granting 
the validity of the above reasoning, to order 
traits along a continuum on the basis of their 
correlations with a particular trait for which 
judgments are to be differentially reinforced, 
and to produce a gradient for the extent of 
changes in judgments for each of these traits 
which varies monotonically with the magni- 
tudes of their correlations with the trait in 
question. Consequently, the following two 
hypotheses were proposed for testing in the 
present experiment: 

Hypothesis I. The probability of making a 
particular judgment for a facial photograph 
on a given bipolar dimension is dependent 
upon the nature of the reinforcements con- 
tingent upon making such a judgment. 

Hypothesis II. The relative magnitude of 
the probability of making a particular judg- 
ment, on a given bipolar dimension for which 
no previous reinforcement has been given, is 
directly related to the magnitude of the 
correlation between that dimension and the 
dimension for which judgments were differ- 
entially reinforced. 


METHOD 


Subjects. Subjects for this experiment were 30 men 
and 30 women, all students in an introductory psychol- 
ogy course which had as a course requirement participa- 
tion in a certain number of experiments. They were 
randomly and evenly assigned to one of three groups 
with the only restriction being that there be an equal 
number of men and women in each group. 

Stimuli, The facial photographs used in this experi- 
ment were of white, male applicants for graduate 
training in psychology at Indiana University who, 
for various reasons, did not enter the program. These 
pictures have been used in previous studies (Levy, 
1960, 1961; Levy & Dugan, 1960) and therefore 
norms are available for each picture concerning the 
frequency with which a particular dichotomous judg- 
ment is made for it on a particular dimension. These 
norms have been found to be quite stable from one 
Subject sample to another (Levy, 1961). Norms used 
in the selection of the present pictures were based on 
judgments made by 104 subjects for each of 75 pictures 
with regard to a particular dimension. The dimensions 
used for these judgments are those used in the present 
study; they are: safe-dangerous, good-bad, reliable- 
unreliable, and relaxed-tense. For the present study 
only pictures having values ranging between 40% and 
60% for frequencies of judgments at one pole or the 
other were used. Twenty-two safe-dangerous pictures 
were used having a mean of 51.4% for “safe” judg- 
ments, five pictures each were used for each of the other 
dimensions having means of 50.2%, 50.6%, and 50.4% 
for judgments of “good,” “reliable,” and “relaxed,” 
respectively. 


Leon H. Levy 


Each picture was mounted on a 3 X 4 inch piece 
of poster board. Attached immediately below was a 
label containing the two terms designating the di. 


mension on which that picture was to be judged. To 


control for possible position response sets, the left- 
right relationship for terms for approximately half of 
the pictures in each set was reversed. 

Dimensions. The dimensions used in this study were 
chosen because each had been found by Levy and 
Dugan (1960) to be highly loaded on a different 
orthogonal factor. The correlations found by Levy 
and Dugan for each of the generalization dimensions 
with safe-dangerous were .50 for good-bad, .34 for 
reliable-unreliable, and .02 for relaxed-tense. While 
these correlations had been based on judgments by 
100 subjects for only 15 pictures for each dimension, 
analogous correlations derived from the normative 
study involving 104 subjects and 75 pictures for each 
dimension were .80, .69, and .28, differing in absolute 
but not relative magnitude. Thus it would appear that 
some confidence may be placed in the reliability of a 
simple ordering of these dimensions with regard to the 
degree of their relationship to safe-dangerous. 


Procedure. Safe-dangerous was selected as the dimen- 


sion on which judgments would be differentially rein- 


: 


forced and the three groups of subjects were defined 
on the basis of the reinforcement combination used. 


The experimenter responded with "right" after all 
“safe” judgments and with “wrong” after all “danger- 
ous” judgments for group SRDW, he responded in 
reverse fashion for group SWDR, and made no verbal 
response following judgments for the control group. 
On Trials 1-20, which were the training trials, only 
Pictures to be judged on the safe-dangerous dimension 


were presented. Trials 21-39 were test trials for general- | 


ization and on these a fixed random order was used in 
assigning the picture-judgment dimension for a given 
trial with the exception of Trials 24, 28, 32, and 36 
on which safe-dangerous pictures were presented for 
the purpose of reducing any possible extinction effects 
of the generalization series. 

Although the order of judgment dimensions ovet 
trials was the same for all subjects, the actual picture 
to be judged on a given trial was randomly select 
for each subject from the pool of pictures assigned to 
the designated dimension. Thus, for example, on Trial 
21, all subjects were required to make a judgment of 


TABLE 1 
ANALYSIS OF VARIANCE FOR CONDITIONED AND 
GENERALIZED CHANGES IN JUDGMENTS 
For Four DIMENSIONS 


Source df MS F 
Between groups 2 111.713 55.004** 
Subjects in same 57 2.031 

group 
Between dimensions 3 1.782 1.390. 
Dimensions X 6 4.657 3.690 
Groups 
Pooled subjects X 171 1.282 
Dimensions 
*p < 01. 
**) < 001. 
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Fic. 1. Generalization gradients for the three groups 
of subjects for the judgments indicated on the abscissa. 


“good” or “bad” although which of the five pictures 
assigned to this dimension might be presented on this 
trial varied randomly from subject to subject. No 
picture was presented more than once to a subject 
with the exception of the safe-dangerous ones of which 
there were only 22 available and 24 needed. Thus on 
two of the booster trials during the test series, the 
experimenter randomly selected and presented two 
pictures which had been previously presented to the 
subject. 

The experiment was individually administered to 
each subject who was given the following instructions 
verbatim: 

This is one of a series of experiments pertaining 

to character evaluation. I am going to show you a 

number of pictures. Beneath each of these pictures 

are two descriptive words. Most of the descriptive 
words used are “safe-dangerous,” but others will 
be shown to you intermittently. I want you to 
look at the pictures carefully; look at the words 
printed below the pictures, and then tell me which 
of the two words you think best describes this person. 

Rate of presentation of the pictures was determined 
only by the time taken by the subject to report his 
judgment and the experimenter’s recording of it. There 
was no apparent distinction from the subject’s stand- 
Point between the training and test trials. 


RESULTS 


Each subject received four scores: one for 
the number of “safe” judgments for Training 
Trials 16-20, which was taken as the measure 
of level of conditioning, and one each for the 
numbers of “good,” “reliable,” and “relaxed” 
judgments for the appropriate pictures, which 
Were taken as the measures of generalization. 
Judgments for the safe-dangerous pictures 
used on booster trials were not analyzed. 

Treating these four scores as repeated 
Measures on the same individuals, an analysis 
of variance (Edwards, 1950) was performed 
that is summarized in Table 1. The F between 
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groups, which is significant at the .001 level, 
supports the first hypothesis regarding the 
contingency between judgments and their 
reinforcements. Judgments made by subjects 
in this experiment were susceptible to con- 
ventional conditioning procedures. 

The F for the interaction between dimen- 
sions and groups, which was significant at the 
.01 level, indicates that the trends of the 
means over the four dimensions for the three 
groups differ. As may be seen from the curves 
for the three groups plotted in Figure 1, 
judgments of pictures on the safe-dangerous 
dimension are affected by differential rein- 
forcement in the expected way, and this 
effect does generalize to judgments on the 
other dimensions in a way that is monotoni- 
cally related to the magnitude of their corre- 
lation with safe-dangerous. Thus the second 
hypothesis is also supported by these data. 


Discussion 


The constant error method provides a 
means of studying the structure of social 
perception from the standpoint of perceptual 
dispositions. The present study has shown 
that such dispositions are susceptible to 
modification by differential reinforcement, 
and that such modifications ramify through- 
out the perceptual system in a manner con- 
sistent with the structure of this system as 
delineated by the constant error method. 
Thus at one and the same time, the results 
of the present study argue for the validity 
of the structural picture that emerges by the 
use of the constant error method, and also 
serve to qualify social perception for member- 
ship within the realm of phenomena to which 
current principles of learning have been found 
applicable. , 

But beyond this, the present findings are 
consistent with Wishner’s (1960) demon- 
stration that in the Asch-type (Asch, 1946) 
impression formation study changes in im- 
pressions as à result of changes in informa- 
tional input follow lines predictable from 
empirical knowledge of the characteristics of 
the elements involved and their intercorrela- 
tions. This would suggest that given additional 
studies of the present type, as well as those of 
Asch and Wishner, the structure of the social 
perceptual system should become sufficiently 
clear so that it should ultimately be possible 
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to predict what effect any given bit of infor- 
mational input into one part of the system 
has upon any other part. 

The present study used what might be 
considered neutral stimulus objects with 
regard to the dimensions involved, and there- 
fore leaves unanswered the question of the 
interaction between perceptual dispositions 
and values of the stimulus objects involved. 
Every judgment might be thought of as the 
resultant of this interaction and thus the 
question arises of the relative contributions 
of dispositions and objects to the final out- 
come, To answer this question, research in- 
corporating an object sampling approach of 
the sort advocated by Brunswik (1947) is 
now being planned. 

The predicted changes in perceptual dis- 
positions observed in the present study were 
predicated upon the assumption of certain 
relationships existing between these disposi- 
tions. That these predictions were borne 
out supports the validity of this assumption. 
Now, another question of both theoretical 
and practical import is whether these very 
relationships can be altered. Conceivably, 
if they reflect the learning history of the 
individual with regard to situations requiring 
these judgments to be made, we should 
expect to find developmental changes in the 
Structure of social perception, and we should 
be able to modify this structure by appropriate 
experimental treatment, Concretely, for ex- 
ample, the question is whether it is possible 
to reduce the correlation between judgments 
of safeness and goodness and increase that 
between safeness and relaxedness. Some 
approaches to psychotherapy assume that 
this can be done (e.g., Kelly, 1955), and the 
whole question of the Possibility of modifying 
social stereotypes would seem to hinge upon 
this issue. Thus the question becomes one of 
the extent to which, and the means by which 
the relationships existing between perceptual 
dispositions may be modified. How, or whether, 
learning theory can make a contribution 
here remains to be determined. 

A question that is undoubtedly begged by 
the present study, one that plagues much of 
the research in the area of perception, is 
whether the changes observed truly represent 
changes in perception or merely changes in 
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verbal behavior. While the legitimacy 
question cannot be gainsaid, answerin; 
the context of the present study wo 
have been possible. For even if post- 
mental interviews were conducted in o 
determine whether subjects attributed 
icality to their judgments of pictures” 
either safe or dangerous, we would still 
have gone beyond the level of verbal behay 
To properly decide between these two al 
tives would require tapping into addi 
response systems of the organism and si 
ing for correlations between data obtai 
from these systems (Garner, Hake, & Eriks 
1956). Thus for the present, it must be r 
nized that an accounting for the pre 
findings in terms of changes in verbal beh 
remains a possible alternative. 


SUMMARY 

Subjects divided into two experiment 
and one control group were presented 
sets of facial photographs for each of 
they were to make a dichotomous judgm 
with regard to a specified bipolar dim 
Subjects in the experimental groups 
conditioned to make either “safe” judgm 
or “dangerous” judgments on the first set 
20 pictures by means of differential reinfort 
ment for each of these judgments, and 
were tested for the extent to which the eff 
of this conditioning generalized to judgmen 
for other pictures on good-bad, relial 
unreliable, and tense-relaxed dimensi 
The nature of the generalization gradienl 
had been predicted on the basis of ‘Correlation 
obtained in a previous study between jud 
ments on each of these dimensions and tho 


were discussed from the standpoint of 
implications for the nature of social percepuo 
and were interpreted as demonstrating the 
applicability of principls of learning 
problems in this area. 
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THE INFLUENCE OF AFFECTIVE FACTORS ON 
CONCEPTUALIZATION IN SCHIZOPHRENIA? 


MELVIN H. FEFFER 
Yeshiva University 


number of divergent positions exist 
A with regard to the interpretation of 

pathological concreteness in certain 
types of schizophrenia, The viewpoint under- 
lying the present study is that personality 
disorganization in schizophrenia is basic to 
pathological concreteness. This position can 
be articulated in terms of the following propo- 
sitions: 


Pathological concreteness of the schizophrenic is one 
manifestation of a generalized withdrawal from the 
affective disorganization engendered within interper- 
sonal relationships (Arieti, 1955; Hunt & Cofer, 1944; 
Jenkins, 1950). Standard conceptual tasks have inter- 
personal implications not only by virtue of the social 
context within which these are typically administered, 
but, more importantly, in terms of the view that con- 
ceptual activity in itself requires a coordination of 
different social perspectives (Mead, 1934; Piaget, 
1950). The schizophrenic’s focusing on peripheral, 
irrelevant aspects of a situation (Cameron, 1939; 
Von Domarus, 1944; Wegrocki, 1940) is interpreted 
as one technique of withdrawing from the interpersonal 
and hence affect arousing implications of the concep- 
tual task. More formally, one can hypothesize that the 
schizophrenic avoids the central, affect arousing im- 
plications of the conceptual task by focusing upon 
peripheral, irrelevant material, thereby evidencing 
inadequate conceptual functioning in terms of standard 
test criteria. 


A direct test of this hypothesis would in- 
volve varying affective and neutral com- 
ponents of a conceptual task along a “central- 
peripheral” dimension. Standard tests of 
conceptual activity, however, lend themselves 
less readily to such manipulation than per- 
ceptual figure-ground tasks, which can be 
so designed that contents in both the central 


1 This article is based upon a dissertation submitted 
in partial fulfillment of the requirements for the degree 
of Doctor of Philosophy in the Department of Psychol- 
ogy at the University of Chicago. 

The author wishes to thank David Shakow for his 
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made many helpful suggestions with regard to the 
present paper. 

This investigation has been supported in part by 
the Dementia Praecox Research Project, Worcester 
State Hospital, and a research grant (M-896) from the 
National Institute of Mental Health, United States 
Public Health Service. 


(figure) and peripheral (ground) q 
are either affective or neutral. The use of a 
figure-ground task in a test of the hypothesis, 
however, requires an assumption that patho- 
logical concreteness and inadequate figure- 
ground differentiation represent similar ac- 
tivities. In this regard, Goldstein (1939) 
has pointed out that the lack of subordination 
of ground to figure which occurs in inadequate 
figure-ground differentiation also characterizes 
pathological concreteness. That is, the patho- 
logically concrete attitude is one in which each 
object in the environment is considered as 
unique and self-contained and, hence, as 
important as every other object, rather than 
as subordinate exemplars of more general 
categories. 

Assuming that inadequate figure-ground 
differentiation and pathological concreteness 
are similar in terms of lack of subordination, 
one may advance the following experimental - 
hypothesis: Pathologically concrete schizo- 
phrenics will evidence an avoidance of affec- 
tive stimuli in figure-ground differentiation 
when figure and ground are varied in terms of 
affective and neutral valence. More speci- 
fically, the writer predicts that (a) under 
affective figure conditions, pathologically 
concrete schizophrenics focus on ground 
aspects of the field; (b) under affective ground | 
conditions, pathologically concrete schizo- | 


phrenics focus on figure aspects of the field; 
and (c) these patterns of behavior will be more 
characteristic of pathologically concrete 
schizophrenics than of two control groups 
adequately conceptualizing schizophrenics an 
normal individuals. 


METHOD 


Subjects and General Procedure 


Forty-one male subjects who had normal or corrected 
vision were employed. These consisted of the following 
three groups?: 


zn 


2The patient population was drawn from P 
State Hospital. The diagnoses were made by 4 St 
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1. Concrete group: 9 schizophrenic subjects who 
showed pathological concreteness on criterion tasks of 
conceptual thinking. 

2. Conceptualizing group: 16 schizophrenic subjects 
who showed adequate conceptualization on criterion 
tasks of conceptual thinking, 

3. Normal group: 16 subjects who, as far as could 
be ascertained, had no history of hospitalization for 
mental illness. Eleven subjects in this group were 
individuals working in the community and living at 
the YMCA. They were paid for taking part in the 
study. The rest of the group consisted of graduate 
students or professional workers in psychology. 

All subjects were seen in three sessions. In the first 
or screening session, the tests administered were the 
Stanford-Binet Vocabulary (1937) and two tests of 
conceptual ability derived from Wegrocki (1940). 
The purpose of this screening period was to select those 
subjects in the schizophrenic group who were extreme 
cases on the continuum of concrete-abstract thought, 
and to rule out those subjects who gave the impression 
that cooperation would be minimal on the more effort 
involving tasks, Following a random time interval of 
1-7 days, each subject was given a figure-ground tachis- 
toscopic task. A third session followed a time interval of 
3 days in which the Luria (1932) (word association- 
motor response) task, and a Wittenborn (1943) test of 
concentration were given. This latter test was used as a 
check on the effectiveness of the screening period in 
yielding cooperative subjects. 

The figure-ground situation was so constructed that 
figure and ground could be varied in terms of affective 
and neutral valence. This was accomplished by the use 
of a tachistoscopic task in which meaningful words 
Were placed in a central area on the stimulus field, 
while other meaningful words were placed on the periph- 
ery, All words were classified as either affective or neu- 
tral for each subject on the basis of the word associa- 
tion-motor technique developed by Luria (1932). 
Subjects were required to identify the central word in 
the figure-ground situation. A measure was devised 
which represented the presence of background content 
in the response to the figure, i.e., “ground intrusion.” 
Relationships between the degree of ground intrusion 
ànd the locus of the affective word—whether in figure 
or in ground—were then analyzed in order to determine 
whether avoidance of affective words had taken place. 


Criterion Tasks of Conceptual Thinking 


The tasks used as the basis for classifying each 
Schizophrenic subject as pathologically concrete or as 
an adequate conceptualizer were the Proverb Inter- 
Pretation test and the Essential Differences test (Wegro- 
1940). The Proverb Interpretation test consists of 
‘WO groups of proverbs and one group of interpretations. 
t ach proverb in the first group corresponds in meaning 
‘0 a proverb in the second group, although worded 
(G rently, e.g., “When in Rome, do as the Romans do" 

Toup 1) and “When in a forest, one must howl with 
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the wolves" (Group 2). The subject is first required to 
match the proverbs in the first two groups, and then 
he is asked to match the proverbs in the first group 
with the interpretations in the third group, e.g., “If 
one wants to live on good terms with his neighbors 
one must act as they do." Following Wegrocki, the 
subject's score is based on the number of errors made. 
Each incorrect matching of proverbs is given one point, 
while each incorrect matching of proverb and inter- 
pretation is given two points. On the basis of Wegro- 
cki's study it was decided that those schizophrenics 
scoring 9 or under would be classified as low error 
schizophrenics, while those scoring over 18 would be 
classified as high error schizophrenics. 

The Essential Differences test consists of groups of 
words, four words to a group. Three of the four words 
are similar in some respect while the fourth is not, 
eg, “chimney, fire, smokestack, stovepipe." The 
subject is required to underline the word that differs 
from the remaining three. The following instructions 
are read to each subject: 

I'd like you to underline that word in each line 
which you think is most different from the remain- 
ing three; but the words left must be in some way 
related and similar. For example, of these four words, 
chimney, fire, smokestack, stovepipe, which would 
you underline? 

The subject’s score is based on the number of errors 
made, each incorrect underlining or omission being 
given one point. Based on Wegrocki's findings, a critical 
score of 10 was chosen. Those schizophrenics scoring 
over 10 were classified as high error schizophrenics 
while those scoring 10 or under were classified as low 
error schizophrenics. For the present study, only those 
schizophrenics who were high error scorers on both 
tests were placed in the Concrete group. Similarly, 
only those schizophrenics who were low error scorers 
on both tests were placed in the Conceptualizing group. 
None of the Normal group exceeded the critical scores 
on either task. There is no significant difference in 
scoring between the Conceptualizing group and the 


Normal group. 


TABLE 1 


RELIABILITY COEFFICIENTS OF PROVERB 
INTERPRETATION, ESSENTIAL DIFFERENCES, 
AND WITTENBORN CONCENTRATION TESTS 


Test 
ittenbo 
Item — | proverb | Essential causeatration 
Interpre- | Dilfer 
S nens INOSAN sc 
e N-C N-C 
bro .82 .56 .88 82 .50 
SE 1.96 1.66 | 2.17 | 3.50 | 7.82 


Psychiatrists and psychologists, or by the psychia- 
Meee oe of the ward to which the patient was 


a N-C: normal control group; S-A: adequately conceptualizing 
schizophrenic group; S-C: pathologically concrete schizophrenic 


group. 
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Criteria of Affective and Neutral Words: 
Luria Task 


An estimate of the affective arousal of the words 
used in the tachistoscopic task was based on the Luria 
task (word association-motor response). The method 
and rationale are described by Luria (1932), who 
pioneered work in this area. Validation studies have 
been presented by Hull and Lugoff (1921), Huston, 
Shakow, and Erickson (1934), and Sharp (1938); 
other studies have shown that the test can be used with 
psychotic populations (Ebaugh, 1936; Shuey, 1937). 

The subject is seated in a comfortable chair while 
fingers of both hands rest on receiving tambours (rub- 
ber cups). He is asked to press down on one rubber cup 
with his dominant hand as he verbalizes his first asso- 
ciation to the examiner’s stimulus word, leaving his 
other hand resting lightly on the other rubber cup. 
Movements in either hand and reaction time were 
recorded on separate lines of a Phipps-Bird kymograph. 

The motor response of each subject was analyzed 
by the investigator without knowledge of either the 
stimulus word that elicited this response or the content 
of the subject’s association. A disturbance of either 
reaction time or dominant hand activity was sufficient 
to put that stimulus word in the affective category for 
that individual. If neither was considered disturbed, 
the word was categorized as neutral. A more complete 
description of the technique of ing the motor 
responses is given in Luria (1932), and Feffer (1954). 


Reliability of Tests 

The reliability coefficient of the tests used may be 
seen in Table 1, These were determined by the split- 
half method, using Rulon’s method (Guilford, 1942, 
p. 276) for the Essential Differences and the Witten- 
born Concentration tasks. The reliability coefficient 
for the Proverb Interpretation task was determined by 
the alternate forms method, using rank-order correla- 
tion. Since the subject’s score was determined by both 
forms, the Spearman-Brown formula was used. 

Reliability coefficients for the Essential Differences 
and the Proverb Interpretation tasks are based on the 
scores of the normal controls only. This is due to the 
fact that each schizophrenic group was, by design, 
selected for extreme homogeneity. 
Interjudge Reliability 

Seven representative Luria records were selected. 
Random sections of 10 consecutive responses which the 
experimenter and a second judge classified as affective 
or neutral were selected from each record. The degree 
of agreement between judges for each block of 10 
Consecutive responses was determined by the use of a 
contingency table. The null hypothesis was tested each 
time, Exact probabilities derived from each contingency 
were then combined in order to obtain an over-all 
probability measure. On the basis of such a test of 
significance, the hypothesis that the degree of agree- 
ment could have arisen by chance can be rejected at 
the .001 level. The median agreement was 90%. 


Figure-Ground Tachistoscopic Task 


The instructions to the subject were read orally: 
A number of words are going to be shown to you. 
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They are going to be shown very quickly so you will 
have a great deal of difficulty in seeing them. I want 
you to tell me in each case what the single diagonal 
word in large type is. It is best if you look at the 
place where the “X” is, because that is where the 
word will appear. Do not pay any attention to 
horizontal words in small type. Since you will have 
difficulty in seeing the diagonal word, be sure to 
guess if you are not sure. 

Sufficient time was taken on the first few practice 
trials to insure that the subject understood the nature 
of the task. There were 118 trials in this task; the first 
18 were practice trials and were not counted in the 
score. The order of presentation was the same for each 
subject. Each trial was presented at 3$ second, with a 
verbal warning 2 seconds prior to the presentation of 
the figure-ground stimulus? A Gerbrands modified 
Dodge tachistoscope was used for the presentation of 
the figure-ground stimuli. An example of the stimulus 
field is given below: | 

sugar À sugar 
sugar P sugar 
sugar P sugar 
sugar L sugar 
sugar E sugar A 

Figure and ground are defined by the "set" given in 
the instructions (Werner, 1949), Thus, it is assumed 
that when the examiner stresses that the subject report 
the central diagonal word, this part of the stimulus 
field becomes the figure or the goal toward which 
activity is directed. It is on this basis that the diagonal 
word is termed figure and the surrounding words 
ground. 

Figure and ground words were so combined that 
there was no duplication of consonants in the figure 
and ground for any given trial. For example, if the 
ground word contained the consonants ^s," ^g," and 
“r,” the experimenter selected a word for the figure 
which contained none of these letters. Such a procedure, 
however, could not be followed for vowels. 

The following are possible responses to the example 
given above (ground = “sugar,” figure = “apple”). 


1. “anger” G (ground intrusion: conso- 
nants “r” and “g” in the 
ground as well as in the in- Incorrect 
correct response to figure) Responses 
2. “avid” X (incorrect response to 
figure in which no con- 
sonants of the ground ap- 
Pear) 
3. “missed that NR (no response) 
one? 
4. “apple” C (Correct response) 


Two measures were derived from these responses. 
The first is %Inc (number of incorrect responses di- 
vided by the total number of figure-ground presenta" 


*The preparatory time of 2 seconds was based 0n 
the study by Huston, Shakow, and Riggs (1937) E 
which it was found that schizophrenic subjects tend 4 
to take longer periods of time to reach their peak 0 
readiness to respond than did normal subjects. A prep- 
aratory time of 2 seconds did not appear to p 
one group more than the other. 
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TABLE 2 
Ficure-GrounD ÁVOIDANCE COMPARISONS 


Figure Ground | Figure | Ground | Avoidance measures 


Avoidance of figure: 
figure varies, ground 
constant 

Avoidance of ground: 
figure constant, 
ground varies 

neutral affective | Extreme avoidance 

comparison 


affective | neutral | neutral) neutral 


neutral affective | neutral] neutral 


affective | neutral 


tions). This score is considered an index of general 
adequacy of figure-ground differentiation. 

The second measure is based on the ground intrusion 
response which represents the degree to which ground 
content is present in the response to figure. It should 
be emphasized at this point that all subjects were given 
the same figure-ground stimuli. Depending on the 
subject’s Luria responses, however, a given figure- 
ground situation might be affective-figure:neutral- 
ground for one subject, and neutral-figure:affective- 
ground for another subject. For this reason, scores are 
reported as percentages, i.e., %G, the number of ground 
intrusions in a given classification divided by the total 
number of presentations of that classification. 


Criteria of Avoidance 


Since the words used in figure or ground were 
classified as either affective or neutral for each subject, 
the following classification of figure-ground stimuli is 
possible; neutral-figure:neutral-ground, neutral-figure: 
affective-ground, affective-figure:neutral-ground, and 
aflective-figure:affective-ground. The first three of 
these Classifications, as seen in Table 2, were used in 
assessing avoidance. 

Avoidance was defined as either larger %G under 
afective-figure:neutral-ground conditions than under 
heutral-figure:neutral-ground conditions, smaller %G 
Under neutral-figure:affective-ground conditions than 
ids neutral-figure:neutralground conditions, or 
n %G under affective-figure:neutral-ground con- 

tions than under neutral-figurezaffective-ground 
conditions, 


RESULTS 


gue thin group comparisons of %G under 

€ various figure-ground situations are 
Jum in Table 3. It may be seen that for 
= Concrete group, there is a larger %G 
d er affective-figure:neutral-ground condi- 
lons than under neutral-figure:neutral-ground 
ee a smaller %G under neutral- 
nare raffective-ground conditions than under 
A al figure :neutral-ground conditions, and 

arger %G under affective-figure:neutral- 
oe conditions than under neutral-figure: 

€ctive-ground conditions. These data con- 
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firm the predictions that pathologically 
concrete schizophrenic subjects focus on 
ground aspects of the situation under affective 
figure conditions and focus on figure aspects 
of the situation under affective ground con- 
ditions. Accordingly, it may be inferred that 
the findings support the hypothesis than an 
avoidance of affective stimuli is evidenced in 
the figure-ground differentiation of pathologi- 
cally concrete schizophrenics when figure 
and ground are varied in terms of affective 
and neutral valence. 

The prediction that such avoidant behavior 
is more characteristic of the performance of 
the Concrete group than either the Con- 
ceptualizing or the Normal groups is also 
supported by the present findings. As may 
be seen in Table 3, there are no significant 
differences between the various figure-ground 
conditions in the Normal group. On the other 
hand, one may see a pattern of relationships 
in the performance of the Conceptualizing 
group opposite to that present in the Concrete 
group. Thus, for this group, there is a larger 
%G under neutral-figure:affective-ground con- 
ditions than under neutral-figure:neutral- 
ground conditions, a smaller %G under 
affective-figure:neutral-ground conditions than 
under neutral-fgure:neutral-ground condi- 
tions, and a larger %G under neutral-figure: 
affective-ground conditions than under affec- 
tive-figure:neutral ground conditions. 

In summary, the Concrete group meets 
the criteria of avoidance in the three com- 
parisons. The Conceptualizing group not only 


TABLE 3 
WITHIN-GROUP Comparison or %G 
Group HUS, ?» 
NA<NN +05 
dem ANSNN — «105.05 
NA<AN 02 
izi. NA>NN <.10>.05 
Conceptualizing NANN iG 
NA>AN «.01 
NA<NN >.25 
xe. ANCNN — 525 
NA«AN 2.25 


SAA: affective-figure:affective-ground; AN: affective-figure: 
neutral-ground; NA: neutral-figure:affective-ground; NN: neutral 


figuremeutral-ground. 3 
b The Wilcoxon (1948) T test was employed for the comparisons 


in this table. The significances reported are two-tailed. 
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fails to meet the criteria of avoidance in any 
of the comparisons, but these subjects evidence 
a pattern of behavior opposite to that pre- 
dicted on the basis of avoidance and, conse- 
quently, opposite to that of the Concrete 
group. The Normal group neither meets the 
criteria of avoidance in any of the comparisons 
nor shows behavior patterns opposite to those 
predicted on the basis of avoidance. 

In order to explore further the degree to 
which the three groups differ in avoidance of 
affective stimuli, the three comparisons in- 
volving %G were analyzed for each subject. 
Each subject was given a plus for each of the 
comparisons in which the difference in %G 
conformed to the criterion of avoidance of 
affective stimuli and was given a minus for 
each of the comparisons in which the differ- 
ence in %G did not conform to such a criterion. 
For example, if a given subject had a larger 
%G in the affective-figure:neutral-ground 
situation than in the neutral-figure:neutral- 
ground situation, he was assigned a plus; 
if he had a smaller %G (or equal) under af- 
fective-figure:neutral-ground conditions than 
under neutral-figure:neutral-ground condi- 
tions, he was assigned a minus. A test of 
significance of the incidence of plus and 
minus scores for each of the three comparisons 
was made among the three groups. If one group 
had a significantly greater incidence of plus 
than another group for any given comparison, 
this was taken as indicating that, for this 
particular comparison, the first group showed 
greater avoidance of affective stimuli. Table 
4 shows the pertinent relationships. In the 
three comparisons, the Concrete group showed 
significantly greater avoidance of affective 
stimuli than the  Conceptualizing group. 
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It will also be noted that in one comparison, 
the Normal group showed a greater avoidance 
than the Conceptualizing group. A composite 
avoidance score for each subject was also 
derived by computing a plus percentage 
(plus/plus + minus). It may be seen that 
the Concrete group had the highest and the 
Conceptualizing group the lowest composite 
avoidance scores. The Normal group fell 
between these extremes. 


Controls 


In order to determine whether the obtained 
relationships could be attributed to the opera- 
tion of variables other than those hypothesized, 
the following characteristics were investigated: 
age, educational level, vocabulary level, level 
of cooperation as indicated by Wittenborn 
scores, length of hospitalization, and word 
frequency. As shown in Table 5, there are 
no significant differences among the three 


. groups in age, educational level, Wittenborn 


scores, and no significant difference between 
the schizophrenic groups in length of hospitali- 
zation. The Concrete group is significantly 
lower in vocabulary level than either the 
Conceptualizing group, or the Normal group. 
However, no significant difference obtains 
between the Conceptualizing group and the 
Normal group in vocabulary level. 

In controlling for the higher vocabulary 
performance of the Conceptualizing group; 
the investigator dropped from this group the 
eight subjects with the highest vocabulary 
scores. He then compared this restricted group 
to the Concrete group from which the subject 
with the lowest vocabulary score was omitted. 
The difference in vocabulary level between 
these two restricted groups was not significant. 


TABLE 4 
BETWEEN-GROUP COMPARISON ON AVOIDANCE OF AFFECTIVE STIMULI GROUP COMPARISONS* 
Avoidance criteria SC/SA NC/SA SC/NC 
NN > NA SC > SA (.01) = — 
AN > NN SC > SA (<.01) — S 
AN > NA SC > SA («.01) NC > SA («.01) T. 
Composite* SC > SA (<.01) NC S SA (<.01) SC > NC (.05) 


Note,—The exact method for computing probabilities for 2 X 2 contingency table was employed for the comparisons in this table. 


The significances reported are two-tailed. 


* SC: pathologically concrete schizophrenic group; SA: adequately conceptualizing schizophrenic group; NC: normal control rout” i 


DAA: affective-figure:affective-ground; AN: affective-figure:neutral-ground; NA: neutral-figure:affective-ground; NN: neut" 


figure:neutral-ground. 


© The Mann-Whitney (1947) U test was employed in comparing group on the composite score. The significances reported are two-tailed: 


a 
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TABLE 5 
CHARACTERISTICS OF THE PATHOLOGICALLY CONCRETE SCHIZOPHRENIC, Ap 
CONCEPTUALIZING SCHIZOPHRENIC, AND THE NORMAL CRG E N 
Variable 
i Stanford-Binet | Wittenborn Length of 
G Age Educational | Vocabulary i italizati 
zap eee | eee) severe lll toaber ets |) Gtenber ot 
completed) errors) months) 
M SD SD M | sD | u SD M | sD 
Pathologically concrete schizo- 32.6 | 3.13 | 10.9 | 2.28 | 22. E 
spice S arc 2.9 | 5.36 | 11.9 | 8.62 | 36.6 | 10.7 
lequately conceptualizing schizo- | 29.6 5.66 | 11.9 | 2.08 | 29.2 | 4.28 | 12.2 i? 
phrenic (S-A) (N = 16) ; aE 
Normal Control (N-C) (N = 16) 31.1 | 7.09 | 13.9 | 4.05 | 28.1 | 5.57 | 12.2 | 10.6 = = 
SC/SA 1.40 1.00 3.098 .08 .46 
SC/NC .58 1.97 2.185 .07 = 
SA/NC 64 1.71 .61 .00 — 


pat or beyond the 1% level of significance. 
At the 5% level of significance. 


However, differences in avoidance scores 
between the two restricted groups continue 


to be significant in spite of the smaller number . 


of subjects. 

The experimenter dealt with the problem 
of word frequency by using words that oc- 
curred within the first 6,000 most frequently 
used words (Thorndike & Lorge, 1944), 
rather than socially tabu words having a 
low frequency of occurrence. However, in 
Spite of employing a narrow range of word 
frequency, it is possible that the relationships 
obtained between affectivity in the figure 
words and the phenomenon of ground in- 
trusion may still be accounted for by differ- 
ences in the frequency of occurrence of those 
words judged affective and those words 
judged neutral. The fact that the judgment 
of affective arousal is based on Luria criteria 
for each individual does not meet a “frequency 
criticism” because the question may be raised 
of whether less familiar words might be more 
difficult for the individual to associate to, 
result in disorganization on the Luria, and 
hence be judged affective. In order to investi- 
gate this possibility, a test of significance by 
Means of the x? test of ranks (Wilcoxon, 1948) 
Was made between frequency of occurrence 
of the words used as figure and the judgment 
of affective. arousal by Luria criteria. No signifi- 
| Eu relationship, however, was found be- 
E word frequency and the judgment of 

€ctivity by Luria criteria. The problem of 


word frequency does not exist for the relation- 
ships between background variation in af- 
fectivity and ground intrusion since all words 
in the background were of the same frequency 
of occurrence. Thus, the variable of familiarity 
as defined by the Thorndike-Lorge count can 
be ruled out as accounting for present findings. 
Tt is obvious, however, that individual subject 
differences in familarity with the test words 
are not thereby discounted. 


Discussion 


The results of the present study provide 
support for the experimental hypothesis 
that an avoidance of affective stimuli is 
evidenced in the figure-ground differentiation 
of pathologically concrete schizophrenics. 
It is less clear, however, as to whether the 
findings also provide support for the general 
hypothesis that pathologically concrete think- 
ing in schizophrenia represents an avoidance 
reaction to affective stimuli. It will be re- 
called that the use of the figure-ground per- 
ceptual task is based on the assumption that 
inadequate figure-ground differentiation and 
pathologically concrete thinking can be 
equated. A check on this assumption entails a 
comparison of the three groups with regard 
to the number of incorrect responses in the 
figure-ground task (%Inc). When the three 
groups are compared in over-all performance 
by means of the Mann-Whitney U test, no 
significant differences are present in %Inc. 
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In addition, there are no significant differences 
between groups in %Jnc under any of the 
specific figure-ground conditions. In terms of 
this measure, therefore, there is no confirma- 
tion that pathologically concrete schizo- 
phrenics show less adequate figure-ground 
differentiation than either adequately con- 
ceptualizing schizophrenics or normal in- 
dividuals. The empirical basis for equating 
inadequate figure-ground differentiation and 
pathologically concrete thinking is thereby 
lacking; accordingly, affect avoidance as 
inferred from the schizophrenic’s figure- 
ground differentiation cannot be attributed 
to his pathologically concrete thinking. The 
results may therefore be interpreted as being 
consistent with, rather than as directly sup- 
portive of, the general hypothesis that patho- 
logically concrete thinking in schizophrenia 
represents an avoidant reaction to affective 
stimuli, 

It is possible that a methodological weakness 
such as insufficient sensitivity of a measuring 
instrument can account for the lack of differ- 
ence between groups in adequacy of figure- 
ground differentiation. This consideration 
may be particularly relevant in view of the 
fact that “nontabu” words were employed, 
ie, those occurring within the 6,000 most 
frequently used words (Thorndike & Lorge, 
1944). Of more theoretical import, however, is 
a reconsideration of the assumption that 
inadequate figure-ground differentiation and 
pathological concreteness represent similar 
activities. It will be recalled that the schizo- 
phrenic subjects were selected on the basis of 
cooperation as well as extreme performance 
on tasks of conceptual ability. Shakow (1946) 
in presenting experiments investigating schizo- 
phrenic behavior at different levels of com- 
plexity has concluded that once degree of 
cooperation is controlled, disturbances of 
psychological function are not seen at lower 
levels of complexity such as the pattelar 
tendon reflex, but are only evidenced at more 
complex levels of behavior such as conceptuali- 
zation. The present lack of group differences 
in adequacy of figure-ground differentiation 
suggests the possibility that the figure-ground 
tasks differ from the conceptual tasks in 
terms of degree of complexity, and that lack 
of subordination as such is an insufficient 
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basis for equating inadequate performance 
on these tasks. This reappraisal does not 
negate the significance of the finding that 
pathologically concrete schizophrenics evi- 
dence avoidance of affective stimuli in the 
figure-ground task. It does, however, raise 
the possibility that avoidance of affect arousal 
may take quite different forms as a functiéh 
of the complexity-level of behavior analyzed. 
If conceptual activity is viewed as involving 
the simultaneous coordination of different 
social perspectives (Mead, 1934; Piaget, | 
1950), then the avoidance of interpersonal 
anxiety may necessitate a withdrawal from 
the very activity of conceptualization itself 
because of its immanent interpersongt quality, 
Arieti (1955) has suggested a similar notion 
in his evaluation of the schizophrenic’s per- 
ceptualization of thought as a “retreat from 
reason.” If this were the case, then conceptual 
deficit would be present whether the task and 
its context were designated as affective) 
neutral, social, or impersonal. Perceptual 
differentiation, on the other hand, has been 
interpreted as involving a qualitatively differ- | 
ent type of behavior, viz., a series of sequential 
focusing activities rather than simultaneous 
coordination (Feffer, 1959; Feffer & Goure- 
vitch, 1960; Piaget, 1950). This type of ac 
tivity could well be differentially influenced 
by the locus of affective and neutral stimuli 
In view of these considerations, the equating 
of inadequate figure-ground differentiation | 
and pathologically concrete thought may havé 
also been incorrect in terms of attempting 10. 
fit conceptual thought within an affect avoid- 
ance paradigm operationally appropriate t0 
behavior at a less complex level of organization: 

The inference of affect avoidance in the 
Concrete group should also be considered in. 
relation to the performance of the Com 
ceptualizing group. The difference between 
the two groups in their reaction to affective 
stimuli was not, as anticipated, one of degree 
The adequately conceptualizing schizo- 
phrenics, rather than merely showing less 
avoidance, appeared “drawn” to affective 
stimuli and, in fact, were significantly mol 
extreme in this regard than the normal con 
trols. Although unexpected, the diametrically | 
opposite patterns of affective orientation 
evidenced by the two schizophrenic group? 
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may not be an isolated finding. The pattern 
of differential sensitivity to anxiety provoking 
stimuli termed vigilance and defense (Chodor- 
koff, 1954; Lazarus, Eriksen, & Fonda, 1951), 
phenomenally, at least, is similar to the 
present opposite affective orientations. Also 
pertinent to the present results may be studies 
ima rather different area of investigation: 
physiologic (including autonomic nervous 
system) responsiveness in schizophrenia (King, 
1958; Malmo & Shagass, 1949; Meadow, 
Greenblatt, Funkenstein, & Soloman, 1953; 
Williams, 1953). From these studies, it seems 
reasonable to infer that conceptually adequate 
"reactive" schizophrenics are physiologically 
more responsive than both conceptually 
impaired “process” schizophrenics and nor- 
mals. The pattern of physiologic responsive- 
ness in relation to level of conceptual ability 
tesembles in certain respects the directional 
sensitivity to affective material found in the 
Present study, viz., the adequately con- 
ceptualizing schizophrenic’s heightened posi- 
tive sensitivity, the conceptually impaired 
schizophrenic’s heightened negative sensitivity, 
and the normal individual’s position between 
these extremes. Further studies could well be 
directed toward the nature of the relationship 
between adequacy of conceptual ability, level 
of physiologic responsiveness, and affect 
orientation, not only in schizophrenia but 
also, as the vigilance and defense studies 
Suggest, in normal and neurotic groups as 
well. An important possibility is that lowered 
Physiologic responsiveness in the conceptually 
impaired process schizophrenic represents 
Not a simple response deficiency, but an active, 
ongoing, avoidant reaction as part of his 
generalized withdrawal from interpersonal 
relationships, 


SUMMARY 


The present study is directed toward a 
test of the hypothesis that pathologically 
concrete thinking in schizophrenia represents 
an avoidance reaction to affective stimuli. 
To this end, a figure-ground situation was 
constructed in which figure and ground were 
Varied with regard to affective valence. A 
ground intrusion measure, representing the 
ee of background content in the response 
© figure was devised. The figure-ground 
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differentiation of schizophrenic and normal 
individuals who were independently evaluated 
as to conceptual level, was analyzed with 
regard to relationships between ground in- 
trusions and the locus of affective stimuli 
(whether in figure or in ground). The predic- 
tions were that (a) under affective figure 
conditions pathologically concrete schizo- 
phrenics would focus on ground aspects of 
the situation; (b) under affective ground con- 
ditions pathologically concrete schizophrenics 
would focus on the figure; and (c) these pat- 
terns of behavior would be more characteristic 
of pathologically concrete schizophrenics than 
of either adequately conceptualizing schizo- 
phrenics or normal controls. The results are 
interpreted as supporting these predictions 
and as being consistent with the main hy- 
pothesis. 
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HE present analysis was undertaken in 

an effort to gain insight into the nature 

of the “schizophrenic process" by 
examining the functional covariation of 
symptoms as the psychiatric condition of the 
patient changes. Symptoms that depend upon 
a unitary psychiatric process should change 
together as the condition of the patient im- 
proves or worsens. If schizophrenia is such a 
unitary psychiatric process, one should be 
able to define it in terms of a set of symptoms 
that change together as the schizophrenic 
process develops. Symptoms that are related 
to the schizophrenic process should change 
independently of symptoms that depend 
primarily upon other psychopathologic proc- 
esses, 

Other investigators have attempted to 
characterize differences between psychiatric 
patients by examining differences in symptom 
patterns at one point in time. Eysenck (1956) 
has been concerned with basic dimensions of 
difference between different diagnostic groups 
of patients. Using the multiple discriminant 
function approach, he concluded that **psychot- 

_icism” represents one type of unidimensional 
deviation from normal and that “neuroticism” 
Tepresents another distinctly different type of 
deviation. The symptoms that are useful in 
discriminating neurotics from normals thus 
may not be the same as those which are 
useful in discriminating psychotics from 
normals, 

Lorr (1953) used methods of factor analysis 
to identify basic dimensions of difference 
among individual mental patients. He found 
differences in symptomatology to be multi- 
dimensional, with 11 somewhat independent 
mensions being required for a comprehen- 
ive description of differences among indi- 

| vidual hospitalized patients. It should be noted 

1 Now in Department of Psychology, Kansas State 


University, Manhatten, Kansas. 


that the findings of Lorr and Eysenck need 
not be considered in conflict because different 
types of questions were being asked. Eysenck’s 
approach was concerned with differences be- 
tween diagnostic groups of individuals, 
whereas Lorr was concerned with differences 
among individuals, many of whom belonged 
to the same diagnostic groups. 

In contrast to problems dealt with by other 
investigators, as illustrated by the studies 
mentioned, a distinctly different issue is 
considered in this article. What are the basic 
dimensions of individual difference in change 
in psychiatric symptoms for patients having 
a somewhat homogeneous psychiatric classi- 
fication? 

The methods of factor analysis were ap- 
plied to the matrix of intercorrelations de- 
rived from change scores on 42 items? of the 
psychiatric interview portion of the Multi- 
dimensional Scale for Rating Psychiatric 
Patients (MSRPP) developed by Lorr (1953).* 
The data came from a study by Shawver, 
Gorham, Leskin, Good, and Kabnick (1959) 
in which three matched groups, each composed 
of 40 chronic schizophrenic patients who had 
been maintained on chlorpromazine for a 
period of 6 months or longer, were (a) con- 
tinued on chlorpromazine, (b) shifted to main- 
tenance on reserpine, or (c) shifted to placebo. 

In the previously reported analysis of these 
data, interest was centered on the evaluation 
of differences between the three maintenance 
treatment regimens with the “total morbidity” 

2 The psychiatric interview portion of the MSRPP 
contains 40 numbered items plus two additional sub- 
items, 22a and 27a. 

3In addition to the 42 items from the Lorr scale, 
five global ratings accomplished by ward personnel 
(Shawver, Gorham, Leskin, Good, & Kabnick, 1959) 
were included in the correlation matrix and factor 
analysis. Since these measures represent a completely 
different type of data, and since they were not found 
to correlate significanswith the rotated factors, they 
are not considered further in this report. 
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score from the MSRPP used as the major 
dependent variable. The treatment changes 
resulted in significant deterioration in the 
psychiatric condition of patients from whom 
chlorpromazine was withdrawn. Thus the 
data were thought to be especially suitable 
for the present analysis, which was under- 
taken to provide insight into the nature of 
changes in psychiatric pathology. Since the 
interest was in understanding basic dimen- 
sions of individual difference in psychiatric 
change, differences between drug groups were 
partialed out in order to remove this ex- 
traneous source of variance before the present 
analysis was undertaken. 


TABLE 1 
Loapincs OF 42 ITEMS FROM PSYCHIATRIC 
Interview Portion or MSRPP on 
Ercut CENTROID FACTORS 


Ie | Ile | Me | IVe | Ve | Vie | Viie | Vic 

1. |.55| .33| .30| .06| .06| .13|—.22|—.16 
2. |.16| .10| .41| .07|—.02|—.09|— .08|—.13 
3. |.47| .11| .25| .17| .12|—.06|—.13|—.13 
4. |.39| .25| .22| .06| .15| .18|—.13| .09 
$. |.50|—.24|—.15| .11| .19|—.09| .13|—.12 
6. |.31|—.24| .20|—.25|—.22| .13| .12|—.35 
7. |.27| .18|—.36|—.42| .03—.13|—.05|—.14 
8. |.43| .11.—.26|—.45| .07| .19|—.12)—.21 
9. |.40. .07| .04| .12| .13|—.14| .27| .11 
x —.15| .14|—.28| .21|—.15|—.08 
.20|—.04| .09| .14|—.18|—.09 

—.20| .24| .17|—.05| .26|—.14 

—.21| .11|-.10/|—.03|—.04| .06 

.05|—.35| .14.—.23| .05|—.06 

+22) .35| .20| .16| .14| .06 

—.24| .28| .08.—.19| .13| .15 

—.19| .19| .12—.17| .17| .18 
:09|—.11|—.09|—.10| .10|—.34 

.23|1—.39| .30|—.29|—.22| .17 
—.28|-.15.—.08| .14|—.16| .24 

.27|—.38| .28.—.28|—.31| .15 
.08|—.09|—.34|—.10|—.25| .09 
-36|—.03|—.05/—.03|—.05| .21 

—.22| .05|—.04 .15| .29| .46 

—.15|-.05. .17| .17| .13| .29 

—.38| .12| .17| .03.—.14| .07 

—.23|— .10|—.08|— .14|— .15|— .09 
-14|—.27|—.20| .35|—.08| .24 

—.07| .15| .23| .20| .05 

—.05| .18| .13| .14|—.09 

-46| .20| .11| .15|—.16 

-18|— .11|—.09|—.21|— .04 

—.30|-.17| .05| .20|—.02 

-23) .23| .32| .13| .16 

:27|—.25| .08|—.12|—.06 

-12| .40| .12/|—.28|—.11 

—.27—.21| .11| .27| .05 

—.34,—.09|—.13| .24|—.13 

.03,—.26| .11|—.24|—.10 

+03) .13|—.06|—.20|—.15 

.09.—.06| .17| .07| .13 

17|-.04—.06|—.03| .06 
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TABLE 2 


Loapincs OF 42 ITEMS FROM PSYCHIATRIC 
INTERVIEW PoRTION oF MSRPP on 
Six RorATED ORTHOGONAL 


Facrons 
I I m IV v VI 
1 .73| .10| .08| .15| .00 —. 
2 .37| —.08| .13 —.13| .14 — 03 
3, «52|. .15| | .11| —.04|-- 02] SN 
4 .67| —.20] .03| .04 —.05| .02 
5 .10| .40| .12| .07| —.02| 45 
6. .07| .18| .43 | .13  .30| —.08 
7 —.02| .02| .04  .63| —.07| 09 
8, .15| .13| .12| .63 —.12| 08 
9. .18| ..07| .18| —.19  .33| 87 
10. 09]  .77| .09| .08| .18| —.07 
11 .73|  .18| .12 | .32 —.02  .10 
12. .05| .45| —.03| —.04| .00| 52] 
13. .03| .58| .12| .05| 07] 
14, —.03| .10) .50| .17  .17| 28] 
15. .47| —.05| —.32| —.02 —.13| .23 
16. .10| .47| .10  .02 —.05| .5) 
17. .13| .46| .12| .07  .02| .53 
18. .35| .28| .08  .28| .27 14298 
19. .05| .02| .80  .05| .00 15] 
20. -16| .25| .02  .48| —.10| —.03 
21 .02| .08| .87| .03| .08 -021 
22 .20| .27| .15| .22  .20| =A 
22a. .55 .05| .22 .12 .23 4 8 
23 .05| .15| —.23 | .20 1| .2 
24. .18| .17| .07  .23| —.03| 2 
25, .21| .03| —.27 .32| —.43| .M 
26 .00| .08| —.08  .28| .05 2 
27. :05} .37| .34  .10| .30| —.25 
27a. 25. .43| .07|  .15|. 512] eae 
p .30| .15| .15 —.03| .15| .22 
29 132] 130 —.12 —.40  .02| .25) 
30. 115] .32| —.28  .38| —.30| .10 
31 -02| .00| —.20] .62| .08| M 
32 167 .02| —.12 —.02| —.02| 2 
33 —.07  .82 ..01| .00  .05 d 
34. 11 12  —.02  .10 —.56| 0 
35 —i01  .12 19 .08 .52 06 
36. —.07 108 —.22  .57 H EN 
37 07 73 .28 .08| .12 m 
38. 33| .33  .25| .08| —.08 IL 
39. 47]  .37| .07  .03| .24 p 
40. 15| .52 | .22 .00| .08 
METHOD 


Each of the 120 patients in the study was rated by à 
team of two raters prior to the start of the study ani 
after 3 and 6 months in the study. For each rating 
period, the two independent ratings were combined to 
yield measures with high reliability. Change scores were 
computed for each patient on each of the 42 rating- 
scale items by subtracting the 6 month rating from e ] 
initial rating. The change scores were intercorrelated 
after first being corrected to the separate drug gtouP 
means. The correction procedure was introduced ^ 
that the change factors obtained in the subsequenti 
analysis could be considered as representing primary 
dimensions of change in schizophrenic symptomatology | 
and not primary dimensions of difference jE 
treatment regimens. The centroid method of fac! 
analysis was employed on the matrix of interco: 


DIMENSIONS OF CHANGE IN SCHIZOPHRENIC SYMPTOMS 


TABLE 3 


SYMPTOM CLUSTERS CORRESPONDING TO SIX 
RorATED ORTHOGONAL FACTORS 


Psychoticism 
1. Irrelevance of responses to questions. 
. Inconsistency of thoughts and feelings. 
. Disorientation for time. 
, Elements of speech illogically and inconsistently 
connected. 
. Disorientation for person. 
. Speaks, mutters, mumbles to self. 
. Disorientation for place. 
. Perplexity. 
anoidism 
"Tension. 
. Suspicion of influence. 
Hostility, resentment, bitterness. 
. Unrealistic fears. 
Anxiety over impulses to commit criminal or 
hostile act. 
. Apprehension. 
. Suspicion of persecution. 
. Suspicion of reference. 
Guilt-conversion 
1. Feelings of sinfulness and guilt. 
2. Degree to which physical complaints not based 
upon organic factors. 
3. Use of physical symptoms as defense mechanism. 
4, Concern over health. 
General retardation 
1. Slowed motor reactions. 
2. Reduced amount of speech. 
3. Reduced loudness of speech. 
4. Blocked speech. 
5. Controlled, restrained, inhibited. 
6. Reduced emotional responsiveness. 
Depression 
1. Feelings of inadequacy. 
. Depression. 
3. Suicidal impulses. 
Anxiety and tension 
- Tension. 
2; Hostility, resentment, bitterness. 
3. Anxiety over impulses to commit criminal or 
hostile act. 
4. Apprehension. 


P: 


ONDA wbOrDeBoram gt 


between the change scores; and rotation to an approxi- 
mate orthogonal simple structure was accomplished by 
graphic methods, 


RESULTS 


The factor analysis resulted in a total of 
11 initial unrotated centroid factors. The 
loadings of the 42 MSRPP variables on the 8 
Strongest factors are presented in Table 1, 
Factors 9 through 11 having only minimal 
loadings, 

lt was decided arbitrarily that variables 
With factor loadings of .40 or greater would 
be used in the interpretation of rotated factors. 


nc, Computations were contracted for at the com- 
face from the Corporation for Economic and 
ustrial Research (CEIR), Arlington, Virginia. 
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That this high criterion might result in 
elimination of weak factors was realized; 
however, because the correlation matrix was 
estimated on only 117 degrees of freedom, 
it was considered desirable to restrict interpre- 
tation to only the major sources of common 
variance. Following rotation to an approxi- 
mate orthogonal simple structure, six factors 
were found to have more than two loadings 
exceeding .40 in magnitude. The loadings of 
the 42 rating-scale items on the six rotated 
factors are presented in Table 2. 

Table 3 identifies the content of the rating- 
scale items which were found to have loadings 
of .40 or higher on each of the six rotated 
orthogonal factors. The central concept which 
appears to represent the nature of the common 
variance in each set of items is included as 
the heading for each group of items. Funda- 
mentally, of course, these symptom change 
concepts are defined in terms of the measures 
which correlate highly with them. 


DISCUSSION 


The first basic dimension of change in the 
psychiatric symptomatology of chronic schizo- 
phrenic patients appears to involve mental 
disorganization. Each symptom which changes 
with a change in this primary schizophrenic 
factor is pathognomonic of psychosis. None 
of the symptoms with high loadings on this 
factor have high loadings on any other factor 
obtained in this analysis. No other obtained 
factor contains symptoms which are pathog- 
nomonic of psychosis. Thus it is suggested 
that the mental disorganization factor repre- 
sents the primary psychotic process in the 
chronic schizophrenic patients of this study. 
Factor I is independent of the mental pro- 
cesses represented by the other five factors, 
suggesting that the primary psychotic pro- 
cess can change without : corresponding 

e in the other unrelated” processes. 
HEART be noted that a nufber of nt 

otic symptoms, notably those concern 
DAR e = behavior and motor dis- 
turbances, fail to have high loadings on any 
of the obtained change factors. If these 
symptoms were present in the patients, and 
if they had changed to any noticeable degree, 
then the items dealing with them would be 
expected to load on one of the obtained factors 
or to result in one or more additional factors. 
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There are several plausible explanations for 
the obtained results. The symptoms in ques- 
tion may have had minimal existence in the 
chronic patients; hence little change could be 
noted. Or these symptoms may be more 
stable and less subject to drug change than 
are the other psychotic symptoms. One ex- 
planation may be more reasonable for halluci- 
nations and the other for mannerisms and 
posturing behavior. e 

In any event, only one psychoticism factor 
was obtained for the chronic patients involved 
in this study, and it must remain for future 
studies to determine whether other inde- 
pendent psychoticism factors exist for other 
patient populations. The finding of a unitary 
psychoticism factor appears to agree with 
conclusions advanced by Eysenck (1956) 
that there is a fundamental unitary process 
-associated with the condition of being psy- 
chotic. 

The second basic dimension of change in 
the psychiatric symptomatology of chronic 
schizophrenic patients appears to be related 
to distortion of thought accompanied by anx- 
iety and tension. Although a delusional 
system may be involved, the process is one of 
distortion of the organized thought process 
rather than its disorganization. Since the 
distortion factor is independent of the dis- 
organization factor, the results suggest that 
distortion of reality may occur at any level 
of mental organization; moreover, since 
change scores are being considered, change in 
degree of distortion need not be accompanied 
by change in degree of organization. Inde- 
pendent processes are apparently involved. 
The distortion factor is also independent of 
the other change factors resulting from the 
analysis. 

The third basic dimension of change in 
psychiatric symptomatology in chronic schizo- 
phrenic patients appears to involve a guilt- 
conversion process. The factor is concerned 
with functional somatic complaints which 
accompany or which are accompanied by 
feelings of sinfulness and guilt. It is of some 
theoretical interest to note that changes in 
functional somatic symptoms correlate with 
changes in feelings of sinfulness and guilt, 
especially since these are the only psychologi- 
cal variables which correlate highly with the 
appearance and disappearance of physical 
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symptoms. The fact that these symptoms 
change together suggests that a unitary process 
is involved. This factor appears to involve a 
neurotic process which is present in chronic 
schizophrenic patients but which is unrelated 
to the primary psychotic process. À 

The fourth independent dimension of change 
in the symptomatology of chronic schizo- 
phrenic patients is related to a general retarda- | 
tion process. Patients who are low on this 
factor become slower in movements, speak 
less, speak with reduced volume, and evince 
reduced emotional responsiveness. The fact 
that these several symptoms change together 
suggests that they are dependent upon a 
common underlying process which is possibly 
related to vital energy level. This factor has 
been found to be especially sensitive to drug 
effects, with some ataractic drugs given at 
high doses producing a marked increase in 
general retardation. 

The fifth basic dimension of change in the 
psychiatric symptomatology of chronic schizo- 
phrenic patients appears to represent the! 
mood component of the affective process 
Changes in mood, self-respect, and the oc 
currence of suicidal impulses seem to be 
related to a unitary mental process. It is 
interesting that depression is a psychiatric 
construct which is usually conceived to involve 
changes in mood, retardation, and somatic 
symptomatology; however, in the present 
analysis these three components were found 
to change independently of one another. 

The sixth basic dimension of symptom 
change in chronic schizophrenic patients 55 
somewhat more difficult to interpret than the 
other five because it involves only that part 
of anxiety and tension which is independent 
of distortion in thought processes An Im. 
crease in distortion of reality is accompanied 
by a corresponding increase in anxiety 8D 
tension, but anxiety and tension may increase 
independently of distortion in thought proc 
esses. As with other nonpsychotic primary’ 
processes, anxiety and tension appear % 
neurotic elements in psychotic patients, and 
the symptoms involved may change without 
a corresponding change in primary psychot | 
cism. 

5A final oblique rotation would have yielded an 


"anxiety" factor which was substantially correlated 
with the *paranoidism" factor. 


DIMENSIONS OF CHANGE IN SCHIZOPHRENIC SYMPTOMS 


Factor analysis leads to a conception of the 
psychiatric patient as characterized by a 
limited number of functionally independent 
processes which are reflected in observable 
symptomatology. Each patient is conceptua- 
lized as having a level of functioning related 
to each of the primary processes. It is in- 
teresting to note that the primary dimensions 
of change obtained from the present analysis 
are adequate in describing several “types” of 
patients included in the nomenclature of the 
American’ Psychiatric Association ` Mental 
Hospital Service (1952) classification system. 
Since the disorganization factor represents 
the only truly psychotic factor evident in 
these patients, it is conceived as the core of 
the schizophrenic process. A pure type of 
schizophrenic may evince pathology in this 
area only. The psychotic process may be 
superimposed on a personality pattern re- 
flecting other independent processes. In the 
case where the disorganization process runs 
concurrently with a marked distortion process, 

schizophrenic reaction, paranoid type,” is 
the result. The present results suggest that 
two independent processes, not one, are 
tesponsible for this classification. Patients 
may show a marked distortion process without 
accompanying disorganization. In such a 
case, “paranoid reaction” would be the dassi- 
fication. * 

No special nomenclature is available to 
describe the schizophrenic patient with a 
primary neurotic process centering about 
guilt and somatic complaints. Hypochon- 
driasis, functional somatic disorders, and 
somatic delusions are common early in several 

types” of schizophrenic reactions. The im- 
Portant thing to note here is that over a 
Teasonably short period of only 6-months 
duration, change in this neurotic process is 
uncorrelated with change in the primary 
schizophrenic process. 

The general retardation process, when ex- 
“treme and in association with the mental 
disorganization process, may result in the 
Classification of “schizophrenic reaction, 
catatonic type.” If the retardation is less 
Severe and is accompanied by a marked de- 
Pression of mood, the picture is one of “schizo- 
Phrenic reaction, schizoaffective type.” In 
Ses. Where the mood factor predominates 
and in cases where mood, retardation, and 
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somatic processes coexist, the classification 
may be one of “depressive reaction.” Without 
the point being labored further, one can recog- 
nize how other neurotic and psychotic re- 
action types can be conceived to result from 
various combinations of the independent 
processes. The most interesting thing is that 
such a comprehensive set of processes can 
be identified from change in the symptoma- 
tology of chronic schizophrenic patients and 
that the several processes are independent 
over the period of time considered in this 
study. 

'The fact that the several processes appear 
to be independent over a 6-month observa- 
tion period does not establish their inde- 
pendence in the long-term. Risking pure con- 
jecture for the purpose of elaborating this 
point, one may conceive the schizophrenic 
process as being psychogenic in origin and 
even as resulting directly from one or more 
of the observed neurotic processes. This 
would not require the processes to have a 
short-term relationship. They could operate 
in different media. On the other hand, the 
schizophrenic process may be conceived as 
an entirely separate disease process which 
superimposes itself upon an existing set of 
personality and adjustment processes. The 
several processes may appear to be related 
because disorganization renders the individual 
less capable of coping with the world and 
hence brings the neurotic processes into the 
foreground. It may be only in stabilized, 
chronic psychotic patients that we are able 
to observe the fact that several independent 
processes are involved. 

As a final point to be considered in this 
discussion, it is interesting to examine the 
similarities and differences between the change 
factors obtained in this analysis and the status 
factors obtained by Lorr (1953). The number 
of first order change factors identified in 
this analysis was smaller than the number of 
factors identified by Lorr in analysis of status 
scores. Lorr's factors include ward behavior 
items in addition to those from the psychiatric 
interview; his sample was considerably more 
heterogeneous, and he worked with "'between- 
patient" variance rather than “within- 
patient-over-time” variance. Nevertheless, 
some general agreement is present. The dis- 
organization factor in the present analysis 
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corresponds closely to the “conceptual dis- 
organization” factor identified by Lorr. In 
the present analysis, all three of the “orienta- 
tion" items loaded heavily on this. single 
factor, whereas only "orientation for time" 
was included in the corresponding MSRPP 
factor. The MSRPP factor also includes “false 
beliefs," which would seem more appropriate 
in a “distortion” than in a “disorganization” 
factor. 

The distortion factor obtained in the 
present analysis includes the scales subsumed 
under Lorr's (1953) "paranoid projection" 
factor. In this case, the MSRPP factor de- 
rived from between-patient variance appears 
to be a conceptually purer factor than the 
distortion factor derived from the change 
scores. Apparently, paranoid projection can 
appear in individuals with any characteristic 
level of anxiety (lack of between-patient 
covariance), but an increase in distortion 
results in an increase in the characteristic 
anxiety level which is present. 

The guilt-conversion factor in the present 
analysis appears to correspond directly to 
Lorr's (1953) “hysteric conversion" factor, 
which he has marked as "tentative," with 
the exception that the guilt item does not 
appear in the MSRPP factor. 

The general retardation factor from the 
present study includes items from "retarded 
depression,” "activity," “melancholy agita- 
tion," and "conceptual disorganization" fac- 
tors of the MSRPP. There is considerable 
overlap among the first three of these MSRPP 
factors with several items appearing on 
more than one of them. It appears reasonable 
that a single concept should be involved in 
all three. Similarly, the mood factor in the 
present analysis appears to correspond to 
another common source of variance included 
in the "retarded depression," "activity level," 
and "melancholy agitation" factors from the 
MSRPP. It is concluded here that analysis 
of change scores in these areas provides more 
meaningful and unitary constructs than were 
produced by analysis of status scores. 

No pure anxiety factor was obtained in 
the previous analysis based upon status 
scores. The “melancholy agitation" factor 
obtained by Lorr (1953) contains anxiety 
and tension items, but the present results 
lead to the suggestion that anxiety and tension 
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represent a useful independent construct of 
individual difference in psychiatric patients, | 

The symptom clusters identified in the. 
present analysis are homogeneous and readily 
interpretable in terms of unitary underlying. 
constructs. The method of examining psychi- | 
atric processes through analysis of change. 
scores is recommended. It is recognized that 
this work represents only the initial step in 
the establishing of a reliable and compre- 
hensive description of basic psychia i 
processes which could result from analysis of | 
the covariation of symptom change. The | 
utility of objective psychiatric rating scales, 
such as the one developed by Lorr (1953), | 
is strongly supported by the results. | 


I 
SUMMARY 


Data from a previously reported m 
therapy study involving 120 chronic schizo- | 
phrenic patients were reanalyzed for the | 
purpose of identifying primary dimensions of | 
change in psychiatric symptomatology. Six | 
independent processes were identified through | 
a centroid factor analysis and orthogonal | 
rotation. The processes are identified with 
mental disorganization, distortion of thought | 
processes, guilt-conversion, retardation, de | 
pression in mood, and anxiety. The several | 
processes are independent of one another - 
over a reasonably short period of time in | 
chronic schizophrenic patients. Only the 
disorganization process involves unquestion- | 
ably psychotic symptomatology; it appears 
to be identified withthe primary schizophrenic | 
process. Various subtypes of schizophrenic 
patients can be described in terms of this” 
psychotic process being superimposed upon 4 
personality marked by emphasis of one oF | 
more of the other independent processes 
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RENCE OF FUTURE EVENTS! 
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possibility that a future event that 

would have important consequences 
for them may actually occur. Usually, in 
such a situation, it is possible to take some 
action in preparation for the possible future 
event so that, if it should occur, its impact 
on the person would be more favorable than 
would otherwise have been the case. Such 
preparatory action, however, may involve 
considerable inconvenience and, if the possible 
future event should not occur, the preparatory 
action will have been useless. 
_ Obviously, the more a person in such a 
situation believes that the possible future 
event will really occur, the more willing he 
should be to take action in preparation for it. 
This statement is almost self-evident and, of 
course, is not our concern here. The problem 
we wish to deal with is, indeed, the reverse 
of this, That is, if a person has somehow 
been induced to take preparatory action of a 
tather inconvenient or effortful kind, will 
this, in and of itself, lead him to believe 
that the future event is even more likely to 
occur? 

The theory of dissonance (Festinger, 1957) 
would lead one to give an affirmative answer 
to this question. Consider*the cognitions of a 
Person who has expended considerable effort 
Preparing for some possible future event 
sand who also knows many things that indicate 
to him that the event will not actually occur. 
These two sets of cognitions are dissonant 
with one another. Considering only those 
facts the person knows which indicate that 
the event will not occur, it would follow 
Psychologically that the person would not 
expend effort in preparation for it. Hence, 
the information that he did expend effort in 
Preparation is dissonant with information 
he has that indicates the event will not occur. 

Given this dissonance, then, let us examine 

. how it may be reduced. There are three 
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‘future event, whi 


major ways in which such dissonance can be 
reduced: 

1. The person may somehow be able to 
persuade himself that the preparation was 
really effortless and not at all inconvenient. 

2. The person may find for himself some 
other objectives, in addition to the possible 
“justify” the actions 
he has taken. 

3. The person may persuade himself that 
the possible future event is, indeed, very 
likely to occur. 

To the extent that this last means of re- 
ducing dissonance is employed by people, 
we emerge with the prediction that, after a 
person engages in effortful preparation for a 
future event, he should believe the event is 
even more likely to occur than he had pre- 
viously believed. The present experiment 
was designed to test this derivation under 
controlled laboratory conditions, 


METHOD 


The basic design of the experiment was simple. All 
subjects were given a standard set of instructions con- 
cerning the exact probability of occurrence of a future 
event (that they might have to take an examination). 
One group of subjects, selected at random, was then 
induced to engage in effortful preparation while another 
group was induced to engage in relatively effortless 
preparation. The major dependent variable in which 
we were interested was how likely the subject thought 
it was that the event would actually occur (that she 
herself would have to take the examination). The 


details of the procedure follow. 


Subjects and Volunteering Procedure 


The subjects were 60 female high school students, 
ranging from 14 to 18 years of age, who had volunteered 
to participate in an experiment on “Techniques of 
Studying." à 

esting for volunteers the experimenter said: 

We are doing research on the E à of 
tudying; specifically, we are looking into the guessing 
pene dut seems to be involved when students 

study for tests. . 

Now, here is what we are asking usd to 3o We 
uesting that everyone volunteer for two hours. 

"The ex iment takes one hour for 50% of the sub- 

jects and two hours for the other 50%. I can’t tell 

you now in which group, the one hour or the two hour 

Froup, you will be placed, for this is part of the 


* 
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experiment itself, That is why we are asking everyone 

to volunteer for two hours, although half of you will 

be finished at the end of one hour. 

Thus, before the experiment each subject knew 
that, even though each appointment was for 2 hours, 
50% of the subjects would be finished at the end of 1 
hour. 


Course of the Experiment 


When each subject arrived for the experiment she 
was told the following: 

We are studying the techniques that students use 
to study for exams. We know that in the process of 
studying, students form hypotheses, hunches, or 
guesses about an exam (for example, if there will be 
essays on the exam, what the essays will be about, 
the type of material to be asked, and so on), and 
then they orient their behavior according to their 
guesses. As yet we don’t know the hows or the whys 
or the types of conclusions that students reach and 
their relative accuracy. These are the questions we 
hope to answer in this experiment. 

As I told you before, this experiment takes one 
hour for half of the subjects and two hours for the 
others, We won’t tell you which group you are in 
until we finish the first part of the experiment. We 
have the experiment designed this way so that we 
can get at the answers we want. What we have done 
is to divide the people that have volunteered for 
the experiment into two experimental conditions; 
we have done this on the basis of certain criteria 
(we have access to high school records and other 
things). Now I can’t tell you what these criteria 
are at this point, for we want you to guess about 
them later. In short, however, you have already 
been put in one of the two conditions, so no matter 
what you do now this will not change. 

In one condition the people will take a new, 
highly reliable aptitude test, one developed spe- 

cifically for people above the ages of 14. If you 
happen to be in this condition, the time you will be 
here will be two hours. The other half of the people 
will not take any aptitude test at all; these people 
will be finished in one hour. We had everyone 
volunteer for two hours, so that people wouldn’t 
know which condition they were in. 

So that you will be familiar with the total situa- 
tion, even though you may or may not have to 
take the test, we tell everyone generally about it, 

As I mentioned before, this test that half the 
people that signed up will be taking is a new type 
of aptitude or IQ test, “The Revised Standardized 
Aptitude Test 1958,” It has been specifically de- 
veloped for people above the ages of 14 and has 
proved to be very successful in accurately indicating 
IQ. The test goes on the assumption that the reason 
past IQ tests have been unreliable with people 
above these ages is that through studying specific 
areas, these students have a specialized knowledge 
of the factual information in the area of their con- 
centration and relatively little extensive knowledge 
of the information in other areas. This, you can 
see, leads to noncomparable groups of people and, 
as a consequence, leads to noncomparable measures 

of the aptitudes. In order to get around this problem 
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and to standardize the conditions for all students, 
this test requires that the people taking it be given | 
some specific information beforehand, which will 
give them all the necessary tools to work with the 
questions on the test; this way everyone approaches 
the test with the same information. What the test 
measures is not this information, but rather the 
way that they work with the information that they | 
haye been given. 

In order to establish the “high preparatory effort 
condition," the experimenter said: 

Now in a few minutes I shall give you the “In- | 
formation Sheet.” As you will see, it is crucial for 
taking the test. If you hapen to be one of those 
selected to take it, which you may not be, it is of 
utmost importance that you memorize these defini- 
tions, for the test is not valid if the people do not 
learn them. We are requiring everyone to commit 
these definitions to memory, no matter which con- | 
dition they are in. 

Thus far, it has taken people around 25-30 
minutes to memorize these. You may take as much 
time as you need to learn this material thoroughly, 
for you will not be able to have this sheet later, if 
you happen to be one of those previously designated 
to take the test. E 

After you have memorized these definitions, we 
will give you a short questionnaire and interview; 
then we will tell you in which group you have been 
placed. If you have not been selected to take the | 
test, you can leave then; if you are one of the people 
that have been selected, you will then take the test. 
The “low preparatory effort condition" was es- 

tablished by the following instructions: 

Now in a few minutes I shall give you the “In- 
formation Sheet." As you will see, it is crucial for 
taking the test, if you happen to be one of those 
selected to take it, which you may not be. In any | 
case, we want everyone to briefly look over it. 

You will have a minute or so to look over the 
information, just to get an idea of what it is all 
about. You need not attempt to commit it to memory 
or study it, for you will be able to have the “Informa- 
tion Sheet" with you, if you should be one of the 
people previously designated to take the test. { 

After you have looked over these definitions, 
we will give you a short questionnaire and interview; 
then we will tell you in which group you have been 
placed. If you have not been selected to take the 
test, you can leave then; if you are one of the people 
that have been selected, you will then take the test. 
Each subject was then handed an "Information 

Sheet" containing 25 symbols with arbitrary abstract 
definitions of each symbol. After each subject had 
finished either memorizing the list (high effort com 
dition) or briefly familiarizing herself with it (low 
effort condition), she was asked a number of questions 
which we will describe exactly when we discuss the 
Tesults of the experiment. When the questionnaire ha 
been completed, the true nature of the experiment 
was explained to each subject in detail. Of course, 20 
aptitude test was administered. 

Although each subject was asked not to talk about 
the experiment with others in the school, some talking 
about the details of the experiment did occur. Seve? 
subjects, four from the “high effort” and three from 
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the “low effort” condition, had to be discarded because 
they had heard what the experiment was all about. 
Such subjects, of course, realized beforehand that there 
was no aptitude test to take. Two other subjects were 
discarded (one from each condition) because of mis- 
understanding of instructions and directions. Valid 
data remained for 51 subjects, 26 in the high preparatory 
effort condition and 25 in the low preparatory effort 
condition. 


RESULTS 

Effectiveness of the Manipulation A 

In order to determine whether or not we 
had effectively created a high preparatory 
effort and a low preparatory effort condition, 
we asked all subjects the following question: 

Do you think that the work you had to do 
in preparation was... 


(-3) (-2) (-1) (+1) (+2) (+3) 
very some- slightly slightly somewhat very 
easy what easy difficult difficult diff- 

easy cult 


The mean rating for subjects in the high 
preparatory effort condition was +.7. The 
comparable figure for those in the low prepara- 
tory effort condition was —.9. The difference 
between the two conditions is significant 
beyond the .05 level. In other words, our 
experimental manipulation was successful. 
Those in the high effort condition felt they 
had worked harder in preparation for the 
test than did those in the low effort condition. 


Expectation of Taking the Test 


Let us recall the hypothesis this experiment 
was designed to test. We would expect, from 
considering the theory of dissonance, that, ifa 
person is induced to engage in effortful ac- 
tivity in preparation for a possible future 
event, the person will subsequently believe 
that the event is more likely to occur. If this 
hypothesis is correct, we would expect the 
subjects in the high preparatory effort con- 
dition to believe that they would actually 
have to take the test more than would sub- 
jects in the low preparatory effort condition. 
To obtain the data necessary to test this 
hypothesis, we asked the subjects the follow- 
Ing question: 

Do you believe that you are one of the 


People that have been selected to take the 
test? 


GD G2) GH) 
Certain probably slightly 
likely 


c2. C9 
probably defi. 

nitely 
not 


(-1) 
slightly 
unlikely not 
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The mean rating on this scale for subjects in 
the high effort condition was +1.1; for those 
in the low effort condition the mean rating 
was only +.2. The difference is significant 
at the .01 level (¢ = 2.70). Another way of 
looking at these data is in terms of the number 
of subjects who checked the plus side of the 
scale, indicating that they thought they would 
probably have to take the test. In the high 
effort condition 24 of the 26 subjects (92%) 
checked the plus side of the scale while in the 
low effort condition only 15 out of 25 (6070) 
checked this side of the scale. In other words, 
as a group, the subjects in the low effort 
condition reflect fairly well what they have 
been told, namely, that 50% of them would 
have to take the test. The low effort condition 
subjects divided themselves 60% to 40%, 
which is not very different from an “expected” 
50%. In the high effort condition, however, 
there is clearly a stronger belief that they 
would have to take the test. Since subjects 
were allocated to experimental conditions at 
random, it seems clear that this stronger 
belief in having to take the test is a conse- 
quence of the amount of effort expended in 
preparation. We may confidently conclude 
that the data support the hypothesis. 


Other Measures 


In the post-experimental questionnaire, 
three other questions were asked in an at- 
tempt to obtain control data. That is, these 
questions were asked in order to show that 
the difference we observed in how likely 
subjects thought it was that they would have 
to take the test was a specific difference 
related to dissonance reduction. Let us explain 
a bit more what we mean by this. The notion 
behind the study was that there js dissonance 
between knowing one has expended consider- 
able effort in preparation for a test and 
knowing that the test is not very likely to 
take place. Persuading oneself that the test is, 
indeed, likely to take place obviously reduces 
this dissonance. But we would clearly be 
more confident that the obtained difference 
was due to dissonance reduction if we could 
show that on other measures, which were 
related to the situation but served no dis- 
sonance reducing function, there were no 
differences between the two groups. 

The three questions we used were chosen 
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to be closely related to the testing situation 
which the subjects presumably faced. On 
none of the questions was any difference 
between the two experimental conditions 
apparent. One question concerned how im- 
portant they thought IQ measures were, 
another concerned the perceived difficulty 
of the possible test, and the third asked them 
how anxious they were about possibly having 
to take the test. The means for the two experi- 
mental groups are very close together on each 
question. On importance of IQ tests the means 
are +1.38 and +1.40; on perceived difficulty 
the means are +1.08 and +.96; and for 
anxiety about the test the means are —.23 
and —.40. All the differences are clearly 
negligible. We may then conclude that the 
difference in belief that they would have to 
take the test is not a general difference be- 
tween the conditions but is specific to dis- 
sonance reduction. 


An Incidental Finding 


A recent study by Schachter (1959) re- 
ports a relationship between birth order and 
experienced anxiety when subjects are faced 
with a threatening situation. Specifically, 
Schachter found that when subjects were 
told they were to receive a strong electric 
shock, those who were first born or who were 
only children experienced more anxiety than 
did those who were later born, 

Although this whole issue is not of direct 
relevance to the present experiment, we took 
the opportunity to verify Schachter’s (1959) 
finding in a slightly different context. While 
Schachter used electric shock to produce an 
anxiety arousing situation, his finding should 
hold more generally if it is indeed valid. 
The situation in our experiment may also be 
seen as a threatening and anxiety arousing 
situation. After all, the possibility of taking a 
test does arouse anxiety for many students, 
It is, of course, easy to ask subjects whether 
they are only children, first born or later born, 
and so we collected these data. 

If Schachter’s (1959) finding is valid we 
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would expect birth order to be related: 
responses the subjects gave on th 
that asked them how anxious the 
about the prospect of having to t 
examination. The subjects answ 
question on a scale from —3 "feel 
fortable about it," to +3 “feel very 
fortable about it.” Throwing both 
mental conditions together (there v 
differences at all between the two 
mental conditions), we find that the 
jects who were first born or only ¢ 
show a mean of —.03, just about 
on the scale. There were 18 later born 
and these give a mean of —.98, consid 
in the direction of saying they feel comfoi 
about it. This difference is significant 
the .05 and .10 levels (¢ = 1.7). In othe 
later born subjects do tend to have 
perienced anxiety than first born I 
children. Although the difference is not str 
significant, it provides clear support for: 
ter's findings. It may also be pointed oui 
birth order was found to relate to anxiety on 
and to nothing else that we measur 
this experiment. 


SUMMARY 


An experiment was conducted to 
following hypothesis derived from Festi 
(1957) theory of cognitive dissona 
people exert a great deal of effort (memo! 
a list of symbolic definitions) in prep 
for a possible future event (an int 
test), they should believe more S 
in the likelihood of occurrence of the 
(that they would have to take the test) 
people who do not have to engage in 
preparatory activity. The results of 
periment provide strong support for i 
hypothesis. : 
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PSYCHIATRIC DIAGNOSIS: 
A CRITIQUE! 


EDWARD ZIGLER 
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HE inadequacies of conventional psy- 

chiatric diagnosis have frequently been 

noted (Ash, 1949; Cattell, 1957; 
Eysenck, 1952; Foulds, 1955; Harrower, 
1950; Hoch & Zubin, 1953; Jellinek, 1939; 
King, 1954; Leary & Coffey, 1955; Mehlman, 
1952; Menninger, 1955; Noyes, 1953; Phillips 
& Rabinovitch, 1958; Roe, 1949; Rogers, 
1951; Rotter, 1954; Scott, 1958; Thorne, 
1953; Wittenborn & Weiss, 1952; Wittman & 
Sheldon, 1948). The responses to this rather 
imposing body of criticism have ranged from 
the position that the present classificatory 
system is in need of further refinement 
(Caveny, Wittson, Hunt, & Herman, 1955; 
Foulds, 1955), through steps towards major 
revisions (Cattell, 1957; Eysenck, 1952; 
Leary & Coffey, 1955; Phillips & Rabinovitch, 
1958; Thorne, 1953; Wittman & Sheldon, 
1948), to a plea for the abolishment of all 
"labeling" (Menninger, 1955; Noyes, 1953; 
Rogers, 1951). As other investigators have 
noted (Caveny et al, 1955; Jellinek, 1939), 
this last position suggests that the classifica- 
tory enterprise is valueless. This reaction 
against classification has gained considerable 
Popularity in clinical circles. The alacrity 
with which many clinicians have accepted 
this view seems to represent more than a 
disillusionment with the specific current form 
of _ Psychiatric diagnosis. These negative 
attitudes appear to reflect a belief that diag- 
nostic classification is inherently antithetical 
to such clinically favored concepts as “dy- 
namic,” "idiographic," etc. Thus, a question 
is raised as to whether any diagnostic schema 
Can be of value. Let us initially direct our 
attention to this question. 


On Classification 


The growth among clinicians of sentiment 
against categorization has coincided with a 
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period of critical reappraisal within the 
behavioral sciences generally (Beach, 1950; 
Brower, 1949; Cronbach, 1957; Guthrie, 
1950; Harlow, 1953; Koch, 1951; MacKinnon, 
1953; Marquis, 1948; Rapaport, 1947; Roby, 
1959; Scott, 1955; Tolman, 1953; Tyler, 
1959). This parallel development is more than 
coincidental. The reaction against “labeling” 
can be viewed as an extreme outgrowth of 
this critical self-evaluation, ie. that psy- 
chology’s conceptual schemata are artificial 
in their construction, sterile in terms of their 
practical predictions, and lead only to greater 
and greater precision about matters which 
are more and more irrelevant. It is little 
wonder that in this atmosphere, conceptuali- 
zation has itself become suspect nor that 
Maslow’s (1948) exposition of the possible 
dangers of labeling or naming has been ex- 
tended (Rotter, 1954) as a blanket indictment 
of the categorizing process. 

The error in this extension is the failure to 
realize that what has been criticized is not 
the conceptual process but only certain of its 
products. The criticisms mentioned above 
have not been in favor of the abolishment of 
conceptualization, but have rather been 
directed at the prematurity and rarifications 
of many of our conceptual schemata and our 
slavish adherence to them. Indeed, many of 
these criticisms have been accompanied by 
pleas for lower-order conceptualization based 
more firmly on observational data (Koch, 
1951; MacKinnon, 1953; Tolman, 1953). 

In the clinical area, the sentiment against 
classification has become sufficiently serious 
that several investigators (Cattell, 1957; 
Caveny et al., 1955; Eysenck, 1952; Jellinek, 
1939) have felt the need to champion the 
merits of psychiatric categorization. They 
have pointed out that diagnosis is a basic , 
scientific classificatory enterprise to be viewed 
as essentially the practice of taxonomy, 
which is characteristic of all science. Eysenck 
(1952) puts the matter quite succinctly in his 
statement, "Measurement is essential to 
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science, but before we can measure, we must 
know what it is we want to measure. Qualita- 
tive or taxonomic discovery must precede 
quantitative measurement” (p. 34). 

Reduced to its essentials, diagnostic classifi- 
cation involves the establishment of categories 
to which phenomena can be ordered. The 
number of class systems that potentially 
may be constructed is limited only by man’s 
ability to abstract from his experience. The 
principles employed to construct such classes 
may be inductive, deductive, or a combina- 
tion of both, and may vary on a continuum 
from the closely descriptive to the highly 
abstract. 

Related to the nature of the classificatory 
principle are the implications to be derived 
from class membership. Class membership 
may involve nothing more than descriptive 
compartmentalization, its only utility being 
greater ease in the handling of data. Ob- 
versely, the attributes or correlates of class 
membership may be widespread and far- 
reaching in their consequences. The originators 
of a classificatory schema may assert that 
specified behavioral correlates accompany 
class membership. This assertion is open to 
test. If the hypothesized correlates represent 
the full heuristic value of the diagnostic 
schema and class membership is found not to 
be related to these correlates, then revision or 
discard is in order. A somewhat different type 
of problem may also arise. With the passage of 
time, correlates not originally related to the 
schema may erroneously be attributed to 
class membership. Nevertheless, the original 
taxonomy may still possess a degree of rele- 
vance to current objectives in a discipline. 
In these circumstances, its maintenance may 
be the rational choice, although a clarification 
and purification of categories is called for. 
The relationship of the two problems outlined 
here to the criticism of contemporary psy- 
chiatric diagnosis will be discussed later. 
What should be noted at this point is that the 
solution to neither problem implies the abolish- 
ment of the attempt at classification. 

Another aspect of taxonomy is in need of 
clarification. When a phenomenon is assigned 
to a class, certain individual characteristics 
of that phenomenon are forever lost. No two 
class members are completely identical. 
Indeed, a single class member may be viewed 
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as continuously differing from itself over time, | 
It is this loss of uniqueness and an implied 
unconcern with process that have led many | 
clinicians to reject classification in principle, 
While classificatory schemata inevitably in- 
volve losses of this type, it must be noted 
that they potentially offer a more than com- 
pensatory gain. This gain is represented in 
the significance of the class attributes and | 
correlates. Class membership conveys in- 
formation ranging from the descriptive simi- 
larity of two phenomena to a knowledge of 
the common operative processes underlying 
the phenomena. 

A conceptual system minimizes the afore- | 
mentioned loss to the extent that only irrele- | 
vant aspects of a phenomenon are deleted in 
the classificatory process. The implicit as 
sumption is made that what is not class 
relevant is inconsequential. The dilemma, of 
course, lies in our lacking divine revelation 
as to what constitutes inconsequentiality, 
It is this issue which lies at the heart of the | 
idiographic versus nomothetic controversy 
(Allport, 1937, 1946; Beck, 1953; Eysenck, | 
1954; Falk, 1956; Hunt, 1951a, 1951b; Skaggs, | 
1945, 1947). The supporters of the idiographi¢ | 
position (Allport, 1937; Beck, 1953) have 
criticized certain conceptual schemata for 
treating idiosyncratic aspects of behavior as 
inconsequential when they are in fact pertinent 
data which must be utilized if a comprehensive 
and adequate view of human behavior is to 
emerge. However, the idiographic position i$ 
not a movement toward the abolishment of 
classification, a fact emphasized by Allport 
(1937) and Falk (1956). Rather, it represents à 
plea for broader and more meaningful classifi- 
tory schemata, 

A conceptually different type of argument 
against the use of any diagnostic classifica; 
tion has been made by the adherents of 
nondirective psychotherapy (Patterson, 1948; 
Rogers, 1946, 1951). This position has ad 
vanced the specific contention that differential 
diagnosis is unnecessary for, and perhaps | 
detrimental to, successful psychotherapy: | 
This attitude of the nondirectivists has beet 
interpreted (Thorne, 1953) as an attack 0n 
the entire classificatory enterprise. To argue 
against diagnosis on the grounds that it affects | 
therapeutic outcome is to confuse diagnos! | 
as an act of scientific classification with the 
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present clinical practice of diagnosis with its 
use of interviewing, psychological testing, etc. 
The error here lies in turning one’s attention 
away from diagnosis as an act of classification, 
a basic scientific enterprise, and attending 
instead to the immediate and prognostic 
consequences of some specific diagnostic 
technique in a specific therapeutic situation, 
ie, an applied aspect. To reject the former 
on the basis of the latter would appear to be 
an unsound decision. 

Although the nondirectivists’ opposition to 
diagnosis seems to be based on a confusion 
between the basic and applied aspects of 
classification, implicitly contained within 
their position is a more fundamental argument 
against the classificatory effort. Undoubtedly, 
diagnosis both articulates and restricts the 
range of assumptions which may be enter- 
tained about a client. However, the philosophy 
of the nondirectivist forces him to reject any 
theoretical position which violates a belief 
in the unlimited psychological growth of the 
client. It would appear that this position 
represents the rejection, in principle, of the 
view that any individual can be like another 
in his essential characteristics, or that any 
predictable relationship can be established 
between a client's current level of functioning 
and the ends which may be achieved. In the 
setting of this assumption, a transindividual 
classificatory schema is inappropriate. There 
is no appeal from such a judgment, but one 
should be cognizant that it rejects the essence 
of a scientific discipline. If one insists on 
operating within the context of a predictive 
psychology, one argues for the necessity of a 
Cassificatory system, even though particular 
diagnostic schemata may be rejected as ir- 
relevant, futile, or obscure. 

Let us now direct our discussion toward 
Some of the specific criticisms of conventional 
Psychiatric diagnosis—that the categories 
employed lack homogeneity, reliability, and 
validity. 


Homogeneity 


‘ A criticism often leveled against the con- 
emporary diagnostic system is that its cate- 
_8etles encompass heterogeneous groups of 
individuals, i.e., individuals varying in respect 
to symptomatology, test scores, prognosis, etc. 
King, 1954; Rotter, 1954: Wittenborn, 
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1952; Wittenborn & Bailey, 1952; Wittenborn 
& Weiss, 1952). Contrary to the view of one 
investigator (Rotter, 1954), a lack of ho- 
mogeneity does not necessarily imply a lack 
of reliability. King (1954) has clearly noted 
the distinction between these two concepts. 
Reliability refers to the agreement in as- 
signing individuals to different diagnostic 
categories, whereas homogeneity refers to 
the diversity of behavior subsumed within 
categories. While the two concepts may be 
related, it is not difficult to conceptualize 
categories which, though quite reliable, sub- 
sume diverse phenomena. 

King (1954) has argued in favor of con- 
structing a new diagnostic classification 
having more restrictive and homogeneous 
categories. He supports his argument by 
noting his own findings and those of Kantor, 
Wallner, and Winder (1953), which have 
indicated that within the schizophrenic group 
subcategories may be formed which differ in 
test performance. King found further support 
for the construction of new and more ho- 
mogeneous diagnostic categories in a study 
by Windle and Hamwi (1953). This study 
indicated that two subgroups could be con- 
structed within a psychotic population which 
was composed of patients with diverse psy- 
chiatric diagnoses. Though matched on the 
distribution of these diagnostic types, the 
subgroups differed in the relationship ob- 
tained between test performance and prognosis. 
On the basis of these studies, King suggests 
that the type of homogeneous categories he 
would favor involves such classificatory 
dichotomies as reactive versus process schizo- 
phrenics and chronic versus nonchronic 
psychotics. ro 

An analysis of King's (1954) criticism of 
the present diagnostic system discloses certain 
difficulties. The first is that King's heteroge- 
neity criticism does not fully take into con- 
sideration certain basic aspects of classifi- 
cation. A common feature of classificatory 
systems is that they utilize classes which 
contain subclasses. An example drawn from 
biology would be a genus embracing a number 
of species. If schizophrenia is conceptualized 
as a genus, it cannot be criticized on the 
grounds that all its members do not share a 
particular attribute. Such a criticism would 
involve a confusion between the more specific 
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attributes of the species and the more general 
attributes of the genus. This is not to assert 
that schizophrenia does in fact possess the 
characteristics of a genus. It is, of course, 
possible that a careful analysis will reveal 
that it does not, and the class schizophrenia 
will have to be replaced by an aggregate of 
entitites which does constitute a legitimate 
genus. However, when a genus is formulated, 
it cannot be attacked because of its heteroge- 
neous nature since genera are characterized 
by such heterogeneity. 

A more serious difficulty with King’s (1954) 
heterogeneity criticism lies in the inherent 
ambiguity of a homogeneity-heterogeneity 
parameter. To criticize a classificatory system 
because its categories subsume heterogeneous 
phenomena is to make the error of assuming 
that homogeneity is a quality which inheres in 
phenomena when in actuality it is a construc- 
tion of the observer or classifier. In order to 
make this point clear, let us return to King's 
argument, What does it mean to assert that 
chronic psychosis is an example of an homo- 
geneous class, while schizophrenia is an 
example of an heterogeneous one? In terms 
of the descriptively diverse phenomena en- 
compassed, the latter would appear to have 
the greater homogeneity. The statement only 
has meaning insofar as a particular correlate— 
for instance, the relationship of test score to 
prognosis—is shared by all members of one 
class but not so shared by the members of the 
other class. Thus, the meaningfulness of the 
homogeneity concept is ultimately dependent 
on the correlates or attributes of class member- 
Ship or to the classificatory principle related 
to these correlates or attributes. The intimacy 
of the relationship between the attributes of 
Classes and the classificatory principle can 
best be exemplified by the extreme case in 
which a class has but a single attribute, and 
that attribute is defined by the classificatory 
principle, e.g., the classification of plants on 
the basis of the number of stamens they 
possess. "Therefore, the heterogeneity criticism 
of a classificatory system is nothing more 
than a plea for the utilization of a new classi- 
ficatory principle so that attention may be 
focused on particular class correlates or at- 
tributes not considered in the original schema. 
While this plea may be a justifiable one, 
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depending on the significance of the ney | 
attributes, it has little to do with the hg. | 
mogeneity, in an absolute sense, of phenomena, 
Indeed, following the formulation of a ney 
classificatory schema, the heterogeneity criti 
cism could well be leveled against it by the 
adherents of the old system, since the phe 
nomena encompassed by the new categories 
would probably not be considered homogeneous 
when evaluated by the older classificatory 
principle. 

Although differing in its formulation, the 
heterogeneity criticism of present psychiatric | 
classification made by Wittenborn and his 
colleagues (Wittenborn, 1952; Wittenborn & 
Bailey, 1952; Wittenborn & Weiss, 1952) 
suffers from the same difficulties as dos 
King's (1954) criticism. Wittenborn's findings 
indicated that individuals given a common| 
diagnosis showed differences in their symptom 
cluster score profiles based on nine symptom 
clusters isolated earlier by means of factor| 
analytic techniques (Wittenborn, 1951; Wit 
tenborn & Holzberg, 1951). It is upon tht 
existence of these different profiles within a 
diagnostic category that Wittenborn bases 
his heterogeneity criticism. Here again the 
homogeneity-hetorogeneity distinction is only | 
meaningful in terms of an independent) 
criterion, a particular symptom cluster score 
profile. Had it been discovered that all in- 
dividuals placed into a particular diagnostic 
category shared a common symptom cluster 
score profile, then this category would be 
described as subsuming homogeneous phê 
nomena, But the phenomena—the symptoms 
mirrored by the symptom profile—are not 
homogeneous in any absolute sense because 
the pattern of symptoms may involve the 
symptoms in descriptively diverse symptom 
clusters. Thus, the homogeneity ascribed 10 
the category would refer only to the fact that 
individuals within the category homogeneously 
exhibited a particular pattern of descriptively 
diverse behaviors. However, the organization | 
of symptoms mirrored by the symptom clustet 
profiles is not in any fundamental sense differ 
ent from that observed in conventional diag- | 
nostic syndromes. Both methods of categoriza- - 
tion systematize diverse behaviors becaus 
of an observed regularity in their concurrent 
appearance. 
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The difference between these two ap- 
proaches, then, lies only in the pattern of 
deviant behaviors that define the categories. 
Indeed, Eysenck (1953) has noted that both 
the clinician and the factor analyst derive 
syndromes in essentially the same manner, De. 
in terms of the observed intercorrelations 


. of various symptoms. It is the difference in 


method, purely observational versus statistical, 
that explains why the final symptom structure 
may differ. The assumption must not be 
made that the advantage lies entirely with 
the factor analytic method. The merit ac- 
cruing through the greater rigor of factor 
analysis may be outweighed by the limitations 
imposed in employing a restricted group of 
symptoms and a particular sample of patients. 
Thus, the factor analyst cannot claim that 
the class-defining symptom pattern he has 
derived is a standard of homogeneity against 
which classes within another schema can be 
evaluated. The plea that symptom cluster 
scores, derived from factor analytic tech- 
niques, substitute for the present method of 
psychiatric classification has little relevance 
to the heterogeneity issue. 

In the light of this discussion we may con- 
clude that the concept of homogeneity has 
little utility in evaluating classificatory 
schemata. Since the heterogeneity criticism 
invariably involves an implicit preference for 
one classificatory principle over another, it 
would perhaps be more fruitful to dispense 
entirely with the homogeneity-heterogeneity 
distinction, thus, allowing us to direct our 
attention to the underlying problem of the 
relative merits of different classificatory 
principles. 

Reliability and Validity 

A matter of continuing concern has been 
the degree of reliability of the present diag- 
Dostic system. Considerable energy has been 
expended by both those who criticize the 
Present system for its lack of reliability 
(Ash, 1949; Boisen, 1938; Eysenck, 1952; 
Mehlman, 1952; Roe, 1949; Rotter, 1954; 
Scott, 1958) and those who defend it against 
He criticism (Foulds, 1955; Hunt, Wittson, 

Hunt, 1953; Schmidt & Fonda, 1956; 


~ Seeman, 1953). Certain investigators (Foulds, 


1955; Schmidt & Fonda, 1956) who have 


611 


offered evidence that the present system is 
reliable have also pointed out that the earlier 
studies emphasizing the unreliability of 
psychiatric diagnosis have suffered from 
serious conceptual and methodological diffi- 
culties. 

In evaluating the body of studies con- 
cerned with the reliability of psychiatric 
diagnosis, one must conclude that so long as 
diagnosis is confined to broad diagnostic 
categories, it is reasonably reliable, but the 
reliability diminishes as one proceeds from 
broad, inclusive class categories to narrower, 
more specific ones. As finer discriminations 
are called for, accuracy in diagnosis becomes 
increasingly difficult. Since this latter char- 
acteristic appears to be common to the 
classificatory efforts in many areas of knowl- 
edge, it would appear to be inappropriate to 
criticize psychiatric diagnosis on the grounds 
that it is less than perfectly reliable. This 
should not lead to an underestimation of the 
importance of reliability. While certain 
extraclassificatory factors, e.g. proficiency of 
the clinicians, biases of the particular clinical 
settings, etc., may influence it, reliability is 
primarily related to the precision with which 
classes of a schema are defined. Since the 
defining characteristic of most classes in 
psychiatric diagnosis is the occurrence of 
symptoms in particular combinations, the 
reliability of the system mirrors the specificity 
with which the various combinations of 
symptoms (syndromes) have been spelled 
out. It is mandatory for a classificatory 
schema to be reliable since reliability refers 
to the definiteness with which phenomena 
can be ordered to classes. If a system does 
not allow for such a division of phenomena, 
it can make no pretense of being a classificatory 


While reliability is a prerequisite if the 
diagnostic system is to have any value, it 
must not be assumed that if human effort 
were to make the present system perfectly 
reliable, it could escape all the difficulties 
attributed to it. This perfect reliability would 
only mean that individuals within each class 
shared a particular commonality in relation 
to the classificatory principle of symptom 
manifestation. If one were interested in 
attributes unrelated or minimally related to 


612 


the classificatory principle employed, the 
perfect reliability of the system would offer 
little cause for rejoicing. Perfect reliability 
of the present system can only be the goal 
of those who are interested in nothing more 
than the present classificatory principle and 
the particular attributes of the classes con- 
structed on the basis of this principle. 

When attention is shifted from characteris- 
tics which define a class to the correlates of 
class membership, this implies a shift in con- 
cern from the reliability of a system to its 
validity. The distinction between the reliability 
and validity of a classificatory system would 
appear to involve certain conceptual diffi- 
culties, It is perhaps this conceptual difficulty 
which explains why the rather imposing body 
of literature concerned with diagnosis has been 
virtually silent on the question of the validity 
of the present system of psychiatric diagnosis. 
Only one group of investigators (Hunt, 1951; 
Hunt, Wittson, & Barton, 1950a, 1950b; Hunt, 
Wittson, & Hunt, 1953; Wittson & Hunt, 
1951) has specifically been concerned with the 
predictive efficacy of diagnoses and, thus, to 
the validity of psychiatric classifications; and 
even in this work, the distinction between 
validity and reliability is not clearly drawn. 

In order to grasp the distinction between 
the reliability and the validity of a classifica- 
tory schema, one must differentiate the de- 
fining characteristics of the classes from the 
correlates of the classes. In the former case, 
we are interested in the principles upon which 
classes are formed; in the latter, in the predic- 
tions or valid statements that can be made 
about phenomena once they are classified. The 
difficulty lies in the overlap between the 
classifying principles and the class correlates. 
Tf a classificatory system is reliable, it is also 
valid to the extent that we can predict that 
the individuals within a class will exhibit 
certain characteristics, namely, those behaviors 
or attributes which serve to define the class. 

It is the rare class, however, that does not 
connote correlates beyond its defining char- 
acteristics. The predictions associated with 
class membership may vary from simple 
extensions of the classificatory principles to 
correlates which would appear to have little 
connection with these principles. Let us 
examine a simple illustration and see what 
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follows from categorizing an individi 
an individual has been classified as m 
a manic-depressive reaction, depre 
on the basis of the symptoms of de 
of mood, motor retardation, and 
(American Psychiatric Association, 1952 
prediction may be made that the indi 
will spend a great deal of time in bed, | 
represents an obvious extension of thi 
tom pattern. One may also hypoth 
the patient will show improvement if el 
shock therapy is employed. This is a cı 
which has little direct connection will 
symptoms themselves. These predicti 
open to test, and evidence may or may n 
found to support them. Thus, meas 
validity may be obtained which are i 
dent of the reliability of the system 
classification. 

The problem of validity lies at the 
the confusion which surrounds psychii 
diagnosis. When the present diagnostic 
is assailed, the common complaint is that 
membership conveys little information b 
the gross symptomatology of the patient 
contributes little to the solution of the pi 
problems of etiology, treatment proce 
prognosis, etc. The criticism that class mt 
bership does not predict these import 
aspects of a disorder appears to be a legitin 
one. This does not mean the present s 
has no validity. It simply indicates 
system may be valid in respect to c! 
correlates but invalid in respect to © 
Much confusion would be dispelled if as 
care were taken in noting the existing CO! 
of classes as is taken in noting the classi 
principles. A great deal of effort has go! 
the formalization of the defining charac! 
tics of classes (American Psychiatric 
tion, 1952), but one looks in vain for a 
delineation of the extraclassificatory attri 
and correlates of class membership. As a T 
the various diagnostic categories have 
burdened with correlates not systemati 
derived “from a classificatory principle b 
which were attributed to the classes beca 
they were the focal points of clinical in 
A major question is just what correlate 
justifiably be attributed to the class categ 
To answer this question we must 
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attention to the purposes and philosophy 
underlying contemporary psychiatric diagnosis. 
Philosophy and Purpose of Conventional 
Diagnosis 

The validity of the conventional diagnostic 
system is least ambiguous and most free 


‘from potential criticism as a descriptive 


schema, a taxonomy of mental disorders 
analogous to the work of Ray and Linnaeus 
in biology. In this sense, class membership 
confirms that the inclusion of an individual 
within a class guarantees only that he exhibit 
the defining characteristics of that class. Only a 
modest extension of this system, in terms of a 
very limited number of well established cor- 
relates, makes for a system of impressive 
heuristic value, even though it falls consider- 
ably short of what would now be considered 
an optimal classificatory schema. As has been 
noted (Caveny et al., 1955; Hunt et al., 1953), 
the present diagnostic system is quite useful 
when evaluated in terms of its administrative 
and, to a lesser extent, its preventive implica- 
tions. Caveny et al. (1955) and Wittenborn, 
Holzberg, and Simon (1953) should be con- 


sulted for a comprehensive list of such uses, 


but examples would include legal determina- 
tion of insanity, declaration of incompetence, 
type of ward required for custodial care, census 
figures and statistical data upon which con- 
siderable planning is based, screening devices 


. for the military services or other agencies, etc. 


In view of the extensive criticism of contem- 
porary diagnosis, the surprising fact is not 
that so few valid predictions can be derived 
from class membership, but that so many can. 

The value of the present psychiatric classi- 
fication system would be further enhanced by 
its explicit divorcement from its Kraepelinian 
heritage by an emphasis on its descriptive 
aspect and, through careful empirical investiga- 
tion, the cataloging of the reliable correlates 
of its categories. That this catalog of correlates 
Would be an impressive one is expressed in 
Hoch's (1953) view that the present system is 
superior to any system which has been evolved 
to replace it. It is an open question whether 
the system merits this amount of praise. In 
Beneral however, the defense of the present 
System—or, for that matter, diagnosis in 
general (Caveny et al, 1955; Eysenck, 1952; 
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Hunt et al, 1953; Jellinek, 1939)—tends to 
rest on the merits of its descriptive, empirical, 
and nondynamic aspects. 

The present classificatory system, even as a 
purely descriptive device, is still open to a 
certain degree of criticism. Its classificatory 
principle is organized primarily about symp- 
tom manifestation. This would be adequate 
for a descriptive system if this principle were 
consistently applied to all classes of the schema 
and if the symptoms associated with each 
diagnostic category were clearly specified. 
There is some question, however, whether 
the system meets these requirements (Phillips 
& Rabinovitch, 1958; Rotter, 1954). The 
criticism has been advanced that the present 
system is based on a number of diverse prin- 
ciples of classification, Most classes are indeed 
defined by symptom manifestation, but the 
organic disorders, for example, tend to be 
identified by etiology, while such other factors 
as prognosis, social conformity, etc. are also 
employed as classificatory principles. This does 
not appear, however, to be an insurmountable 
problem, for the system could be made a com- 
pletely consistent one by explicitly defining 
each category by the symptoms encompassed, 
The system would appear to be eminently 
amenable to the unitary application of this 
descriptive classificatory principle, for there 
are actually few cases where classes are not so 
defined, Where reliable relations between the 
present categories and etiology and prognosis 
have been established, these also could be 
incorporated explicitly within the system. 
Etiology and prognosis would be treated not 
as inherent attributes of the various classifica- 
tions, but rather as correlates of the particular 
classes to which their relationship is known, 
They would, thus, not be confounded with the 
classificatory principle of the system. 

This course of action would satisfy the 
requirement of consistency in the application 
of the classificatory principle. A remaining 
area of ambiguity would be the lack of agree- 
ment in what constitutes a symptom. In 
physical medicine, a clear distinction has been 
made between a symptom, which is defined as 
a subjectively experienced abnormality, and a 
sign, which is considered an objective indica- 
tion of abnormality (Holmes, 1946). This 
differentiation has not, however, been extended 
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to the sphere of mental disorders. A source of 
difficulty may lie in the definition of what is 
psychologically abnormal. In psychiatric ter- 
minology, symptoms include a wide range of 
phenomena from the grossest type of behavior 
deviation, through the complaints of the 
patient, to events almost completely inferential 
in nature. One suggestion (Yates, 1958) has 
been to eliminate the term “symptom” and 
direct attention to the manifest responses of 
the individual. This suggestion appears to be 
embodied in the work of Wittenborn and his 
colleagues (Wittenborn, 1951, 1952; Witten- 
born & Bailey, 1952; Wittenborn & Holzberg, 
1951; Wittenborn et al., 1953; Wittenborn & 
Weiss, 1952). Wittenborn’s diagnostic system, 
in which symptoms are defined as currently 
discernible behaviors, represents a standard 
of clarity for purely descriptive systems of 
psychiatric classification. This clarity was 
achieved by clearly noting and limiting the 
group of behaviors which would be employed 
in the system. But even here a certain amount 
of ambiguity remains. The number of responses 
or discernible behaviors which may be con- 
sidered for inclusion within a diagnostic 
schema borders on the infinite. The question 
arises, then, as to how one goes about the selec- 
tion of those behaviors to be incorporated in 
the classificatory system. Parsimony demands 
that only “meaningful” items of behavior be 
chosen for inclusion, and this selective principle 
has certainly been at work in the construction 
of all systems of diagnosis, In this sense, the 
present method of psychiatric classification is 
not a purely descriptive one, nor can any 
classification schema truly meet this criterion 
of purity. Meaning and utility inevitably 
appear among the determinants of classifica- 
tory systems. 

Several investigators (Cameron, 1953; 
Jellinek, 1939; Magaret, 1952) have stressed 
the inappropriateness of discussing diagnosis 
in the abstract, pointing out that such a dis- 
cussion should center around the question of 
"diagnosis for what?" Indeed, a diagnostic 
system cannot be described as “true” or 
"false," but only as being useful or not 
useful in attaining prescribed goals. Therefore, 
when a system is devised, its purposes should 
be explicitly stated so that the system can be 
evaluated in terms of its success or failure in 


EDWARD ZIGLER AND LESLIE PHILLIPS 


attaining these objectives. Furthermore, these 
goals should be kept explicit throughout 
the period during which the system is being 
employed. The present diagnostic schema has 
not met this requirement. Instead, its goals 
have been carried along in an implicit manner 
and have been allowed to become vague, The | 
result has been that some see the purpose of 
the schema as being an adequate description 
of mental disorders (Hunt et al., 1953), others 
view it as being concerned with prognosis 
(Hoch, 1953), and still others view thei 
schemata goal as the discovery of etiology 
(Cameron, 1953). 1 
Typically, the present schema has been 
conceptualized as descriptive in nature, but a 
brief glance at its history indicates that the. 
original purposes and goals in the construction: 
of this schema went far beyond the desire fora” 
descriptive taxonomy. As Zilboorg and Henry 
(1941) clearly note, Kraepelin not only studied 
the individual while hospitalized, but also” 
the patient's premorbid history and post | 
hospital course. His hope was to make out 
understanding of all mental disorders as 
precise as our knowledge of the course of 
general paresis. He insisted on the classifica- 
tion of mental disorders according to regulari- | 
ties in symptoms and course of illness, believing | 
this would lead to a clearer discrimination 
among the different disease entities. He hoped 
for the subsequent discovery of a specife | 
somatic malfunction responsible for each | 
disease. For Kraepelin, then, classification was 
related to etiology, treatment, and prognosis. 
Had the system worked as envisaged, these 
variables would have become the extra 
classificatory attributes of the schema. When 
matched against this aspiration, the present. 
system must be considered a failure since the 
common complaint against it is that a diagnos- h 
tic label tells us very little about etiology, 
treatment, or prognosis (Miles, 1953). How- | 
ever, it would be erroneous to conclude that 
the present system is valueless because its - 
classes are only minimally related to etiology ^ 
and prognosis. d 
What should be noted is that etiology 4” 
prognosis, though important, are but two ofa 
multitude of variables of interest. The impot 
tance of these variables should not obscure. 
the fact that their relationship to a classifica- 
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tory system is exactly the same as that of any 
other variables. This relationship may take 
one of two forms. Etiology and prognosis may 
be the correlates of the classes of a diagnostic 
system which employs an independent classi- 
ficatory principle like symptom manifestation. 
Optimally, we should prefer a classificatory 

"schema in which the indices of etiology and 
preferred modes of treatment would be incor- 
porated (Hunt et al., 1953; Pepinsky, 1948). 
In essence, this was Kraepelin’s approach, and 
it continues to underlie some promising work 
in the area of psychopathology. Although 

-Kraepelin's disease concept is in disrepute 
(Hoch & Zubin, 1953; Marzoff, 1947; Rotter, 

. 1954), it is the opinion of several investigators 
(Eysenck, 1953; Phillips & Rabinovitch, 1958; 
Wittenborn et al, 1953) that further work 
employing the descriptive symptomatic ap- 
proach could well lead to a greater under- 
standing of the etiology underlying abnormal 
“processes.” 

Another manner in which etiology, treat- 
ment, or prognosis could be related to a 
classificatory schema is by utilizing each of 
these variables as the classificatory principle 
for a new diagnostic system. For instance, we 
might organize patients into groups which 
respond differentially to particular forms of 
treatment like electroshock, drugs, psycho- 

, therapy, etc. The new schemata which might 
be proposed could be of considerable value in 

_Tespect to certain goals but useless in regard 
to others, Since we do not possess a diagnostic 
System based on all the variables of clinical 
interest, we might have to be satisfied with 
the construction of a variety of diagnostic 
Systems, each based on a different principle 
of classification. These classificatory techniques 
Would exist side by side, their use being deter- 

| mined by the specific objectives of the 
diagnostician, 


Etiology versus Description in Diagnosis 


The classical Kraepelinian classification 
Schema shows two major characteristics: 
à commitment to a detailed description of the 
Manifest symptomatic behaviors of the 
individual and an underlying assumption that 
Such a descriptive classification would be 
transitory, eventually leading to and being 
replaced by a system whose classificatory 
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principle was the etiology of the various 
mental disorders. Major criticism of this 
classificatory effort has been directed at the 
first of these. The reservations are that, in 
practice, such a descriptive effort allows no 
place for a process interpretation of psycho- 
pathology and that it has not encouraged the 
development of prevention and treatment 
programs in the mental disorders. 

The authors do not feel that the failure of the 
Kraepelinian system has demonstrated the 
futility of employing symptoms as the basis 
for classification. It does suggest that if one 
approaches the problem of description with an 
assumption as to the necessary correlates of 
such descriptions, then the diagnostic system 
may well be in error. Kraepelin's empiricism 
is contaminated in just this way. For example, 
he refused to accept as cases of dementia 
praecox those individuals who recovered from 
the disorder, since he assumed irreversibility 
as a necessary concomitant of its hypothesized 
neurophysiological base. Bleuler, on the other 
hand, who was much less commited to any 
particular form of causality in this illness, 
readily recognized the possibility of its favor- 
able outcome. It is not, then, the descriptive 
approach itself which is open to criticism, 
but description contaminated by preconcep- 
tion. An unfettered description of those 
schizophrenics with good prognosis in contrast 
to those with poor prognosis reveals clear 
differences in the symptom configuration 
between these kinds of patients (Farina & 
Webb, 1956; Phillips, 1953). 

Kraepelin's basic concern with the problem 
of etiology has remained a focus of efforts in 
the clinical area. Although his postulate of 
central nervous system disease as the basis of 
mental disorder is in disrepute, and. his 
systematic classificatory eflorts are assailed, 
one nevertheless finds a striking congruence 
between Kraepelin's preconceptions and cer- 
tain current attempts at the solution of the 
problem of psychopathology. There is an 
unwavering belief that some simple cate- 
gorical system will quickly solve the mysteries 
of etiology. The exponents of these newer 
classificatory schemata have merely replaced 
symptoms by other phenomena like test 
scores (King, 1954), particular patterns of 
interpersonal relations (Leary & Coffey, 
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1955), etc. It is the authors’ conviction that 
these new efforts to find short-cut solutions 
to the question of etiology will similarly re- 
main unsuccessful. The amount of descriptive 
effort required before etiological factors are 
likely to be discovered has been underesti- 
mated (Kety, 1959a, 1959b), and the pursuit 
of etiology should represent an end point 
rather than a beginning for classificatory 
systems. The process of moving from an 
empirical orientation to an etiological one is, 
of necessity, inferential and therefore sus- 
ceptible to the myriad dangers of premature 
inference. We propose that the greatest 
safeguard against such prematurity is not to 
be found in the scrapping of an empirical 
descriptive approach, but in an accelerated 
program of empirical research. What is needed 
at this time is a systematic, empirical attack 
on the problem of mental disorders. Inherent 
in this program is the employment of 
symptoms, broadly defined as meaningful 
and discernible behaviors, as the basis of a 
classificatory system. Rather than an abstract 
search for etiologies, it would appear more 
currently fruitful to investigate such empirical 
correlates of symptomatology as reactions to 
specific forms of treatment, outcome in the 
disorders, case history phenomena, etc. 

The pervasive concern with etiology may 
derive from a belief that if this were known, 
prevention would shortly be forthcoming, 
thus, making the present complex problems 
of treatment and prognosis inconsequential. 
Unfortunately, efforts to short-circuit the 
drudgery involved in establishing an em- 
pirically founded psychiatry has not resulted 
in any major breakthroughs. Etiology is 
typically the last characteristic of a disorder 
to be discovered. Consequently, we would 
suggest the search for etiology be put aside 
and attempted only when a greater number of 
the correlates of symptomatic behaviors have 
been established. 

The authors are impressed by the amount 
of energy that has been expended in both 
attacking and defending various contemporary 
systems of classification. We believe that a 
classificatory system should include any 
behavior or phenomenon that appears promis- 
ing in terms of its significant correlates, At 
this stage of our investigations, the system 
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employed should be an open and expanding. 
one, not one which is closed and defended on 
conceptual grounds. Systems of classification 
must be treated as tools for further discovery, 
not as bases for polemic disputation, | 
As stated above, it is possible that a num. 
ber of systems of classification may be needed 
to encompass the behaviors presently d 
clinical interest. It may appear that the 
espousal of this position, in conjunction 
with a plea for empirical exploration of the 
correlates of these behaviors, runs headlong 
into a desire for conceptual neatness and 
parsimony. It may be feared that the use of a 
number of classificatory systems concurrently; 
each with its own correlates, may lead to the) 
creation of a gigantic actuarial table of une 
lated elements. However, the authors do nof 
feel that such a fear is well founded because 
it assumes that the correlates of these systems) 
have no eventual relation one to the other 
We believe that this latter view is um 
necessarily pessimistic. While in principle à 
multiplicity of classificatory systems might 
be called for, results from the authors’ owl! 
research program suggests that a single 
relatively restricted and coherent classifi 
cation system can be derived from an em- 
pirical study of the correlates of symptomati( 
behaviors (Phillips & Rabinovitch, 1958; 
Zigler & Phillips, 1960). Such a system might 
serve a number of psychiatrically significant 
functions, including the optimum selection of 
patients for specific treatment programs and 
the prediction of treatment outcomes. 
conclusion, a descriptive classificatory system) 
appears far from dead, and if properly em 
ployed, it can lead to a fuller as well as # 
more conceptually based understanding d 
the psychopathologies. 
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mrs report pertains to a description of 
, mental health and disorder that empha- 

sizes a continuum called Mutual Achieve- 
» ment Health. The analytic system is derived 
partly from past work (Lichtenberg, 1955; 
Lichtenberg, Kohrman, & MacGregor, 1960), 
partly from attempts to classify recent events 
in the lives of newly admitted hospital pa- 
tients, and partly from efforts to coordinate a 
mental health theory with a formulation of 
types of treatment (Lichtenberg, Cassetta, & 
Scanlon, 19602). 

It is our belief, as well as that of recent re- 
viewers (Herzog, 1959; Jahoda, 1958; Scott, 
1958a, 1958b), that a good theory of mental 
health is vital for research and practice in 
the fields of mental health education, the 
epidemiology of mental disorder, and the 
evaluation of psychotherapy or other treat- 
ment services. It is also our belief, as it is 
Maslow’s (1954), that the study of psy- 
chologically disordered persons in the thera- 
peutic setting is strategic for the development 
of scientific ideas about what are good human 
relationships and what is the good society. 
This report is intended to contribute to these 
fields and endeavors. 


: GENERAL THEORETICAL CONSIDERATIONS 


Apart from questions about the value of 
conceptualizing mental health (e.g., Freides, 
1960; Szasz, 1960), three views regarding 
mental health and disorder are vying for 
Scientific dominance at present: 

„1. Persons can be measured on a con- 
nuum according to their distance from com- 
Plete mental health or complete mental 
disorder, À 

2. Two dimensions exist—one defining the 

extent of mental health, the other defining 
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the extent of mental illness. For example, 
Korner (1958) proposed two continua, “mental 
health-mental injury” and “susceptibility to 
mental illness-mental illness.” 

3. There is no quantitative factor as- 
sociated with mental disorder; all differences 
are qualitative. 

We prefer the continuum thesis because 
it seems more consistent with fundamental 
scientific procedure and because the alterna- 
tives seem inadequate. To divide matters 
into two continua (positive mental health 
and mental illness) seems to conceal the 
judgment of severity on a single continuum. 
In practice, one must decide which of the 
two dimensions is more appropriate for a given 
individual, a choice that represents acting 
according to a judgment of severity. If one 
evaluates all people on both dimensions, 
the problems of overlap negate the value of 
separate dimensions. To propose that one 
study only qualitative differences, such as 
diagnostic classes, similarly postpones unduly a 
severity measurement. We inevitably act as 
if there are severity differences when we 
plan treatments for schizophrenics or anxiety 
neurotics. These alternatives will be sub- 
sumed within a single mental health dimension 
in the long run in any case. 

Two assumptions other than the con- 
tinuum thesis have guided this work. First, 
we have supposed that conscious experience, 
especially in the form of intentions, is com- 
panion to the social actions of the person and 
must be so conceived. We are proposing to 
observe simultaneously social actions and 
private intentions, noting their congruencies 
and discrepancies. We treat intentions as 
parts of actions, not motivators of actions. 
Thus, we assume that one must know what a 
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person intends to do, what he has in mind, 
and that one must also know how he correlates 
his intentions with his actions. The investiga- 
tion of many intentions-in-transactions in- 
corporates the necessary and sufficient con- 
ditions for evaluating mental health.! 

The second assumption is that environ- 
mental contributions to transactions must be 
evaluated when estimation is made of mental 
health level. It is one of the more significant 
facets in our theory that environmental 
contributions are considered to vary and 
to do so systematically. For one thing, en- 
vironmental agents may be more or less 
inducing (initiating) or responsive (reacting) 
with respect to the person being studied. 
For another, environmental agents may 
encourage inadequate intentions and dis- 
courage superior ones (or vice versa). Both 
of these influences act to prevent any person 
from rising far above his fellow men with 
respect to mental health level. Conversely, 
the most disturbed among us cannot be so 
very different from the rest of us, for his 
disturbance is partly reflective of our actions 
as environmental forces. _ 


Our MENTAL HEALTH THEORY 


Mental health is a quality of personality, 
reflected in virtually every action of a person’s 
life. We mean by this, first, that mental 
health is an attribute of a personality system. 
Second, by the time a person possesses a 
definite personality system (sometime during 
childhood or early adolescence), he has also a 


1 The influence of the unconscious is inferred when 
there is a discrepancy between intention and transac- 
tion and when that discrepancy is not determined 
primarily by unforeseen contributions from the en- 
vironment. This is to say, before any inference can be 
drawn about unconscious forces, one must first note 
a discrepancy and then note that it was primarily a 
function of the person. 

This analysis suggests two types of situations in 
which interpretations of unconscious motivation are 
irrelevant: Situations in which the intentions and 
transactions are congenial so that the intention is 
fulfilled. The overall intention-transaction: symmetry 
(given even minor discrepancies) makes reference to 
unconscious motivation trivial. Such agreement of 
intention and transaction is probably more frequent 
than we have been led to believe by the emphasis on 
man’s irrationality. Situations in which a person’s 
environment is primarily responsible for the discrepancy 

between intention and transaction. 


P. LICHTENBERG, R. K. CASSETTA, AND J. C. SCANLON 


discernible mental health level that is rela 
tively persistent. Third, the mental health 
factor affects all actions of a person and is 
thus, open to observation at all times, It 
takes exceedingly microscopic observation, 
however, to detect the entire structure in 
one or two events. It is more reliable to ob- 
serve consistent biases and their central 
tendency or adaptation point over a variety 
and number of events. Momentary influences 
are muted in favor of the consistencies of 
the personality system. Thus, mental illness is 
considered to be a generalized tendency of a 
person to deviate to a given degree from what 
is “appropriate” (a matter to be discussed 
later). Further, mental health pertains to 
psychological events in which the person i 
actively helping to create the event, to give it 
content, scope and meaning. The events art 
transactional, 

While the person is never entirely 1 
sponsible for the totality of an event, he 8 
responsible for his intentions. We take im 
tentions to mean the purposes, plans, oi 
projected outcomes that the person com 
sciously experiences, that he has in mind, 
An intention is what a person “wants” to do 
when he knows that he “wants” to do it. A 
person who turns on the gas in the oven all 
puts his head in may want to die, or he may 
want to be rescued. One must study lis 
phenomenological experience to discover whic 
is his intention. 

The aspect of the intentions that seems 
significant to a theory of mental health 1$ 
whether the intention reflects strivings to 
equality or inequality. The equality facto! 
refers to the sharing of whatever satisfaction 
accrue to the transactions. Equality is "i 
tended if the person and the others are E 
posed to suffer or gain together and in rough y | 
equal proportions. Inequality takes either D 
aggrandizing or a sacrificing form. In 3 
aggrandizing intention, the imbalance favo 
the person; in the sacrificing intention, 
imbalance favors the other. 

Because intention and action need 
coincide, evaluating intentions solely is "* 
sufficient for determining mental health. 

a person is intending equality, he may ‘he 


T 7 2S 
be creating circumstances that inhibit = 
He mi 


| 
not 


practical realization of equality. 


A Continuum FOR MENTAL HEALTH 


reveal a discrepancy between his theory and 


his practice, his thought and his action. 


The feature of events that seems significant 
in providing a context for the intention toward 
equality or inequality is a transactional factor 
here called mutual achievement. All events are 
characterized, on the one hand, by some goal 
or purpose which gives expression to the goal 
directed or achievement character of the 
event. (We speak here of the goals of the 
transaction, not the intention of the person.) 
All events contain, on the other hand, some 
indication of the sharing among the par- 
ticipants of the achievements gained, of the 
implications of their mutual inclusiveness or 
exclusiveness, in brief, of the mutuality of 
satisfaction. 

We contend that the achievement factor 
and the mutuality factor are always inte- 
grated in events and that one observes only 
mutual achievement strivings. All behaviors 
must be observed, accordingly, with simul- 
taneous attention to achievement and sharing 
factors. For instance, a person's orientation 


toward achievement is determined in part 


bo 


by his subjective probability of success and 
in part by his anticipation of how that which 
ls achieved is to be divided among the col- 
laborators. Similarly, a person's sharing 
propensities will depend both on his anticipa- 
tions of mutuality and his expectancy of 
Successful goal attainment (cf. Lichtenberg, 
Kohrman, & MacGregor, 1960, pp. 200-203). 

We have formulated four distinguishable 
levels of mutual achievement strivings. At the 
lowest level are to be found minimal mutuality 
and achievement. Whether in examining 
Persons on this level one emphasizes the 
mutuality component (e.g., a man as a social 
isolate) or the achievement component (e.g., 
à man as a poor worker), one sees that the 
Person's attempts at relating both to people 
and to goals lead to loneliness and to low 


-level accomplishments. At the highest level, of 


Course, the reverse is true. In describing these 
four levels, we proceed from the highest to 
the lowest, from that presumed more healthy 
to that presumed less healthy. 

The highest level of mutual achievement 
Striving in this system (Level IV) is called 
the “Redefine? level. In events at this level, 
the person’s intentions are met with attempts 
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at redefining or restructuring by the other(s) 

in the interaction. The intentions of the 

person are affirmed as starting points, in- 

complete and awaiting development, while 

at the same time revision and expansion of 

these intentions or their structural arrange- 

ments is already taking place. The other in 

the transaction is suggesting by his manner or 

actions that the person's intentions are 
tentatively acceptable, but acceptable pri- 

marily as origins from which will grow new 

intentions, ones that will permit closer in- 

tegration of the person and the other. The 

other in the transaction aims to restructure 

the person’s intentional experience so that 

his own intentions and wishes are given more 

expression. The alignment of the person’s 

intentions vis-à-vis the actions of the other is 

presented in Figure 1. 4 

The redefining process may focus upon the 

goal, the proposed sharing of rewards and 

punishments, the coordination of efforts, or 
other similar manifestations of mutual achieve- 
ment striving. The focus will vary according 
to the actual congruence between the person 
and the other in their intentions, their past 
experiences in coordinated effort, the nature 
of the task, and other such “reality” factors. 
When there is much willingness to share 
rewards, more of the redefining endeavor will 
center upon the goals to be sought or the ways 
in which goals may be reached. When there are 


PATIENT'S UE ERCIOP 
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DIS CORD 
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l 
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Fic. 1. Representation of maneuvers by the other 
in a transaction vis-à-vis a person's intentions with 
reference to levels of mutual achievement strivings. 
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common goals, the restructuring concerns 
sharing, cooperation, and coordination. 

The next level of mutual achievement 
striving (Level IIT) is that called the “Ac- 
cord" level. In events labeled “Accord,” 
the person’s intentions are affirmed by the 
other, not only tentatively as starting points, 
but decisively, as the intentions whose de- 
velopment will control the future. The goals 
of the person in their original form become 
the goals of the transaction. The initial 
sharing patterns are accepted without major 
attempts at modification. The other in the 
transaction is receptive to or promotes se- 
lected intentions of the person (those that 
are congenial, of course), to which he then 
adds his talents and skills and hopes. The 
other aligns himself in accord with the per- 
son's direction, as indicated in Figure 1. 
It represents that form of cooperation in which 
I do what you want, thereby cooperating 
with you. 

At this level there is typically an emphasis on 
maximum mutuality between the person and 
the other; but this is not the same as maximum 
mutual achievement striving, The mutuality 
alone is divorced to a certain extent from the 
goals that the other may wish to posit. The 
other in the transaction is forced to inject his 
intentions in an indirect way—by selectively 
attending to the intentions offered to him. 
He aligns himself only with those intentions 
which promise gratification to him as well as 
to the originator. Some constructive achieve- 
ment components are, thus, lost to the trans- 
action, They are , transformed from task 
oriented achievement striving to endeavors 
for the attainment of mutuality as achieve- 
ment itself. 

Level II in mutual achievement strivings 
contains those events in which the person 
meets a call for new intentions, that is, for 
intentions not currently active or likely to be 
active were he alone to dictate the course of 
events. The other in these transactions tries to 
induce distinctly new intentions into the 
person’s life. It is asking the person to 
“want” in a way he has not previously con- 
ceptualized. 

It is true that all actions by the other tend 
to call forth responses from the person and 
in some sense to require new behaviors and 
intentions. If one redefines another’s be- 
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havior, for example, he accepts the original 
intentions only as stations on the road to | 
something new and different, as places to 
obtain fuel that will be consumed as move- 
ment progresses and newly created intentions 
arise. We do not use the Level II mutual | 
achievement analysis, however, to refer to | 
these types of new intentions being required. | 
When something new is called for directly. 
and distinctly, it is to be recognized in that 
form. Sometimes the clue is expressed by 
surprise, perplexity, or hestitation before | 
any commitment is made. Whatever the clue; 
it does not seem difficult to infer this type of | 
transaction. | 

The remaining level of mutual achieve 
ment striving (Level I) includes events in | 
which the other aligns himself in discord 
with the person's intentions or selects goals 
that will not be acceptable to the person] 
There is, of course, a minimum of sharing 
and of achievement in this level. The other 
in the transaction aims to inhibit the develop” 
ment of the expressed intentions; but because } 
a relationship is presumed to exist for all 
event to be observed, efforts are being exi 
pended to create a different set of intentions. 
in the person through the rejection of those. 
expressed. 

The other implies that he wants the rela- 
tion to move to a new plane of intentions 
One way, of course, is to have the person- 
other relation dissolved. Another way, aM 
probably the expected way as far as the 
opposing other experiences it, is for the person: 
to inject a radically different intention into 
the transaction, one that will be congenat 
so that the relationship and goal striving cai 
progress. 1 

These are the four levels of mutual achiev 
ment striving that we discern: Redefine av) 
Accord (III), Require a Tendency (ID, # 
Discord with the person's intention by th 
other (I). The differences between the lev 
reflect both mutuality or sharing differenos 
and achievement or goal attainment differ 
ences. 

We have developed the thesis that th 
important aspect of intentions is the strivl 
toward equality or inequality. Aggrandi ? 
and sacrificing forms of inequality are coun 
as equivalents for this analysis, since 
do not contribute differently to severity 
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TABLE 1 


A QUANTIFICATION OF INTENTIONS TOWARD 
EQUALITY OR INEQUALITY WITHIN LEVELS 
OF MUTUAL ACHIEVEMENT STRIVINGS 


Weighted 


UTE. The intention-transaction combination 


An equalitarian intention is redefined 

An equalitarian intention finds accord 

An equalitarian intention follows a requirement 
of a tendency* 

An aggrandizing or sacrificing intention is re- 
defined 

An equalitarian intention finds discord 

An aggrandising or sacrificing intention finds 
accord 

An aggrandizing or sacrificing intention follows 
a requirement of a tendency* 

An aggrandizing or sacrificing intention finds 
discord 


Dn pe & wre 


^ Since “require a tendency” presumes the lack of a specific 
intention by the person in the determination of an event, we took 
the intention that developed as a function of the event for this area. 


disorder. We have also proposed that the im- 
portant facet of the transaction is the mutual 
achievement striving. Combining these sepa- 
tate proposals gives eight varieties of in- 
tention-transaction categories, which are 
presented in Table 1 in the form used to 
quantify each event recorded. The eight 
varieties include intentions toward equality 
within the four levels of mutual achievement 
striving and intentions toward inequality 
within the four levels. 

In giving quantitative values to the eight 
Categories, we were led to establish two equiv- 
-alents, This equivalence derives from the 
fact that the person helps to create the trans- 
action in addition to creating his intention. 
Thus, functional equivalence exists between 
Posing an equalitarian intention in a transac- 
tional context of discord and posing an ag- 
&randizing intention that is supported by the 
other. On the one hand, the discord of the 
transaction limits the extent of satisfaction 
that Is to be shared equally; on the other hand, 
the kind of aggrandizing supported by the 
other Is restricted by his selectivity regarding 
Which Intentions he will sustain. The aggran- 

ing is restricted by the conditions of sup- 
Port in the transaction, and the intention 
toward equality is limited by the lack of 
communion in the transaction. 
Nal ta equivalents are an equalitarian 
E. inte that follows the requirement of a 
in nition by the other, and an aggrandiz- 
8 or sacrificing intention that is redefined 
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by the other. We do not credit the person 
with the original, private development of the 
intention toward equality in the first instance; 
we do credit him for selecting a good rede- 
fining transaction in which to express ag- 
grandizing intentions in the second instance. 

The procedure for developing an estimate 
of severity of mental disorder treats the 
mental health level as the central tendency of 
deviations over a variety of events. A series of 
events in a person’s life is scored, and an 
average is taken. The larger the series scored, 
of course, the greater the reliability of such a 
central tendency. 

Because mental disorder is conceived as a 
consistent deviation from something desig- 
nated as “appropriate,” a deviation must be a 
deviation from some standard. The standard, 
or value, or that-which-is-appropriate, in 
our case, is an equalitarian intention in a 
redefining transaction. 


METHOD 


This analysis of mental health was implemented in a 
clinical record form relevant to disposition case con- 
ferences. We collected information pertaining to sig- 
nificant events in the recent past of 164 patients 
admitted to Syracuse Psychiatric Hospital? between 
late October 1958, and early June 1959. The rating 
schedule, which includes items on functional disability, 
diagnosis, presenting complaint, type of treatment, 
and locomotion in treatment, was filled out by the 
three authors, operating in three teams of two persons 
each when the patients and their histories were pre- 
sented by the responsible residents and interns to the 
director of the hospital or his assistant director. Thus, 
the information used is our reflection of the interviewer’s 
understanding of the patient’s story and that of his 
relatives, friends, welfare workers, etc. The patients 
were nearly always presented to the staff in person, 
and new material and verification or clarification from 
these presentations were obtained. Further informa- 
tion was gained from a systematic search of the written 
records kept by the hospital about the patients. 

For a significant event to berecorded, it wasnecessary 
to have reasonable information regarding four matters: 
who initiated the event, the patient or the other; 
the environmental area—family, marital, religious, 
etc.—in which the other exists; the mutual achievement 
characteristic of the patient-other relation; and a 
descriptive basis for determining whether the patient’s 
intentions were aggrandizing, sacrificing, or equalitarian. 


2 Syracuse Psychiatric Hospital is a state hospital 
with 60 beds, attached to the Upstate Medical Center 
as a teaching and research hospital and oriented toward 
diagnosis and short-term treatment. We are indebted to 
Marc Hollender, the director, for permission to work 
in the hospital and for the help he and his staff gave 
to us. 
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The last factor, the descriptive basis, varied accord- 
ing to several issues that accompanied our aim for 
immediate descriptiveness. We directly rated the 
patient’s intention as toward or away from treatment 
in those events connected with locomotion toward the 
hospital. We inferred from these an equalitarian inten- 
tion if the patient aimed toward, anda nonequalitarian 
intention if he aimed away from treatment. (Since we 
did not wish to overweight this sort of event, the 
locomotion scores were averaged, and the resulting 
average score was used as one of the series of scores 
pertaining to other events.) Regarding other events, 
behaviors were described briefly, and later ratings 
were made concerning the equalitarian-inequalitarian 
nature of the intentions. If the patient initiated the 
event, we described his behavior. If the other initiated 
the event, we recorded the environmental pressure and 
inferred the patient’s intention from the equality of 
the pressure and the mutual achievement character of 
the transaction. 

An “event” was taken as experienced by the patient. 
Using the patient's definition of an event is important 
for establishing a proper unit of observation. Some 
examples may be illuminating. 

1. An ex-patient foresaw the return of his severe 
depressive feelings and turned to his family physician 
for help in returning to the hospital. The doctor 
arranged for his admission. We noted that (a) the 
patient initiated the transaction, (5) the environmental 
area was "other kinds of concurrent treatment," 
(c) the patient and doctor were in accord, and (d) 
the patient's intention was toward treatment. Counting 
the "toward treatment" as an equalitarian intention 
and the transaction as accord gives an intention- 
transaction weighted score of 2 for this event. 

2. One patient consulted her priest about what she 
felt to be her badness about not wanting her recently 
born child. The priest tried to assure her (and only 
upset her more) by saying she had not committed a 
sin. We recorded that (a) the patient initiated the 
event, (b) the area was religion, (c) the patient and 
priest were in discord, and (d) the intention was to 
sacrifice, The denial in the transaction of a sacrificial 
intention was given a weighted score of 6. 

3. A patient's husband, after a long period of debate, 
suddenly decided on his own that the family should 
move out of their home into a new one. He was prepared 
to sacrifice his own desires to please his wife. We rated 


TABLE 2 
INTERRATER AGREEMENT IN RATING INTENTIONS 
Agree Disagree? "Total 
Intention 
N 9 |N To VAN % 
Equalitarian - 69| 65.7) 36) 34.2) 105)100.0 
Nonequalitarian 530| 87.6| 75| 12.4) 605|100.0 


Total 599| 84.4) 111) 15.6) 710/100.0 


a If two raters scored the intention as equalitarian and the 
thirddidnot, theaccountingwasEqualitarian-Disagree. Similarly 
with two raters scoring nonequalitarian, the accounting was Non- 
equalitarian-Disagree. Note that this table encompasses all three 
raters judging each item. 
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TABLE 3 


INTERRATER AGREEMENT IN RATING LEVEL 
OF MUTUAL ACHIEVEMENT 


Agree Disagree Total 
Level eS eee — 

N| %|N| %|N| % 
IV Redefine 16| 61.5} 10| 38.5} 26/100.0 | 
III Accord 31| 60.8| 20| 39.2) 51/100.0 
II Require a 14| 38.9) 22| 61.1| 36)100.0 
tendency | 
I Discord 122| 79.2, 32| 20.8| 154)100.0 | 


©The total number of events included in this analysis is small | 
because only events selected initially by both raters are used. 
This procedure enhances the “independence” of the ratings. 


this as (a) other initiates, (b) marital area, (c) patient- 
husband accord, and (d) patient's intention as ag 
grandizing (the husband's sacrifice plus accord implying 
the patient's intention). We scored this, accordingly, 
asa4. 


| 
Reliability 
The total number of scores used in esti | 
mating the mutual achievement health of the} 
164 patients was 874, an average of 5.33 ptr. 
person. This total consists of 710 items othet 
than locomotion and 164 locomotion average 
scores. The scores for each person were 
averaged to give a mental health score. ‘The 
mental health scores ranged from 2.3 to 60 
with a mean of 4.6 and a median of 4h 
The scores tend toward the upper limits, 
partly because we have a hospital popula- 
tion that is presumably more mentally di 
turbed than a nonhospital population 
partly because significant events reported at 
a case conference tend to favor descriptions 
more serious disabilities and troubles. 
In examining agreement among Tate 
distinction is made between the selection 
events from the mass of information available) 
and the classification of selected events 
Group decision prevailed in the final selection) 
and scoring, but we wished to study how coms 
sistently different people could apply k 
theoretical analysis. Between 47% and 144 
of events proposed by one rater were 
chosen by the other; the average of the thre 
raters taken with each other was 62%. — 
When events have been selected, conr risol 
can be made of agreements on the rating 
assigned to the events. The intentional a 
the transactional characteristics of the eve% 
were rated separately in order to mn" 
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such halo effects as may occur in the creation 
of a single score for a person from many items 
and in rating one sort of information along 
with other material with which correlations 
are to be made. The results are presented in 
Tables 2 and 3. In Table 3, the percentages 
of agreement correlate with an intuitive esti- 
mate of what is easiest to rate (deny or oppose) 
and what is hardest (requires a new tendency). 
The agreement in Table 2 refers to all three 
raters judging simultaneously; that in Table 
3 is derived from a team-by-team judging 
process. The information in these tables lends 
credence to the final scores evolved in group 
discussion. 


Tests Applied to the Theory 


Two sorts of tests beyond consistency are 
pertinent to the scientific valuation of this 
theoretical analysis. First, how well do the 
results based on this system conform to 
other estimates of severity of mental disorder, 
and second, does placing people on this con- 
tinuum relate to such practical issues as 
type of treatment and disposition of the pa- 
tient? 

For comparison purposes, we used two 
Measures of severity of mental illness. The 
first of these is “Functional Disability,” a 
theoretical scheme we developed (Lichtenberg, 
Cassetta, & Scanlon, 1960b) which is based 
on the extent of a patient’s disability (none, 
mild, moderate, or severe) in each of four 
domains or foci of relations: self-others, self- 
things, self-body, and self-psyche. Summing 
the degree of disability in the four domains 
Permits a range from a score of 0 to a score of 
12, For the 164 cases reported, the observed 
Severities range from 1.5 to 11.0 with a mean 
of 5.6, a median of 5.5, and a standard devi- 
ation of 1.6, 

_ There is minimal overlap between the 
information determining the mutual achieve- 
ment health score and that determining the 
functional disability score. Any correlation 
between the two scores is likely to be a func- 
tion of their measurement of a person, not of 
their interdependence methodologically. 

€ second measure of presumed severity 
of mental illness which we used is diagnosis 
as given in the American Psychiatric Associ- 
ation standard nomenclature of mental dis- 
orders. The diagnostic categories applied, in 
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order of presumed severity of illness, are 
schizophrenia, affective psychosis (including 
involutional melancholia), personality pattern 
disturbance, sociopathic personality disturb- 
ance, psychoneurotic reactive depression, all 
psychoneuroses other than reactive depression 
(called ‘other neuroses”), and transient 
situational personality disorder. In the com- 
parisons that test the mutual achievement 
health analysis, the following are combined, 
either because the numbers are small or other, 
prior results suggest their equivalence: per- 
sonality pattern disturbance plus sociopathic 
personality disturbance (small numbers); 
“other neuroses” and transient situational 
personality disorders (functionally equivalent), 
and a miscellaneous group (small numbers). 

The next test to which the mutual achieve- 
ment health ratings are put concerns the 
prescriptions of therapy. The formulation of 
treatment types (Lichtenberg, Cassetta, & 
Scanlon, 1960a) derives from the same theo- 
retical analysis as the mutual achievement 
health scheme. Indeed, it is a particularization 
of the general theory with special reference 
to psychological treatment as a task and the 
therapist as the other in the transaction. 
In the functional disability study, we were 
able to demonstrate that severity of disorder 
and level of mutual achievement striving in 
therapy vary together. The same should hold 
true with the mutual achievement health 
ratings. Comparisons were made by the 
following fourfold classification, so limited 
because one level of treatment has few cases: 
no treatment or not established; uncovering 
and interpreting treatment; supportive care 
or changing the patient's world to enhance 
fulfillment of his wishes; and directive care or 
protective treatment, including the protection 


TABLE 4 
Tue RELATION OF MUTUAL ACHIEVEMENT 
HEALTH SCORES AND FUNCTIONAL 
DISABILITY SCORES 


Mutual Functional disability scores 
achievement 
scores 0-4.7 4.8-5.5 5.6-6.5 6.6-11.0 Totals 
04.2 22 12 4 6 44 
4.3-4.6 12 10 18 3 43 
4.7-5.1 6 14 6 10 36 
5.2-6.0 3 8 10 20 41 
Totals 43 44 38 164 


39 
x? = 48.21, df = 9, p < .001 C= 48 
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TABLE 5 
Tue RELATION OF MUTUAL ACHIEVEMENT HEALTH SCORES AND DIAGNOSTIC IMPRESSION 


Diagnostic impression 


Mutual achievement 


mores. Affective 


Schizophrenia psychosis 


Personality 
pattern and 
sociopathic 


Other 
neuroses* and 
situational 


Neurotic 
depression 
reaction 


All other 


diagnoses Total 


x 
eO 1000 
© 


N 
ROWAN 


Totals 44 27 21 


of the patient from himself and the protection 
of others. 

The final test concerns what may be called 
outcome. While severity of disorder is not 
the sole determinant of a patient’s disposition 
or ability to remain out of hospital, it is 
probably a significant one. Patients were 
divided into the following three classes: 
patients discharged to the community who 
remained out of a psychiatric hospital for at 
least 9 months subsequent to discharge (“out, 
not back”); patients discharged to the com- 
munity who returned to any psychiatric 
hospital within 9 months after discharge 
(“out, back”); and patients sent directly to 
another inpatient psychiatric hospital for 
care ("to other inpatient care"). Since Syra- 
cuse Psychiatric Hospital does not ordinarily 
provide long-term care, these categories seem 
reasonable for a practical evaluation of the 
mental health theory. The expectation was 
that the first group of patients would cluster 
on the healthier side, the second group in the 
middle, and the third group on the more 
disordered side of a ranking of all patients on 
mental health. 


RESULTS 


The findings in the comparison between 
Mutual Achievement Health and Functional 
Disability are presented in Table 4. The 
distribution of patients in this table follows 
theoretical expectations with minor excep- 
tions (e.g., the four central cells), where 
greater precision is needed to separate cor- 
rectly people who are in the central cells 
because the range of scores defining them 


: 12 14 44 
.3-4. 7 6 43 
1-5. 5 36 
.2-6. 2 3 41 | 
26 25 21 164 
xi = 37.20, df = 15, p < .002 C = 43 
U This class includes all psychoneuroses other than depression reaction. | 
TABLE 6 
MUTUAL ACHIEVEMENT HEALTH SCORES AS 
RELATED TO Types OF TREATMENT 


"Treatment types 
Mutual 
achievement 
heal 


III 
Sup- 

rt or 
nhance 


N Herd 
en LIV Direct or Totals 
Protect 


not es- 


scores tablished Uncover 


1 15 11 4 
4 13 20 43 
4 7 20 36 
3 4 30 41 


Totals 22 22 39 81 164 
x? = 23.59, df = 9, p = .005 


TABLE 7 
Tur RELATION or MUTUAL ACHIEVEMENT 
HEALTH Scores To OUTCOME 


Outcome 

Mutual 

chit Discharged- " i - 
mates | nore | eif | Dice Tus 

within 9 sion within | “hospital 
2a 

0-4.2 33 6 4 5i 
4.3-4.6 28 1 1 ^M 
4.7-8.1 17 11 6 NT 
5.2-6.0 16 9 15 

Totals 94 37 26 157 

= 2547,df=6,p<.001 C=37 


® This total differs from the 164 total studied because 7 patien | 
could not be classified in these three groups. 


(eg, 4.8-5.5 and 5.6-6.5 in Function 
Disability) is narrower than the range 
scores for the peripheral cells (e.8-, 04 


Jl 
and 6.6-11.0 in Functional Disability). | 


The second comparison (Table 5) involv 
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the Mutual Achievement Health formula- 
tion and diagnostic impression. The distribu- 
tion of persons in this table also tends to 
confirm the theory being examined. Although 
there are some failures in the data, the overall 
tendencies are confirmatory. 

The Mutual Achievement Health scores 
fail to give order to the group of patients 
diagnosed as affective psychosis. Because 
these patients were adequately handled in 
the Functional Disability investigation and 
because the neurotic counterpart of a psy- 
chotic affective state, neurotic depressive 
reaction, is in accord with theoretical expecta- 
tions in the current study, this failure is 
perplexing. 

The relation of preferred type of treatment 
and Mutual Achievement Health is shown in 
Table 6. The distribution of the 164 patients 
shows a statistical significance that confirms 
the hypothesized relation. The healthier 
patients tend to be given those treatments in 
which the level of mutual achievement striv- 
ings is higher, and the less healthy patients are 
given those in which the level is lower. 

The final test was health level and out- 

come. The results are presented in Table 7. 
They tend to confirm the utility of the mental 
health theory. More of those people who are 
discharged and not readmitted to a hospital 
within 9 months are at the healthy end of the 
scale; those people who are discharged and 
Teadmitted cluster in the middle scores; and 
those who are sent directly to another hospital 
are more likely to be scored in the lower 
echelons of the Mutual Achievement Health 
dimension, 

Inspection of the records of the 16 patients 
With a low mental health score who are dis- 
Charged but do not return suggests that some 
of these people were discharged to their homes 
for essentially custodial care. One man died 
in the interim. Others were hospitalized for 
acute phases of chronic disorders and released 
When these phases were ended. In most in- 
Stances, therefore, the mental health scores 
of these 16 people do not seem to be faulty. 


Discussion 


| A close reading of the theoretical analysis 
reveals that the person under observation is 
Dot classified as redefining the intentions of the 
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other, according with them, requiring new 
ones, or disagreeing with the expressed in- 
tentions of the other. We do not study the 
person's behavior as it fits the intentions of 
others, which a complete examination of 
personality might profitably include, but 
only the impingement of the other on the 
person's intentions. That is, we do not study 
the person as an object in the life of the other. 
This limitation reflects the fact that this 
study was done in a clinical setting where 
patients’ intentions are more accessible to 
observation than the intentions of collabo- 
rators. 

A slightly different matter concerns who 
initiates each event, who is the "attributed 
source" as Heider (1958) put it. When the 
person's intentions are redefined, he is neces- 
sarily the initiator; when a new intention is re- 
quired of him, he must be a reactor. However, 
he can be initiator or reactor when there is 
either an accord or discord level of mutual 
achievement striving. Thus, the four levels of 
mutual achievement striving expand into six 
types of striving. 

From an examination of all the items, we 
conclude that six concomitant factors ac- 
company mutual achievement levels. By 
considering a factor to be a concomitant, we 
mean both that it goes with mutual achieve- 
ment strivings levels in a natural connection 
and that it accompanies in a subordinate 
capacity. These six factors are: 

1. The apparent unification of mutuality 
and achievement concerns. At upper levels of 
mutual achievement strivings, it is difficult 
to separate mutuality facets from achieve- 
ment facets. At lower levels, people’s actions 
seem to be either mutuality directed or achieve- 
ment directed, even though they are mutual 
achievement strivings in fact. 

2. The number and complexity of intentions 
in the events. In the upper levels, many and 
complex intentions fit within events; in the 
lower levels, few and simple ones are present. 

3. Reliance upon external authority for prog- 
ress or maintenance of the relation. In the 
higher levels of mutual achievement strivings, 
there is little reliance upon hierarchy, tradi- 
tion, or outside authority. The reverse is true 
at-the lower levels. 

4, The autonomy of the actors. At the higher 
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levels, the participants bring to the relation 
autonomously formed plans and proposals; 
at the lower levels, a certain emptiness is 
evident. 

5. The combined embeddedness, openness, 
and sense of development characteristic of 
events. The events of a higher level are more 
directly embedded in other, often larger, 
events; whereas those of a lower level are 
discrete, being seen in larger units, mainly as 
repetitive acts. 

6. The apparent discreteness of events as 
events, The determination of what sequences 
are “events,” the units of observation, is more 
difficult for the higher levels than for the 
lower levels. 


SUMMARY 


This report presents a theory of mental 
health formed under the influence of three 
general considerations: persons can be de- 
scribed according to a position on a mental 
health continuum; conscious experience plus 
transactional characteristics of social behavior 
comprise the essential features for determina- 
tion of mental health; and the environment 
is to be included in variable and systematic 
fashion in such a determination. From the 
analysis of intentions and mutual achieve- 
ment strivings are derived eight quantified 
combinations of intentions-in-transactions. 
Applying the scoring system to a series of 
events provides a quantitative statement of 
mental health level. 

The theory was examined with respect to 
the histories of 164 hospitalized patients. 
The outcomes were compared to an alternate 
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analysis of mental health, to diagnosis, to 
treatment prescribed, and to outcome. The 
results tend to verify the theory. 
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THE INFLUENCES OF PERSONALITY AND SITUATIONAL 
CAUTIOUSNESS ON CONCEPTUAL BEHAVIOR: 
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NUMBER of psychologists have stressed 
the importance of considering situ- 
ational and personality factors as 

codeterminants of behavior. Newcomb (1943) 
dealt with the interaction of situational and 
personality factors in his classic study of 
attitude change among college students. As 
early as 1935 Lewin stated “the actual be- 
havior of the child depends in every case 
both upon his individual characteristics and 
upon the momentary structure of the existing 
situation” (p. 71). In a recent review of the 
literature, Masling (1960) discusses the de- 
creased predictive power of studies that fail 
to consider the combined influence of situa- 
tional and personality factors. 

The complex effects of personality and 
situational variables require more than the 
simple acknowledgment that they both con- 
tribute to behavior. Not all subjects are 
equally responsive to different situational 
cues. A situational variable may influence 
only the behavior of subjects who exhibit a 
Particular personality characteristic. By pool- 
ing Subjects, without regard for personality 
differences, the influence of the situation may 

be masked or greatly attenuated. On the other 
hand, the variance associated with the per- 
Sonality variable may be neutralized by 
Strong situational cues. That is, different 
personality tendencies may be observable only 
under certain condition. The optimum 
Approach for assessing psychological processes 
Should be based on the study of these two 
types of variables both as separate and 
Combined determinants of behavior; as 
Rotter (1960) contends, it is necessary to 
Study the interaction of situational and 
Personality variables if we are to increase the 


‘Adapted from a dissertation submitted to the 
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power of our predictions and provide a basis 
for replicability. 

This investigation deals with the way 
responses are modified by the subject's ex- 
pectancy of success or failure on the experi- 
mental tasks. Persons who fear that they will 
be perceived in a negative light frequently 
exercise certain controls over their test be- 
havior. These controls are defensive since 
they protect the person from revealing aspects 
of himself that might result in failure and/or 
disapproval. Defensive reactions should be 
greater when an individual has a generalized 
low expectancy for success or in a situation 
that specifically engenders low expectancies. 

The term cautiousness is used to describe 
the class of defensive behaviors dealt with 
in this investigation. Cautiousness is regarded 
as “the tendency to behave in a manner 
designed to avoid potential failure or dis- 
approval experiences, and this goal is achieved 
often at the expense of other satisfactions.” 
That is, the cautious person is more con- 
cerned with avoiding failure than with obtain- 
ing success. In order for cautiousness to be ob- 
served there must be a basis for judging 
certain response alternatives as entailing 
less risk and less potential satisfaction than 
other alternatives. The level of aspiration 
paradigm provides a task situation that meets 
these criteria. The subject can behave in 
a cautious way by stating low goals, even 
when these expressed low goals place restric- 
tions on his maximum possible score. That 
is, the subject is told that his score can be 
no higher than his expectancy statement. On 
the other hand, unrealistically high expect- 
ancy statements are discouraged by penal- 
izing the subject for failing to score as high 
as he predicted he would. mo 

The use of the level of aspiration as a 
measure of cautiousness has been discussed 
frequently in the psychological literature. 
gem SER Festinger, and Sears (1944) 
conclude that fear of failure is a major de- 
terminant of level of aspiration behavior. 
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Persons who have a strong need to avoid 
failure behave cautiously by stating low goals. 
Frank (1935) states that cautiousness is a 
generalized personality trait that influences 
level of aspiration behavior. He did not ob- 
tain any independent measure of caution 
but inferred the trait from the level of aspira- 
tion behavior. Gould (1939) concludes from 
her results that goal setting behavior often 
serves a person as a protection of the ego in 
which the stated estimates are designed to 
shield him from failure. 

Cautiousness is also reflected by the con- 
ceptual approach used in responses to visual 
stimuli. The verbal labels used to conceptua- 
lize a visual stimulus may vary considerably. 
For instance, responses to TAT pictures can 
be descriptive or, conversely, highly in- 
ferential. The descriptive statements are 
objective and their accuracy cannot be easily 
challenged. In addition, descriptive responses 
are nonprojective and provide the subject 
with a way of evading disclosure of internally 
held beliefs, conflicts, or needs. Descriptive 
responses are therefore assumed to represent a 
cautious approach since there is minimal risk 
of criticism and/or personal exposure. In- 
ferential responses can be idiosyncratic, 
personalized, and imaginative. These re- 
sponses are more open to evaluation of either a 
critical or favorable nature and would gen- 
erally reflect a noncautious approach. 

The hypotheses tested by this investiga- 
are as follows: 

1. Cautiousness, when induced by an 
experimental situation of failure, leads to 
increased descriptiveness in verbal responses 
to TAT pictures and conceptual sorts (MAPS 
figures). 

2. The degree to which the failure situation 
leads to increased descriptiveness is determined 
by the subject’s personality disposition. 
That is, personality and situational factors 
interact in the prediction of descriptive re- 
sponse. 

3. Cautiousness, as a general personality 
characteristic (determined from level of 
aspiration behavior), is associated with 
greater descriptiveness in verbal responses 
to the personality test materials. The relation- 
ship should be most pronounced when assessed 
under neutral conditions (Control group). 
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METHOD 
Subjects 


The sample consisted of 109 students enrolled in 
the introductory psychology classes at the Ohio State 
University: 61 females and 48 males from 17 to 30 
years of age. 


Procedure 


The subjects were randomly assigned to three 
experimental groups and each subject was seen in- 
dividually for two separate sessions. 

Session 1. Prior to the experimental conditions 
all subjects performed on the Rotter (1942) Level of 
Aspiration Board, to provide a behavioral measure of 
characteristic cautiousness.? The subjects were seated 
at the end of a table on which the Level of Aspiration 
Board was placed and the standard instructions were 
administered.* Each subject had 20 trials on this task, 
and a difference score (D score) was obtained between 
subject’s actual performance score and his subsequent 
expectancy statement for each trial. 

Rotter (1945) has described nine patterns charac- 
terizing level of aspiration behavior, regarded as “points 
of concentration which can be used as flexible standards 
for understanding individual cases," and based on à 
combination of three factors: his D score, the frequency 
of shifts, and the occurrence of unusual shifts (up 
after failure and down after success): 

1. Low positive D score pattern. This is generally 
considered the most realistic response. Estimates até 
usually higher than past performances with an average 
number of shifts and unusual shifts absent. 

2. Low negative or very slightly positive D scott 
patterns. This pattern is similar to Number 1, but 
with a lower D score. J 

3. Medium high D scores. This pattern 18 charac- 
terized by frequent usual shifts (up after success and 
down after failure) and high D scores. , 

4. Achievement followers. These subjects’ estimates 
are constantly changed to equal or closely approx 
mate the previous performance. 

5. The step pattern. Here the subject never Jowers 
his estimate. 1 

6. Very high, positive D score patterns. In this patter! 
the subject behaves unrealistically by stating d 
goals regardless of characteristically lower performan 
levels. Sie 

7. High negative D score patterns. This is cum 
terized by frequent shifts down after success aml 
negative D scores. a) 

8. Rigid pattern. This pattern is defined by 
absence of shifts. xi 

9. The confused or breakdown pattern. Here b 
a high number of shifts in addition to frequent "| 


2 A questionnaire was constructed as à second en 
ure of characteristic cautiousness. This questio d 
proved to be a less effective predictor than the ^^ | 
of Aspiration Board and was not included in the PPS 

3 Taken from unpublished manual prepared by d- 
Rotter. | 
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usual shifts, i.e., shifts upward after failure and down- 
ward after success. D scores can be any size. 

The subjects within each of the three experimental 
groups were classified as either cautious or noncau- 
tious from their level of aspiration pattern. Research 
findings indicate that the pooling of similar patterns 
provides greater generality and predictability than 
classifications based solely on the use of the specific 
patterns (Neems & Scodel, 1956). In this study, 
Patterns 1 and 3 were classified as noncautious and 
2, 4, 7, and 8 as cautious. Ten subjects who obtained 
nonclassifiable patterns (5, 6, and 9) were omitted 
from the part of the analysis pertaining to personality 
differences. The interrater agreement for scoring the 
nine patterns was 857%; for the pooled cautious and 
noncautious patterns it was 97.5%. 

Session 2. All subjects were administered a brief 
specially constructed paper-and-pencil form that they 
were told was a “test of social acceptability,” con- 
sisting of 25 statements concerning social attitudes. 
The subjects were instructed to rate themselves on a 
five-point scale for each of these 25 statements. For 
Groups 1 and 2, the experimenter pretended to score 
the test in the presence of the subject and compared 
these results with false norms he had in front of him. 
The subjects in Group 1 were told that they scored in 
the “lower 10 percentile" (Failure group, N = 36), 
and the subjects in Group 2 were told that they scored 
in the “upper 10 percentile" (Success group, N = 
36). This procedure was omitted for Group 3 (Con- 
trol group, N = 37). After experiencing the experi- 
mental condition all subjects were asked to take two 
more tests which they were told were also measures of 
“social acceptability.” These two tests were to “make- 
up a story” for Pictures 2, 4, 17GF, and 20 of the TAT 
and to sort into two groups, according to to the sub- 
ject’s own idea, 22 cut-out human figures taken from 
the MAPS test (Shneidman, 1952). The sorting pro- 
cedure was repeated three times for the complete group 
of figures. The TAT and MAPS responses were used 
to assess cautious conceptual behavior. 

TAT. Each TAT story was scored for conceptual 
level on a four-point scale. The most descriptive stories 
obtained a score of 1 and the most inferential ones 
obtained a score of 4. Intermediate stories on the de- 
Scriptive-inferential dimension and popular themes, 
as determined from normative studies (Eron, 1950; 

osenzweig & Fleming, 1949) were given scores of 
2 and 3, The interrater reliabilities, based on product- 
Moment correlations, for Pictures 2, 4, 17GF, and 20, 
respectively, were 1.00-++, .67, 100+, and .95.5 

MAPS Conceptual Sorting Task. The 22 figures in 
the MAPS test—including representations of males, 


“At the conclusion of the experiment the Failure 
Subjects were given information that negated the earlier 
ibo experience. 
Eu ise coefficients were corrected, because @ 
uted number of broad categories were used in 
ates these variables, by a procedure described by 
ier and Van Voorhis (1940). Lack of precision in 
* correction results in some correlations of 1.00+-. 


Rae Correlations do indicate reliabilities in the high 
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TABLE 1 
Descriptive Statistics WITH RESPECT TO 
THE MAPS Sorts AND TAT 
CONCEPTUAL SCORES 
Sorts TAT 
N 

M SD M SD 

Failure group 
Total 31 | 6.45 | 2.49 | 9.03 | 3.96 
Cautious 14 | 6.50 | 2.56 | 9.07 | 4.28 
Noncautious 17 | 6.41 | 2.43 | 9.00 | 3.68 

Success group 
Total 32 | 7.00 | 2.44 | 9.38 | 3.40 
Cautious 14 | 6.79 | 2,01 | 9.21 | 2.98 
Noncautious 18 | 7.17 | 2.71 | 9.50 | 3.69 

Control group 
Total 36 | 7.50 | 2.62 | 9.47 | 2.88 
Cautious 15 | 6.13 | 2.00 | 8.33 | 2.55 
Noncautious 21 | 8.48 | 2.58 |10.29 | 2.83 


females, figures with features, those where the features 
had been omitted, figures in costume, figures in every- 
day clothes, others in protective and gesticulating 
poses, and ones assuming a relatively static stance, 
etc.—suggested a variety of conceptual dimension. 
The subject had the opportunity to sort according to 
clearly physical qualities of the stimuli or on a more 
highly inferential basis, Each of the three MAPS 
sorts was scored according to the same 1-4 point scale 
of descriptiveness to inferentiality used in the scoring 
of the TAT stories. The interrater reliability for this 
score was 1.00+.° 

Ohio State Psychological Examination (OSPE) 
scores were obtained from the university records for 
all the subjects for whom such scores were recorded, 
The OSPE scores provided an estimate of intellectual 
ability and were correlated with the scores obtained 
on the experimental measures. 


RESULTS 


The three groups (Failure, Success, and 
Control) were each dichotomized into cautious 
and noncautious subgroups on the basis of 
the subject’s level of aspiration performance. 
The N, mean, and standard d for 
each group and subgroup are presented m 
Table 1. High scores on the sorts and TAT 
indicate inferential behavior and low scores 
reflect descriptiveness. The Mann-Whitney 
U test was used for comparing groups as to 
the degree of descriptive behavior on the 
MAPS sorts and TAT. The U values were 
converted into z scores’ so that all group 
comparisons are expressed as z score values. 

6 te 5. 

E Sid (1956) states that as the N in either group 
increases in size, the sampling distribution of oen 
proaches the normal distribution. Therefore, a formi 


632 


TABLE 2 


DIFFERENCES BETWEEN EXPERIMENTAL 
GROUPS ON DESCRIPTIVENESS 


(expressed as z score values) 


Condition Sorts TAT 
Failure vs. Success .91 Fe 33 F 
Failure vs. Control 1.45 F .87 F 
Success vs. Control .68 S .61 S 


Note.—s score of 1.96 needed for .05 level. 
A The letter F, S, or C indicates whether Failure, Success, or 
Control group was higher on descriptiveness. 


TABLE 3 
DIFFERENCES BETWEEN EXPERIMENTAL GROUPS 
(Cautious and Noncautious subgroups 
analyzed separately) ON 
ESCRIPTIVENESS 
(expressed as z score values) 


Noncautious Cautious 
subjects subjects 
Condition 
Sorts TAT | Sorts | TAT 
Failure vs. Success .76 F^ |1.56 F| .49 F| .39 F 
Failure vs. Control 2.41%% F|1.27 F| .59 C| .07C 
Success vs. Control 1.50 S -88 S | 1.10 C | .86 C 


S The letter F, S, or C indicates whether Failure, Success 
or Control group was higher on descriptiveness. 
* p < .02; two-tailed test. 


Table 2 presents data pertaining to the 
first hypothesis. While the Failure group was 
more descriptive than the Success or Control 
groups for all comparisons, as hypothesized, 
these differences were not statistically sig- 
nificant. Thus, the hypothesis pertaining to 
the influence of conditions was not supported. 

The second hypothesis stated that the 
Failure condition would have its greatest 
effect in increasing descriptive responses when 
the subject’s personality disposition was 
taken into consideration. This was tested by 
comparing the influence of the conditions 
for the cautious and the noncautious subjects 
separately. The findings pertaining to this 
hypothesis are reported in Table 3. As can 
be seen, the Failure condition promoted 
increased descriptive responses for non- 
cautious but not for cautious subjects. How- 


is provided for converting the obtained U into a z 
score when the W in either group exceeds 20. This 
conversion is also recommended as a correction when 
the same rank occurs for scores involving both groups. 
Since all of the observations meet either one or both of 
these conditions, all of the U's are converted into z 
Scores. 
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ever, the only statistically significant com- 
parison was between the Failure and Control 
conditions for the MAPS sorts (z = 241, 
p < 02; two-tailed). The cautious subjects || 
did not respond differentially to Failure as 
compared with Control and Success conditions. 
The increase in descriptiveness under the 
Failure condition, noted in Table 2, appears, 
thus, to have been contributed by the non- 
cautious subjects. 
The mean scores presented in Table 1 pro- 
vide a basis for summarizing and better 
understanding the foregoing results. The 
cautious subgroup in each of the three ex- 
perimental conditions were quite similar in 
descriptiveness. They had attained pre- 
experimentally, a plateau on descriptiveness 
and were not appreciably influenced by the 
conditions. The noncautious subjects were | 
most inferential in the Control condition and 
became more descriptive in the experimental 
conditions, particularly in the Failure con- 
dition. Therefore, the cautious subjects re 
mained stable on descriptiveness acros 
conditions, whereas, the noncautious subjects 
exhibited a degree of descriptiveness com- 
parable to the cautious subjects only in the 
Failure condition. i 
According to the final hypothesis, the 
cautious subjects should be more descriptive 
in their responses to the sorts and TAT than | 
noncautious subjects, and the difference 
should be most pronounced for a neu 
situation (Control condition). Table 4 shows 
the comparisons between the cautious an 
noncautious subgroups for descriptiveness °° 
the sorts and TAT within each condition. 


TABLE 4 


DIFFERENCES BETWEEN CAUTIOUS AND 
Noncautious SUBGROUPS ON 
DESCRIPTIVENESS 


(expressed as z score values) 


Condition Sorts TAT 
ME. 
Failure (Cautious vs. Non- .01 NC^ .06 NC 
cautious) C 
Success (Cautious vs. Non- .27 C -10 
cautious) C 
Control (Cautious vs. Non- | 2.84**** C 2.07 
cautious) 


* NC or C indicates whether the noncautious OT cautious | 
subgroup was higher on descriptiveness. 
** p < .05; two-tailed test. 
**** p < ,01; two-tailed test. 
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TABLE 5 


Propuct-MoMENT CORRELATIONS BETWEEN 
EXPERIMENTAL MEASURES AND 


OSPE Scores 
Condition Sorts TAT 
Failure group (V = 28) —.02 —.10 
Success group (V = 30) .32* —.02 
Control group (N = 28) .02  5gesét 


* p < .10; two-tailed test. 
*#4* 5 < 01; two-tailed test. 


The z score was again used to evaluate the 
significance of the difference for these com- 
parisons. 

The cautious and noncautious subgroups 
did not differ in descriptive behavior for the 
Failure and Success condition, However, 
for the Control condition, the cautious subjects 
were significantly more descriptive than the 
noncautious subjects on both the sorts (z = 
2.84, p < .01; two-tailed) and TAT (z = 2.07, 
p < .05; two-tailed). The effects of the per- 
sonality variable appear to have been obscured 
by the strong situational forces operating for 
the Failure and Success conditions. When 
these situational forces were minimized 
(Control condition) the influence of the 
Personality variable became quite evident. 
While the cautious subjects were not more 
descriptive in all conditions as predicted, 
the fact that they demonstrated greater 
descriptiveness in the neutral situation indi- 
cated a basic relationship between descriptive- 
hess and cautiousness. That is, descriptive- 
hess on the sorts and TAT was associated 
with characteristic cautiousness and this 
relationship held only for the Control con- 
dition, 

Scores on the OSPE were available for the 
Majority of the subjects, Correlations with 
the conceptual scores on the TAT and MAPS 
Sorting task for each of the situational con- 
ditions are presented in Table 5. 

The MAPS conceptual score and OSPE 
Were correlated for the subjects in the Success 
condition (r = .32, p < .10; two-tailed). 
These MAPS responses were confounded 
with an experimental condition; this correla- 
tion is, therefore, difficult to interpret. On 
the other hand, the TAT conceptual score and 

E were negatively related for the Control 
Condition (r = —.52, p < .01; two-tailed). 
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That is, the brighter subjects gave more 
descriptive TAT responses. 

The two significant correlations reported in 
Table 5 increase the complexity of the experi- 
mental findings. These relationships do not 
appear to detract from the stated hypotheses, 
but rather, suggest that additional dimensions 
also contributed to the conceptual behavior. 

The TAT and MAPS conceptual scores 
were positively correlated (r = .30, p < .01; 
two-tailed). Cautiousness on the Level of 
Aspiration and OSPE scores were unrelated 
(rois = .16, not significant). 


Discussion 


When all subjects were pooled there was 
no relation between descriptiveness and the 
situation. However, when personality differ- 
ences were considered, the situation became 
a significant determinant of descriptive be- 
havior, only noncautious subjects being 
influenced by experimental conditions. Failure 
induced descriptiveness in the noncautious 
subjects, but did not increase descriptive 
responses among the cautious subjects, who 
had a pre-experimental set to respond to all 
conditions as if failure were imminent. This 
finding highlights the importance of systemati- 
cally investigating individual differences in 
susceptibility to situational cues. By so doing, 
one is able to increase the power of predictive 
hypotheses. This position is underscored by 
some findings obtained by Marlowe and 
Crowne (1961), who observed that persons 
differ in their need to subscribe to socially 
desirable behaviors and that differences in 
this need predict sensitivity to situational 
cues. 

The second major finding was that per- 
sonality differences predicted descriptive be- 
havior only for a neutral situation. That is, 
cautious subjects were more descriptive than 
noncautious subjects for the Control condition, 
but not under Failure or Success, which ap- 
parently masked the variance otherwise 
associated with the personality variable. 
This finding bears on the need to consider 
both situational and personality variables. 
Certain situational conditions can easily 
obscure personality differences leading in- 
vestigators to conclude erroneously that no 
differences exist. 


Fear of failure is a function of both situ- 
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ational and personality factors. Subjects 
defended against it by being more descriptive 
on the conceptual tasks. On both the MAPS 
and TAT, tests frequently used to assess 
content variables, the personality and situ- 
ational variables were both associated with 
descriptiveness. Yet the possible occurrence 
of many substantive responses is contingent 
on the conceptual level used by the subject. 
If a subject is highly descriptive, the range of 
potential responses is reduced. Furthermore, 
pronounced descriptiveness precludes the 
occurrence of many personalized responses 
which, by definition, are inferential. These 
findings emphasize the importance of con- 
sidering the presence of general defensive 
patterns, such as cautiousness, in interpreting 
psychological test results. 

No specific predictions were made con- 
cerning the Success group, which turned out 
to be intermediate between the Failure and 
Control groups on descriptive behavior. The 
positive reinforcement given the Success 
group might have been expected to decrease 
Cautious behavior. However, such was not 
the case, An alternative explanation, in line 
with the obtained findings, is that the positive 
reinforcement on the social attitudes test 
actually evoked cautiousness on the subse- 
quent, functionally related tasks. That is, 
the provision of a reinforcement for one task 
(positive or negative) may structure other 
tasks in that situation as potential success or 
failure experiences. The reinforcement, re- 
gardless of its direction, increases involve- 
ment for the related tasks. Support for this 
position is suggested by Lazarus, Baker, 
Broverman, and Mayer (1957), who state 
that "psychological stress involves the thwart- 
ing of a motive state” and that “a motive 
state has to be present in order for stress to 
occur.” Furthermore, in keeping with the 
results of this study, they found that “stressor 
conditions” served to make subjects low in 
achievement motivation more literal on a 
task where they were required to listen to 
and reproduce tape recorded paragraphs, 

In addition to showing the interplay be- 
tween situational and personality variables, 
the study provides evidence for the moderate 
generality of cautiousness over a variety of 
situations. Cautiousness on the Level of 
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Aspiration task predicted descriptiveness on 
the sorts (roi. = .56, p < 01) and on the 
TAT (ris = .42, p < .05) for the Control 
condition. Descriptiveness on the sorts and 
TAT were positively correlated for the total 
sample. Furthermore, cautiousness induced 
by the Failure situation predicted descriptive 
behavior. 


SUMMARY 


This investigation dealt with the influence 
of situational and characteristic cautiousness 
as codeterminants of conceptual behavior. 
A Failure condition produced situational 
cautiousness and the level of aspiration meas- 
ured characteristic cautiousness. Cautious 
conceptual responses consisted of descriptive 
responses to TAT cards and a figure-sorting 
task. 

The major findings were as follows: 

1. The Failure condition contributed a 
significantly greater amount of descriptive 
responses than the Control condition only for 
subjects judged to be noncautious prior to 
the experimental conditions. When all subjects 
were pooled this relationship largely dis- 
appeared. 

2. The personality variable predicted de- 
scriptive behavior only for a neutral situation. 
That is, cautious subjects were significantly 
more descriptive than noncautious subjects 
on the sorts and TAT for the Control con- 
dition, but not for the Failure or Success 
conditions. 
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DOGMATISM AND ACQUIESCENT RESPONSE SET 
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WE Dogmatism scale has been one of the 

major instruments used by Rokeach and 
his associates in their studies of belief systems. 
According to Rokeach (1960), "the primary 
purpose of this scale is to measure individual 
differences in openness or closedness of belief 
systems. Because of the way we have defined 
open and closed . . . the scale should also serve to 
measure general authoritarianism and general 
intolerance" (pp. 71-72). Rokeach has criticized 
the California F Scale on the grounds that the F 
Scale is not independent of ideological preference 
and therefore measures “right” authoritarianism 
and not general authoritarianism. For example, 
communists score low on the F Scale but high on 
Dogmatism, or general authoritarianism. The 
Dogmatism and F scales usually correlate highly, 
however, the reported correlations ranging from 
.54 to .77 (Rokeach, 1956). 

'The Dogmatism and F scales are structurally 
identical, both being Likert-type scales. Items on 
both scales are positively scored attitudinal 
statements, a high number of “Agree” responses 
yielding a high score. On both scales the subject 
responds by endorsing items with numbers ranging 
from +3 to —3, 0 excluded, indicating responses 
ranging from strong agreement to strong dis- 
agreement. 

The construct validity of the F Scale has been 
criticized because several investigators have shown 
that part of the scale’s variance is attributable to 
an acquiescent response bias (Chapman & Camp- 
bell, 1957; Christie, Havel, & Seidenberg, 1958; 
Cohn, 1953; Jackson & Messick, 1957; Leavitt, 
Hax, & Roche, 1955). Some writers have even felt 
impelled to reinterpret the meaning of authori- 
tarianism so as to account for the data on F Scale 
response set (Gage, Leavitt, & Stone, 1957). 

In spite of the structural similarity of the 
Dogmatism and F scales, the susceptibility of 


1 The authors wish to express their appreciation to 
the members of the neuropsychiatric staff of the Ann 
Arbor Veterans Administration Hospital who assisted 
in the collection and processing of the data. All state- 
ments are those of the authors and do not necessarily 
represent the opinions or policy of the Veterans Ad- 
ministration. 


Dogmatism to acquiescent response set has not yet 
been questioned. The present study was designed 
to demonstrate that Dogmatism is just as vulner- 
able to acquiescent response set as the F Scale. 


METHOD 


Subjects were 40 adult male inpatients in the 
neuropsychiatric service of a General, Medical, and 
Surgical Veterans Administration Hospital. The sub- 
jects’ mean age at nearest birthday was 33.7 years 
(SD = 9.8). Their mean number of years of formal 
education was 9.9 (SD = 1.7). On the seven-point 
occupational status rating scale described by Warner, 
Meeker, and Eells (1949), the subjects’ occupations 
had a mean value of 5.8 (SD = 1.0); 27 subjects were 
unskilled or semiskilled laborers. In religious affilia- 
tion 32 subjects were Protestant and 8 were Roman 
Catholic. All subjects were tested within 4 weeks of 
their admission to the neuropsychiatric wards. 

The F Scale, Form 40-45, was given under conven- 
tional instructions (Adorno, Frenkel-Brunswik, Levin- 
son, & Sanford, 1950). The split-half reliability €s- 
timated by the Spearman-Brown formula was .79 for 
our sample (M = 4.5; SD = 0.9). The 40-item Dog- 
matism scale was given under instructions recommended 
by its author (Rokeach, 1956). Split-half reliability 
was .76 (M = 4.1; SD = 0.7). 

Two measures of acquiescent response set were 
developed. The first was a Difference score based on 
32 items taken from a measure of impulsivity developed 
by Sanford, Webster, and Freedman (1957). A revised 
form of Sanford’s scale was given to subjects Wi 
Likert-type response categories. Sanford’s measure 
contained 121 positively and 23 negatively scored 
items. From this item pool 16 negatively scored ‘items 
were selected for which 16 positively scored items 
similar to the negatively scored ones in content, could 
be found. For example, a typical positive item, 
have used alcohol excessively,” was paired with 4 
corresponding negative item, “I have never done any 
heavy drinking.” For each subject two scores were 
obtained, a total score on the 16 positive items and & 
total score on the 16 corresponding negative items. 
Subtracting the negative from the positive total 
yielded a Difference score for each subject. The size 
of this Difference score was assumed to reflect the 
magnitude of each subject’s acquiescent bias. The 
split-half reliability of the measure was .62 (M — 
—14; SD = 16.8). Chapman and Campbell (1957) 
and Couch and Keniston (1960) have also used €07- 
tent canceled items to measure acquiescence. 
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TABLE 1 
INTERCORRELATIONS AMONG ACQUIESCENCE MEASURES, 
THE F AND DoGMATISM SCALES AND VERBAL 
INTELLIGENCE 


1, Acquiescence differ- | .29*| .47**| .45**| —.26 
ence score 
2. Acquiescence MMPI .51**| .38**| —.15 
True 
3. Dogmatism scale .T0**| —.20 
4. F Scale —.32* 
5, Shipley-Hartford 
(verbal intelligence) 


* Indicates significance at the .05 level, one-tailed test. 
** Indicates significance at the .01 level, one-tailed test. 


The second acquiescence measure was total number 
of MMPI True responses; its reliability was .93 (M = 
2346; SD = 41.0). This measure was analogous to 
those used by Mathews (1929), Lorge (1937), Cron- 
bach (1942), Cohn (1953), Shelley (1956), and Gage 
et al. (1957). 

In the first testing session subjects were given in 
fixed succession: several penciland-paper tests being 
used for other purposes, the Shipley-Hartford, the 
F Scale, Dogmatism, and the Sanford measure from 
which Difference scores were obtained. The vocabulary 
section of the Shipley-Hartford was used as a measure 
of verbal intelligence. Its raw-score mean was 25.2 
(SD — 6.0). In the second testing session, a week later, 
Subjects were given the MMPI in card form but were 
not permitted to use the Uncertain response category. 


RESULTS 


The Pearson product-moment correlations 
among the measures are presented in Table 1. The 
obtained correlations were in the expected direc- 
tion. Both acquiescence measures correlated 
significantly with both Dogmatism and the F 
Scale. The —.32 correlation between the F Scale 
and the Shipley-Hartford supports the earlier 
reports of similar negative correlations between 
the F Scale and intelligence by Adorno et al. 
(1950), Cohn (1951), and Gough (1951). The 
correlations between the acquiescence measures 
and the Shipley-Hartford are comparable to the 
low negative relationship between acquiescence 
and intelligence reported by Cohn (1953) and 
Bass (1956). The previously reported findings just 
cited were largely based on student samples; the 
Present study extends the generality of these 
findings to a sample of neuropsychiatric patients 
both older and less educated than the student 
Broups. The only puzzling aspect of the data was 
the telatively low .29 correlation between the two 
acquiescence measures. 

Another way of interpreting the number of 
MMPI True items is as a measure of degree of 
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psychoticism rather than acquiescence. This 
interpretation would follow from Cottle and 
Powell’s (1951) report that endorsing all MMPI 
items True gives a high score on the MMPI 
“psychotic triad” while endorsing all items False 
elevates scores on the “neurotic triad.” Thus the 
obtained relationships between MMPI True 
responses and the F and Dogmatism scales could 
mean that subjects who scored high on the F 
Scale and Dogmatism tend to be the more psy- 
chotic. Two considerations tend to rule out this 
explanation. First is the fact that the pattern of 
correlations between True MMPI responses and 
the other variables was quite similar to the pattern 
based on these variables and the Difference score, 
Second are some additional data that explicitly 
test this interpretation. The final hospital diagnosis 
was used to classify subjects into four groups: 
psychotic, neurotic, personality disorder, and 
psychophysiological reaction. A chi square test 
showed no association between diagnostic category 
and either the F Scale or Dogmatism. The mean F 
Scale and Dogmatism scores were slightly higher 
for the 15 neurotics than for the 11 psychotics; 
the mean number of MMPI True responses was 
also slightly higher for the neurotics. Neither of 
these differences was significant, however. All 
these results are inconsistent with the alternative 
explanation that number of MMPI True responses 
reflects degree of psychoticism and strengthens the 
assumption that this is a valid measure of 
acquiescent response set. 


DISCUSSION 


These data indicate that Rokeach's Dogmatism 
scale is vulnerable to acquiescent response set 
roughly to the same extent as the F Scale. This 
finding suggests that previous work with Dog- 
matism should be re-examined with an eye to 
determining which measures are related to Dog- 
matism by virtue of their sharing variance in 
acquiescence. As with the F Scale, the correlation 
between Dogmatism and other response biased 
measures cannot be interpreted unambiguously. 

Future studies relating the F Scale and Dog- 
matism to other possibly acquiescence-biased 
measures should include an independent measure 
of acquiescence as a matter of routine. This will 
make it possible to specify to what extent obtained 
relationships are attributable to construct rather 
than methods variance. For example, in the present 
study the correlation of .70 between the F Scale 
and Dogmatism overstates the relationship 
between the two measures due to content alone. 
When the effects of the two acquiescence measures 
are partialed out this correlation reduces to .57. 

The recent attempt by Couch and Keniston 
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(1960) to characterize the “personality” of the 
consistently acquiescent respondent fails to 
recognize the acquiescence-vulnerability of Dog- 
matism. Because their measure of acquiescence 
correlates .40 with Dogmatism, they conclude that 
this “appears to reflect the diffuse anxiety of the 
yeasayer, and to suggest that he seeks dependency 
on external figures for support” (p. 159). On the 
other hand, their data indicate only that the 
acquiescent respondent is characterized by a high 
score on a personality measure with an 
acquiescence bias built into it. The hypothesis 
that the Dogmatism scale is contaminated by 
acquiescence bias mot only offers a more 
parsimonious interpretation of Couch and 
Keniston's results but also receives substantial 
support in the present study. 


SUMMARY 


The study was designed to test the hypothesis 
that Rokeach’s Dogmatism scale, structurally 
identical to the F Scale, is similarly vulnerable to 
acquiescent response bias. Two measures of 
acquiescence were developed and administered, 
along with the Dogmatism and F scales and a 
measure of verbal intelligence, to 40 adult neuro- 
psychiatric inpatients. Both acquiescence measures 
were positively related to both the Dogmatism 
and F scales. It was concluded that the Dogmatism 
measure contains acquiescent response set vari- 
ance. The implications of this finding for past and 
present research using the Dogmatism scale were 
discussed. 
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THE EFFECTS OF INFORMATION SOURCE STATUS AND DOGMATISM UPON 
: CONFORMITY BEHAVIOR 


ROBERT N. VIDULICH ann IVAN P. KAIMAN 
Louisiana State University 


TJ: present investigation attempted to 
provide a partial answer to the question: are 
highly authoritarian persons uniformly conforming 
to all directive-producing sources, or are they 
differentially influenced by information sources 
that are high or low in status? Previous research 
has indicated that both variables, source status 
and authoritarianism are related to conformity 
behavior (e.g, Crutchfield, 1955; Lefkowitz, 
Blake, & Mouton, 1955). 

Rokeach (1960) in his theory of general authori- 
tarianism or “dogmatism,” differentiates between 
persons who are relatively open (low dogmatic or 
low authoritarian) or closed (high dogmatic or 
high authoritarian) with respect to their belief- 
disbelief systems about reality. A person’s belief- 
disbelief system is relatively open or closed depend- 
ing on his ability to “receive, evaluate and act on 
relevant information received from the outside on 
its own intrinsic merits... the more closed the 
belief-system, the more difficult should it be to 
distinguish between information received about 
the world and information received about the 
source [of this information]...” (pp. 57-58). 

Rokeach’s (1960) theory would predict that 
relatively closed persons should confuse what an 
external authority had to say about a situation 
with who the authority was, leading to increased 
conformity to the authority if he were high in 
status, and decreased conformity to, or even 
rejection of, the authority were he seen as being 
low in status. Relatively open individuals should 
be capable of discriminating source from message, 
and should show little difference in their reactions 
to varying source figures. 


METHOD 
Apparatus 


The experimental situation used the autokinetic 
effect and the general procedure of Sherif (1935). 
Subjects in the present study were asked to report 
the direction rather than the distance of autokinetic 
movement. 

_ The stimulus light box was mounted on a table 32 
inches above the floor at one end of a lightproof and 
semisoundproof room. A point of light was exposed 
through a hole 1 millimeter in diameter in one end of a 
lightproof wooden box, 9 inches long, 3 inches wide, 
and 3 inches high, painted flat black inside and out. 
The light source was a small radio bulb at the far end 
of the box. The bulb was connected to a transformer 
to receive a constant electrical input of 2.5 volts. Two 


thicknesses of tissue paper diffused the light. Exposure 
time was automatically controlled by two Hunter 
timers so that the light was alternately activated and 
shut off for 10 second intervals. This sequence was 
repeated for the total experimental period. The light 
was visible at all times except when the experimenter 
switched off the power supply between the two phases 
of the experiment. 

The subjects and the experimenter were seated at 
the opposite end of the experimental room from the 
light source, the subject being 197 inches (5 meters) 
from the light aperture. The experimental room was in 
complete darkness at all times. 


Subjects 


From a pool of 307 undergraduates in introductory 
psychology courses at Louisiana State University who 
had taken the 40-item Dogmatism scale, Form E 
(Rokeach, 1960), 30 high and 30 low scorers were 
selected as subjects. All 60 subjects were in the upper 
or lower quartiles of Dogmatism scale scores, all were 
females between the ages of 18 and 23 years, and all 
but 4 were sophomores. None had previous knowledge 
about the autokinetic phenomenon. The mean Dogma- 
tism scale score for the 30 open subjects was 126.75 
(e = 11.66); for the 30 closed subjects, the mean 
Dogmatism score was 181,37 (c = 11.87). 

Each subject was in the experimental situation with 
a confederate of the experimenter throughout the 
experiment. The 60 subjects were divided into four 
groups: Group I consisted of 15 subjects who had 
scored high on the Dogmatism scale (closed), for whom 
the confederate was identified as a high status person; 
Group II consisted of 15 closed subjects for whom the 
confederate was identified as a low status person. 
Groups III and IV each consisted of 15 subjects who 
had scored low on the Dogmatism scale (open), for 
whom the confederate was identified as being high or 
low in status, respectively. 


Procedure 


During Phase A of the experiment, the subject ob- 
served 30 exposures of the light, judging the direction 
of the light’s “movement,” either left or right, and 
privately recording these judgments on one or another 
of two 3 X 5 pads in front of him. During Phase B, 
the subject observed 30 additional exposures of the 
light, now making his responses verbally, with the 
experimenter recording these responses. The con- 
federate reported his “judgments” prior to those of the 
subject during Phase B. 

Eighty percent of the confederate’s responses during 
Phase B were randomly given in the direction least 
frequently reported by the subject in Phase A. Upon 
determining the subjects least predominant directional 
choice of Phase A, the experimenter informed the 
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The 30 trials of Phase A were then administered. 
After the thirtieth exposure, the experimenter said: 

Please hand me your pads. In order to speed this 

up a little bit, the next time the light goes on, I 

want you to tell me which way the light moved, and 

I'll record your answers myself. This should take 

less time. You can tell me your answers after the 


And out of courtesy to the faculty 


[visiting student], Dr. Wilson [John] you can go first. 
At the termination of Phase B, the subject was asked 
not to discuss the procedures of the experiment with 


Open group 5 
t 
? «01» 


* N = 15 for each group. 
b One-tailed test of significance used for these comparisons. 


confederate of the response pattern to be reported by 
tapping him on the right or left shoulder. The con- 
federate used a string of beads and knots to determine 
the direction of each response to be given. 

The subject entered the already darkened room in 
which the confederate was seated, and was guided to 
his seat by the experimenter. The meeting of the subject 
and the confederate took place in complete darkness. 
For Groups I and III (high status source), the con- 
federate was presented as “Dr. John Wilson, who has 
been doing a good deal of research at LSU. He will be 
observing the same thing as you and we are interested 


then escorted the subject 


out of the room, asking the confederate to remain 
“to talk to you for a few minutes.” Thus the subject 


RESULTS 


The first response measure defining conformity 
was calculated by subtracting the number of the 
least dominant directional responses of Phase A 
from the number of same direction responses during 
(difference score). The 
calculated by 
tabulating the number of responses of each subject 
which agreed with those of the confederate during 
Phase B (agreement score). For both measures, the 
higher the score, the greater the presumed con- 
formity to the confederate’s responses. 

From the upper portion of Table 1, it is evident 
that neither dogmatism nor status source is signifi- 
cantly related to the shift in directional judgments 
from Phase A: to Phase B (difference score). The 
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TABLE 1 
TREATMENT OF DIFFERENCE SCORES 
Analysis of Variance 
Source SS af | P ? 
Source status 56.06| 1| 3.95 ms light goes off. 
Dogmatism 19.26] 1| 1.36) ms 
Interaction (S X D) 308.29| 1 | 21.74| <.01 
Within 794.12, 56 
anyone. The experimenter 
Total 1177.73 4 a 
Comparison of Subgroup Means never saw the confederate. 
Source status 
Dogmatism High Low t ? 
y | sp | ue | sD 
Closed group 5:93l4-45 —0.53/4.5313.29|<.01® Phase B for each subject 
2.53(2.69| 5.1313.022.48|«.05 ^ second response measure was 


TABLE 2 
TREATMENT OF AGREEMENT SCORES 


Analysis of Variance 


in comparing your reactions with his." For Groups II Sone ea sie ee i 
and IV (low status. source), the confederate was pre- ot 
sented as “John Wilson, who is a high school student Source status 355.26) 1 SE S “OL 
at Baton Rouge High. He will be observing the same DE SxD ARS H d E 
thing as you and we are interested in comparing your Within on ( ) 647.32| 56 5 ; 
reactions with his.” These introductory statements n 
were made to the subject while the experimenter was Total 1687.93| 59 
escorting him to the experimental room. 

Task instructions were given as follows: i 

When I give the signal ‘‘Ready” a light will go on. Cant Spee 

After a short time, the light will start to move. Then Source status 

the het m go off. When the light goes off you are ES 

to mark the direction of the light's movement on " F 

one of e two pads directly in front of you. There icy Re eu ; 4 

is a pencil between the two pads. If the light moved 

to the left, make a mark on the left pad; if the light us | so | ar | s2 

moved to the right, make a mark on the right pad. Closed o| <.01” 

Sometimes the light will move a great distance, at One group 1215.8 l8 RE ns 

other times it will move only slightly. Be sure you n CAES 75.62 "8.80 i 

make a mark for each trial, and only one mark to ? eb | <.01è 


indicate the direction in which the light moved on 
each trial, When I give the signal there will be no 
talking. Ready. 


a N = 15 for each group. 
b One-tailed test of significance used for these comparisons. 
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major source of variance is attributable to the 
interaction of these two variables (F = 21.74, p < 
01). The lower portion of Table 1 indicates that 
the closed group exposed to a high status source 
shifted most toward the confederate’s responses 
from Phase A to Phase B (difference = 5.93), 
while the closed-low status group shifted away 
from the "judgments" of the confederate (differ- 
ence = -—0.53). The open groups displayed 
dissimilar behavior, agreeing significantly more 
with the directional responses of the low status 
than the high status source. That these results are 
not due to differential responsiveness during 
Phase A is indicated by the following: the mean 
number of least dominant directional responses 
during Phase A are 11.40, 10.80, 10.87, and 10.93 
for Groups I through IV, respectively. 

The analyses of agreement scores generally 
corroborate the foregoing results. Analysis of 
variance (upper portion of Table 2) indicates that 
both source status and dogmatism are highly 
significant sources of variation when considered 
alone, a finding which differs from the difference 
score analysis, but that the interaction of these two 
variables again produces the greatest effect (F = 
50.96, p < .01). 

_ Again, the closed subjects agreed more with the 
high than with the low status source (lower portion 
of Table 2). Out of 30 possible agreements during 
Phase B, the closed groups averaged 18.27 agree- 
ments under the high status condition and only 
7.13 agreements when responding following the 
low status source (t = 9.36, p < .01). The open 
groups did not differ significantly in mean agree- 
ne responses under the two source status condi- 

ions, 


Discussion 


The results of this experiment generally support 
the contention that conformity is a function of 
both the status of the source of information to 
which the person is exposed, and the authoritarian 
tendencies of the person being influenced. Analyses 
of both conformity measures used indicate that a 
highly important interaction exists between the 
variables of general authoritarianism (cognitive 
Closedness) and information source status, which is 
more influential in the determination of conformity 
behavior than either variable taken alone. These 
data amply illustrate the importance of jointly 
considering field conditions and person char- 
acteristics in examining conformity/influence 
behavior. 
, That these results cannot be attributed to 
intelligence and/or cognizance of the “rigged” 
nature of the situation is supported by two 
ae findings. No differences were found 

etween open and closed subjects in gross in- 


tellectual ability (as estimated by academic grade 
point average). Previous studies with the Dog- 
matism scale also indicate that it is unrelated to 
intelligence for this population (Rokeach, 1960). 
With regard to the second possibility, no subject 
reported having suspected that the autokinetic 
movement of the light was not real, or that the 
confederate was anything but what he was intro- 
duced as. 

With regard to the postulated inability of the 
closed person to discriminate a message from its 
source, the present findings are seen as suggestive 
rather than definitive. What is required in future 
studies of this hypothesis is the use of an experi- 
mental situation in which it is possible for the 
subject to evaluate independently the information 
received about reality from a source and the 
source itself. A modified Asch-type procedure, 
such as that used by Crutchfield (1955), would 
appear to be appropriate. 

Two weaknesses of the present study are 
readily apparent. The control conditions of having 
the subjects alone in Phase B, and having the 
subjects in the presence of the confederate without 
having the confederate make responses during 
Phase B, would be helpful in determining the 
reasons for the presently-puzzling behavior of the 
open groups which tended to conform more to low 
than to high status sources. Also, the reliability of 
the two response measures used in this study has 
not been established. A third experimental phase in 
which the subject’s judgments are not preceded 
by those of the confederate is required to determine 
if it is the judgment of the subject or his announce- 
ment of his judgment that is altered in the present 
situation. Similar attempts to determine the 
temporal duration of “conformity” are notably 
lacking in previously-reported studies in this area 
also. 


SUMMARY 


This study tested the prediction that 
cognitively open (high authoritarian) and closed 
(low authoritarian) persons should be influenced 
differentially by high and low status information 
sources. 

Groups of 30 open and closed subjects, selected 
on the basis of extreme scores on Rokeach’s 
Dogmatism scale, were placed in an autokinetic 
situation. Each subject privately judged the 
directional movement of the light for 30 exposures 
and then made an additional 30 judgments follow- 
ing reports of a confederate of the experimenter. 
For one-half of each group, the confederate was 
presented as a college professor (high status); he 
was introduced as a high school student for the low 
status groups. During the second experimental 
phase, the confederate made responses which were 
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80% in the least-judged direction of the subject 
during the private judging experimental phase. 

Data analyses for two response measures of 
conformity fully support the prediction of an 
interaction between cognitive closedness (au- 
thoritarianism) and information source status. The 
closed groups agreed significantly more with the 
high status than with the low status source. Open 
groups tended to agree more with low than with 
high status sources. 
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MANIFEST CONTENT OF DREAMS OF CONVICTED SEX OFFENDERS! 


MARK I. GOLDHIRSH 
Slate Correctional Institution, Graterford, Pennsylvania 


pee (1954) extensive review of the 
literature on sex offenders and Ramsey's 
(1953) critical review of dreaming show the total 
absence of systematic quantitative investigations 
of dream material from sex offenders. During the 
past 6 years of administering therapy to convicted 
institutionalized sex offenders, I have observed 
that their dreams had a relatively high frequency 
of sexual content and uncamouflaged unlawful sex 
activity. The present study is an attempt to test 
these observations objectively and systematically. 
The two previous systematic investigations into 
the dreams of convicts (Hanks, 1940; Selling, 
1932), failed to include a control group and the 
sex criminal group was not studied. 


METHOD 


The subjects were 16 therapy cases selected from 
my files. All the inmates were confined at the State 
Correctional Institution at Graterford, Pennsylvania, 
and had volunteered for therapy. The data were col- 
lected between 1954 and 1960. 

In order to subject the hypothesis of this study to a 
quantitative systematic evaluation, a scoring manual 
was developed. Several hundred dreams of inmates 
with records of sexual offenses were studied and com- 
pared to dreams of convicted nonsexual offenders. 
Material in the dreams that seemed to differentiate the 
two groups provided the basis for expanding and 
refining the scoring categories. 

A brief version of the scoring system used to identify 
Sexual themes in the manifest content of the dreams 
follows? 

The dream text will score positive (+) “sexual” if it 
contains words, slang expressions, activity, or references 
to the following items: 

1. “Wet dreams” or any form of sex play. 

2. Sexual organs or their activity (erection, mastur- 
bation). 

3, Nakedness, or body description of any erotic 
hature: Sexy, lovely, beautiful, or well-built breasts, 
hips, thighs, legs, nipples, body, etc. 


This study was supported in part by Research 
Grant 534 from the National Institute of Mental 
Health, United States Public Health Service. 

I am indebted to Aaron T. Beck for his guidance 
and helpful suggestions, 

_! The unabridged scoring manual has been deposited 
with the American Documentation Institute. Order 
Document No. 6862 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress; 
Washington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks pay- 
able to Chief, Photoduplication Service, Library of 


4, Kissing, hugging, petting, necking, or erotic 
feelings from body contact. 

5, Sex crime: Any form of sex play with a child or 
teenager until 16 years of age. Any activity involving 
incest, rape, homosexuality, or sodomy (oral or anal 
genital contacts). An “sex crime” dream 
also receives a double score (++). 

The following are examples of dreams that are 
scored “sexual” (+): 

1. Woman in her early forties came into my room. 
She had large breasts, full face, dark eyes, short hair, 
and a megligee on. She just killed someone, We had 
coitus and enjoyed it. I had this same dream before 
but couldn’t see the whole body and did not have sex 
before. She has the physical qualities of my mother. 

2. A girl about 45 years old—she is good looking and 
I ask her out. We went to a movie, ate, and I hissed 
her and got an erection. 

3. I had a wet dream with a girl. This is an improve- 
ment. She was 30 years old. 

4, I was in some sort of camp or gathering where 

were dispossessed by a flood and had to sleep 
where they could. We were all jammed together sleep- 
ing. I went to sleep and awoke. The person next to me 
was touching me and it was a very nice girl. She was 
asleep. It was a shock to me. I felt around and made 
advances. I finally got down to business but there was no 

. She did not awaken. Just as I was reaching a 
climax Y awoke. I had a wet bed. 

5, I was on the beach swimming with a girl in the 
nude. She had long red hair. I don’t like red. I didn’t 
recognize her. I had a portable radio and plenty to 
drink. We swam and lay in the sun but there was no 
sex as I can remember. 

6. This woman suggested that we take a hot shower. 
So we went upstairs, undressed and we both went into 
the shower. We started fooling around, washed each 
other on the back, and J laid her there in the bathtub. 
As I was coming in the dream J came over the bed. 

The following are examples of dreams that are 
scored uncamouflaged “sex crime" and they receive a 
double score (+-+): 

1. My father shakes my hand and says that they 
forgive me for having sex with boys. I couldn’t forgive 


myself. 

2A little boy plays with my penis and squeezes it. I 
do not get a bardon. There is a glass on my belly and 
Iam supposed to get a hardon. 

3. Lately over and over mostly sex dreams and they 
never finish. I am trying to have sex with a young girl 
about 12 years old. It is hard to see her face. I undress 
her and play around with her tits and love her up. But 
I never get to finish up. Looks like the first girl I got 
in trouble with sexually. My wife is in the corner sneering 
at me. 

4. I had boys play with my dick. Then I went home 
and cried. 

5. I am trying to get my prick into a boy. Y have a 
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TABLE 1 
MATCHING DATA ON SEX CRIMINAL (SC) AND OTHER CRIMINAL (OC) PATIENTS 
Ae ees E col anis Nature of crime 
Pair 
SC|OC| SC|OC| SC | OC | SC | OC sc oc 
A[| 54| 50) w| W| 109) 116| M | M So d) Robbery 
B |53| 50 W| W| 113| 115| S |.S Sodomy Murder 
(teens?) 
C |40| 43| W| Nj 112 102| S | S Sodomy Burglary 
(teens) 
D | 42) 34 W| W| 112) 104| S | S Sodom) Robbery 
E |25| 32 W| W| 101| 84| M | M Sodomy, Burglary 
cl 
F | 44) 32| W| W| 98 114| S |S Sodomy Larceny 
(child) 
G |26| 28) W| W| 107| 105| S | S Soy Burglary 
c 
H |33| 33) W| W| 104| 110| M | M Sodomy Robbery 
(child) 


® Crime pattern means more than one conviction for current type of crime or an admitted extensive history. 
b Sodomy is defined as sexual activity involving oral or anal relations. 


© Child means until 9 years old. 
4 Teens is 10-16 years of age. 


TABLE 2 


FnEQUENCY oF “SEXUAL” AND UNCAMOUFLAGED 
“Sex CRIME” DREAMS OF MATCHED SEX OFFENDER 


AND CRIMINAL PATIENTS 

Number of sexual Number of uncamou- 
um dreamsoutofio** gaged sex crime | 

sc oc sc ue 
A 10 1 7 9 
B 5 2 3 0 
C 5 0 0 0 
D 4 0 2 0 
E 9 0 9 0 
F 3 1 1 0 
G 2 0 1 0 
H 4 0 1 Ü 


*p = .01, one-tailed, Wilcoxon matched-pairs signed-ranks 
test. 


** p 005, 


hardon and can't get it in. I awoke with a hardon. I was 
ashamed and afraid to tell you. 

6. Iam a tree surgeon and a kid is playing with my 
penis. 

After the scoring system was developed, it was then 
applied to a sex criminal group and a control group. 
The sex criminal group consisted of eight therapy cases 
with an extensive pattern of dangerous criminal sexual 
behavior against children. They were matched patient 
for patient as closely as possible with a control group 
(no arrests for sex crimes and no admitted sexually 
deviated history) on the basis of age, race, intelligence, 


marital study, pattern to their type of crime, age at ' 


time of first offense, and socioeconomic level. The 
relevant identifying data on all patients are listed in 
Table 1. The dreams of the eight matched pairs of 
subjects were not part of the series of dreams used in 
the construction of the scoring manual. 

The first 10 dreams reported during the therapy 
process were abstracted from each case record an 
typed on individual sheets of paper. The total sample 
of 160. dreams (10 per patient for 16 patients) was then 
arranged in random order and presented to another 
rater? who was unfamiliar with the proposed research. 
After a few minutes of clarification she then scored the 
dreams using only the scoring system of five steps 
presented in this paper. She had no prior knowledge 
concerning these cases or their dreams. These blind 
ratings were subjected to statistical evaluation report 
in Table 2. In order to get an estimate of the reliability 
of the ratings, I also rated the dreams. 


RESULTS AND DISCUSSION 


The raters agreed as to the presence or absence 
of a sexual element on 158 out of 160 dreams. This 
is almost 99% agreement. The scoring procedure, 
thus, is highly reliable. The results of the compari- 
son between the two groups is given in Table 2. 

There is virtually no overlap between the 
groups: 53% of the dreams of the sex criminal 
patients contain one or more sexual elements, 
whereas in the other criminal group, 5% of their 
dreams contain one or more sexual elements. 


*The author wishes to thank Stephanie Manning 
for her scoring help. 
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The raters further agreed as to the presence or 
absence of an uncamouflaged element of sex crime 
on 156 out of 160 dreams. This is almost 98% 
agreement. Note that in every case except one, the 
sexual offender had one or more dreams containing 
an uncamouflaged sex crime theme (Table 2), 
whereas with the eight convicts in the control 
- group there was no reference to sexual criminal 

activity in any of their 80 dreams. 

On the basis of these results, the conclusion 
seems warranted, within limits set by the institu- 
tionalized population and the small sample, that 
male sexual offenders show a greater incidence of 


N A T ee ee 


sexual and sex crime dreams than do other types 
of convicts. 
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SELF-ESTEEM, MANIFEST HOSTILITY, AND EXPRESSION OF HOSTILITY 


MILTON E. ROSENBAUM anp ROBERT F. STANNERS 
State University of Iowa 


number of recent studies have demonstrated 
EA complex relationships between personality 
test measures and aggressive behavior. The 
findings have suggested that the test responses may 
reflect individual differences both in level of 
potential expression and inhibition of aggressive 
response. Two such measures are the Self-Esteem 
(SE) scale and the Siegel (1956) Manifest Hostility 
Scale (MHS). Two studies (Rosenbaum & 
deCharms, 1960, in press) employing the SE scale 
have suggested that under conditions of threat to 
self-esteem, subjects with low SE scores are more 
strongly instigated to aggression than subjects who 
have high SE scores but that inhibition of expres- 
sion of aggression in social situations is greater 
among low SE subjects. Two other studies 
(Berkowitz, 1960; Hokanson & Gordon, 1958) 
have found that under conditions of low arousal to 
aggression subjects with high MHS scores express 
more aggression than low MHS subjects but under 
conditions of high arousal, subjects low in MHS 
increase their level of expression of aggression to a 
greater extent than subjects with high MHS 
Scores. 

In the previous studies relating SE scale scores 
and hostile response, control conditions in which 
aggression arousal was low were not employed. 
It was not possible, therefore, to examine the 
character of aggressive expression prior to instiga- 
tion and consequent inhibition. The present study 
was in part designed to correct this fault. In 
addition, it has been suggested (Rosenbaum & 
deCharms, in press) that a decrease of the inhibi- 
tion in expression of aggression may occur when 
the aggression is measured in a nonsocial context, 
i.e., in response to TAT cards as in the present 
study. Finally, the present circumstance allows for 
replication of the previous studies relating MHS 
Scores to aggressive response but in a slightly 
altered context. In the present study, the subject 
is subjected to personal attack in contrast to the 
indirect arousal procedures used in the other 
studies. 


METHOD 


Students in the elementary psychology course were 
administered a variety of personality measures, in- 
cluding the Self-Esteem scale (Rosenbaum & deCharms, 
1960) and Siegel’s (1956) Manifest Hostility Scale. 
Based on their scores on the SE scale, male subjects 
were recruited in pairs to participate in an experiment 
on “creativity.” 

In the laboratory, subjects participated in a proce- 
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dure adapted from a portion of Weiss and Fine's 
(1956) study. All subjects were asked to complete three 
easy (1-3) designs from the Arthur Stencil Design Test 
I following the standard instructions. Subjects in the 
Arousal condition then were given five difficult designs 
(16-20) with a time limit of one minute for each design 
(no subject completed this task satisfactorily). Between 
each design, the experimenter made increasingly de- 
rogatory comments concerning the subject’s perform- 
ance. Following the three easy designs subjects in 
the Nonarousal condition were given five relatively 
easy designs (4-8) with a one-minute time limit (all 
completed the designs satisfactorily). Between each 
design, the experimenter praised the subject’s per- 
formance. 

All subjects then were presented with two cards 
from the TAT series (2 and 8BM) for which they were 
to write stories for 5 minutes each. The experimenter | 
continued to offer derogatory or favorable remarks to | 
the Arousal and Nonarousal subjects, respectively. | 
Consistent with the procedure employed by Hokanson 
and Gordon (1958), Card 2 may be regarded as a Low 
Cue for aggression and Card 8BM as a High Cue. The . 
order of presentation of the cards was counterbalanced 
within the Arousal and Nonarousal groups. Finally, 
subjects responded to a questionnaire containing five 
questions of which Number 5 requested the subject to | 
indicate his impression of the experimenter on a Sit 
point scale ranging from "very favorable" to "very 
unfavorable.” 

Forty male subjects participated in the experiment. 
One-half of the subjects were exposed to the Arousal 
procedure and one-half to the Nonarousal treatment. 
In addition, one-half of the subjects in each of these 
conditions had low SE scale scores (45 or below) and 
the other half had high SE scale scores (50 or above), < 
No effort was made to select subjects on the basis of 
MHS scores. 


RESULTS 


Responses to Question 5 of the post-experiment 
questionnaire, which assessed the favorability of 
the subjects’ impression of the experimenter, may 
serve to evaluate the effectiveness of the Arou: 
manipulation. The analysis of variance of the 
scores derived from Question 5 is summarized in 
Table 1. The results indicate that the procedures 
produced differential hostility toward the experi 
menter between the Arousal and Nonarousal 
subjects. The obtained means were 3.0 and 1.6, 
respectively, a high score representing a negative 
evaluation. There was no significant difference 
between High and Low SE subjects, nor was there 
a significant SE X Arousal interaction. 

The TAT stories were scored by the category 
system employed by Hokanson and Gordon 
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TABLE 1 
oF VARIANCE OF THE FAVORABILITY OF 


TABLE 3 


Mean TAT HOSTILITY SCORES AND MULTIPLE RANGE 
Tests rog EacH Cuz LEVEL 


ANALYSIS 
IMPRESSIONS OF THE EXPERIMENTER 
Source df MS F 
Arousal 1 20.30  18.13** 
Self-Esteem 1 1.80 1.61 
Arousal X Self-Esteem i .07 — 
Error (within) 36 1.12 
p= < 01. 
TABLE 2 


- ANALVSIS OF VARIANCE OF TAT SCORES AS A FUNCTION 
or LEVEL OF SE AND AROUSAL CONDITION 


Source df MS F 
Self-Esteem 1 5.00 1.07 
-. Arousal 1 14.45 3.09 
- Self-Esteem X Arousal 1 2.45 => 
- Error (between) 36 4.67 
. Cue 1 24.20 16.46** 
Cue X Self-Esteem T .80 vm 
-. Cue X Arousal 1 6.05 4.12* 
-. Cue X Self-Esteem X 1 .45 = 
= _ Arousal 
. Error (within) 36 1.47 
"Total 79 
"p= < 05. 
“p= <0 


(1958), The principal rater was not informed of the 
conditions in which the subjects appeared and his 
reliability was checked by partial scoring on the 
part of the experimenter. Table 2 presents the 
summary of the analysis of variance of these scores. 
Both the main effect for Cue and the interaction 
of Cue X Arousal are significant. These results 
reflect greater aggressive response to the High Cue 
than the Low Cue card. In addition, the subjects 
were differentially responsive to the Arousal 
Conditions on only the High Cue card in that 
subjects in the Arousal condition presented 
greater aggression to the High Cue card than did 
Subjects in the Nonarousal condition. Both findings 
Principally support the experimental manipula- 
tions. The results fail to indicate any differential 
effect of level of SE. 
. Subjects in each Arousal condition were split 
into two groups on the basis of the over-all median 
of the MHS scores. Table 3 presents the resulting 
Ns and mean TAT hostility scores. The previous 
analysis (Table 1) indicated a highly significant 
difference between the two levels of cue. In order to 
facilitate analysis of variance by the approximation 
, method suggested by Walker and Lev (1953) 


1 We are grateful to Jesse E. Gordon for providing 
the scoring guide. 


Arousal Nonarousal 
i Low | High | Lo 
Fb | iis | Mis | MES 
W =8) |Q 912 | W=11)| W =9) 
High cue* 5.00» | 2.50% | 2.09» 2.115 
Low cue* 2.38e | 1.503 | 1.454 | 1.674 


* Means within each row marked with a letter in common do 
not differ significantly at the .05 level by multiple range test 
(Kramer, 1956). 

TABLE 4 


ANALYSES OF VARIANCE OF TAT SCORES AS A FUNCTION 
OF LEVEL oF MHS AND AROUSAL CONDITION 


High Cue Low Cue 
Source —— | 
df | MS F df | MS | F 
Manifest hostility 1| 2.719| 8.31**| 1|.142 | — 
Arousal 1| 1.538| 4.70* | 1|.107 | — 
Manifest hostility X| 1| 1.588| 4.86* | 1).299 | — 
Arousal 
Error 36| 0.327 36|.4551| 
*p = <05. 
p= < Ol. 


separate analyses of the results for each level of 
cue were conducted and are presented in Table 4. 
The significant Arousal X MHS interaction for 
the High Cue scores is clarified by reference to the 
results of the multiple range test (Kramer, 1956) 
presented in the upper row of Table 3. A significant 
difference is demonstrated between high MHS 
and low MHS subjects under Arousal while no 
difference is indicated between levels of MHS 
under Nonarousal. 

The Low Cue results did not show a significant 
Arousal X MHS interaction although the trend of 
the means is in the same direction as found for the 
High Cue responses. As support for the trends 
indicated in the Low Cue findings, the multiple 
range test? presents significant differences between 
the mean for the high MHS subjects in the Arousal 
condition and each of the other means but no 
difference among these latter means (lower row, 
Table 3). 

A complex interaction among SE, MHS, and 
Arousal is possible but not readily assessed. 
Statistical evaluation is precluded by excessively 
imbalanced cell entries when all three variables are 
included. Inspection of means does not suggest 
statistical interaction of the variables. 


2 The multiple range test does not necessarily require 
a significant F ratio. 
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Scores on both the SE scale and the MHS were 
available from the source population in the 
elementary psychology course. Product-moment 
correlations were computed. For males (V = 120), 
r = —.173 which is not significant at .05 level. A 
low but significant (P = < .01) negative relation- 
ship is found for females (r = —.244, N = 169). 


Discussion 


The present results present no support for a 
relationship between SE scale scores and hostility 
expression in TAT stories despite apparent success 
in experimental arousal of aggression. The present 
procedure differs from earlier studies in which SE 
scores have been found to relate to hostile response 
on the dependent measure of hostility. In previous 
studies aroused subjects were permitted to respond 
directly to the source of instigation either orally or 
by means of written personality sketches. In the 
present study, only indirect imaginative responses 
were assessed. In contrast with previous studies 
in which arousal was accomplished by a peer in a 
clearly arbitrary manner, the present arousal 
procedure was carried out by the experimenter 
(a graduate student) and was task related. This 
difference may have contributed to excessive 
inhibition, thereby obscuring arousal differences 
between levels of SE. . 

Interpretation of the results that indicate a 
direct relationship between Arousal and MHS 
scores in their effect on hostility expression does 
not require reference to inhibitory concepts. In 
previous studies differences in the direction of 
greater hostility on the part of subjects with high 
MHS scores appeared under Nonarousal conditions 
and a greater increase in hostility expression 
occurred for low MHS subjects under arousal 
conditions. This result has been interpreted as 
reflecting the operation of an inhibitory variable 
for the aroused high MHS subjects based on the 
production of strong aggression-anxiety at high 
levels of aggression. potential. In the "present 


situation, MHS level is related to hostile responses - 


only in the Arousal condition and the results 
indicate a positive relationship; the higher the 
MHS score the higher is the hostile response in the 
TAT stories, As indicated, this relationship must be 
regarded only as a trend for the Low Cue condition. 
Several possible explanations are available to 

account for the discrepancies in results obtained 
in the present and previous studies employing the 
MHS scores. The Arousal procedures differ in that 
the subject is directly derogated by the experi- 
menter in the present study while in earlier studies 
more indirect methods of arousal were employed. 
Hokanson and Gordon (1958) relied on the 
presentation of the case of a juvenile delinquent 


of level of self-esteem was not related to hostile 
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who was portrayed as more or less sadistic in order | 
to effect High and Low Arousal, respectively. 
Berkowitz’s (1960) subjects received messages from 
a peer indicating either liking or disliking of the 
university and its students to create differential 
arousal. One possibility is that the present proce- 
dure served to arouse stronger aggression although, 
as has been suggested, the character of the instiga- 
tor and the task relevance of the attack is likely 
to have produced greater inhibition. To reconcile 
these factors it may be suggested that the 
heightened strength of instigation may have served 
to raise the goal gradient for hostile response so 
that in relative terms, the influence of the inhibi- 
tion gradient was not apparent in the results. High 
MHS subjects in particular could proceed “closer” 
to the goal of aggression without crossing the 
inhibitory gradient. 

Both earlier studies found greater hostility 
expressed by high MHS subjects under Low 
Arousal conditions. which is in contrast to the 
present study in which no difference is obtained 
between the MHS levels. The present procedure | 
emphasized approbation throughout the subjects 
performance while it is possible that at least to 
some degree hostility was aroused in the Low " 
Arousal condition in the earlier studies. It may be | 
suggested that some minimal degree of arousal is^ 
necessary before differentiation in hostile expres- — 
sion occurs for the different MHS levels. 


SUMMARY 


The relationship of two personality measures to 
hostile response under conditions of differential 
arousal to hostility was evaluated. Responses to 
TAT cards served as the dependent measure of 
hostility. The arousal procedures produced differ- s 
ential response to the TAT cards but the measure 


expression. Following arousal, subjects with high 
Scores on the Manifest Hostility Scale (MHS) 
responded with greater hostility than low MHS 
subjects to the TAT cards. No difference ™ 
response was found between aroused and non- 
aroused subjects with low MHS scores. Discrep- 
ancies between the present results and those 
obtained in other experiments were discusse 
primarily in terms of differences in procedure. 
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A TEST OF THE PROPOSITION: WE WANT TO BE ESTEEMED MOST. 
BY THOSE WE ESTEEM MOST HIGHLY 


CECIL R. WURSTER, BERNARD M. BASS, Axp WAYNE ALCOCK 
Louisiana State University 


i. is the value ascribed to a person, as a 
person, regardless of the position that he 
occupies (Bass, 1960, p. 277). A tenet of modern 
personality theory that is basic to several schools 
of psychotherapy, seemingly axiomatic because of 
its widespread acceptance in psychology, is that 
we are motivated to maintain or increase our 
own esteem in the eyes of others. "This cherished 
possession [selfhood] must forever be made more 
adequate, more worthy, and it must forever be 
defended against stain and injury, whether from 
the acts of others or of the valuing organism” 
(Murphy, 1947, p. 523). 

‘The maintenance and enhancement of esteem is 
directly revealed in our employing such defense 
mechanisms as rationalization, identification, or 
projection, as well as in our tendency to over- 
estimate our esteem more often than we under- 
estimate it (Bass, 1960, pp. 324-325). The desire 
to support and enhance one’s own esteem suggests 
a dynamic relationship significant for leadership 
theory: we want to be esteemed most by those we 
esteem the most. 

The proposition can be derived as follows: 


I want to be valued (p. 364).! Some people, in my 
eyes, are more able or more valuable or more estimable 
than others (p. 277). I accept more readily and attach 
more weight to the influence and opinions of those I 
esteem most highly (p. 289). I want to be valued by 
those whom I value most highly. 


It follows also that: 


I will want those I esteem the most to know of my 
worthy performances. Therefore, I will be motivated to 
influence those most whom I most esteem about matters 


enhancing my own esteem. 


What devolves here is a circular leadership 
process. “I esteem you. You can influence me. I 
will want to influence you to look upon me favor- 
ably. I will be less likely to attempt to influence 
those whom I do not esteem, to regard me favor- 
ably.” This devolution, in turn, is consistent with 
the proposition that “we are more likely to inter- 
act, the more we esteem or value each other” 
(p. 359). 

Thus, for example, we would infer that the 
classroom instructor should be much more con- 


1 All subsequent page references are from Bass 
(1960). 


cerned to have his efforts favorably viewed by 
those whom he regards as good students than by| 
those he regards as poor students. In turn students} 
should be more concerned about their evaluation 
from an instructor whom they regard highly than 
from an instructor whose opinions they discount, 

This study tests the proposition that we art 
motivated to influence most those whom we esteem! 
the most, Esteem is based on ability to som 
extent (p. 284), and since a person's abilities va 
from one situation to another, say from a scholasti 
to a social situation, so should his esteem. At the 
same time, esteem appears to transfer through 
stimulus generalization across situations (p. 287), 
A second purpose of the study was therefore to set 
whether the proposition is bounded by specific 
situations. That is, the study sought to examine 
which of the following two propositions is more 
adequate: 


I want to be esteemed in a particular situation by 
those I esteem specifically in that situation. 

I want to be esteemed in a variety of situations by 
those whom I esteem regardless of the situation. 


METHOD 
Esteem: Scholastic and Social 


One hundred ten students from several elementary 
psychology classes were asked to list the first names ol 
15 campus friends. The 110 students listed a total 
1,650 friends. No doubt, the same person was cited by 
several respondents. Indeed, the names of many of the 
respondents are likely themselves to have appeared 
the lists of other respondents. But our attention V. 
focused on 110 independent respondents, each Wi 
their 15 independent responses, not on any objectiv 
attributes of those identified in the 1,650 responses 

Each respondent was asked to rank his 15 friend 
in the order he would prefer each “as a study partner 
This rank order of friends provided an index of the 
extent each friend was esteemed scholastically by 
respondent, The actual esteem accorded to a particul 
person by all his associates was never obtained, oni 
what one respondent thought of him. 

In the same way, each respondent was asked tori 
his friends again according to the extent he WO 
prefer each “to join a social club of mine.” This 
order provided an index of the social esteem 
friend mentioned. 


The instructions were worded, thus: 


. 
... Assume that you're to recommend the peo 
that are included in your list for membership 
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CRITIQUE AND NOTES 


social club of which you're a member. You're to 

recommend them in your order of preference for 
- membership. Place the number J after the name of 
- theperson you would most want to be in your club. 
| [Experimenter pause] Place the number 15 after the 
name of the person you would least want to be in 
© your club. [Experimenter pause] Now rank all other 
names in your list. Be sure to use all numbers be- 
L tween 1 and 15. Don’t use any number more than 
N once... 

Now assume that you're to recommend people 
that are included in your list for an honorary award 
for scholastic achievement. You're to recommend 
these people in your order of preference for this 
award, Place the number J after the name of the 
person you would recommend first. [Experimenter 
pause] Place the number 15 after the name of the 
person you would recommend last. [Experimenter 
pause] Now rank all other names in your list. Be 
sure to use all numbers between 7 and 15. Do not 
use any number more than once. 


Targets Selected or Avoided for Communications 
about Oneself 


Each respondent was given a list of 15 questions in 
the scholastic area such as: 
Which friend would you most want to know that 
you have a superior IQ on an intelligence test? 
Which friend would you least want to know that 
you have had to take a remedial arithmetic course? 
1 Which friend would you most like to know that you 
- have won a scholarship? 
Intermixed among these scholastic questions were 
15 socially relevant questions such as: 
__ Which friend would you most want to know that 
you have dated a campus personality? 
Which friend would you tell first that you have a 
- Dew sports car? 
Which friend would you least want to know that 
you do not have the proper clothes to wear to a social 
"function? 
- Finally, 15 buffer questions that overlapped both 
areas or were occasionally irrelevant were likewise 
Scrambled in the total list of 45 questions: 

Which friend’s opinion do you value the most? 

Which friend would you least like to know that you 
havea religious problem? 
Which friend would you least like to know that you 
have sprained your ankle? 


RESULTS AND DISCUSSION 


Relation between Esteem and Selection as Target of 
Communication . 


The same single friend could be mentioned by a 
Tespondent as much as 15 times on the 15 social 
‘items or on the 15 scholastic items. But only 3 
Tespondents did this of the 110. A Poisson distri- 
bution of frequency of repeated mentions emerged 
‚as shown in Table 1. Over half of the acquaint- 
ances were mever mentioned by respondents in a 
given social or scholastic area on any of the 15 
| tems. On the other hand, extremely few friends 
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TABLE 1 
FREQUENCY THAT A FRIEND Was MENTIONED ON 15 
SOCIAL QUESTIONS AND ON 15 SCHOLASTIC 
QUESTIONS 


Number of friends 
Frequency with which the mentioned on: 
same friend is mentioned in 


response to the 15 questions 45 social — 1S scholastic 
questions questions 

0 895 1006 
1 372 288 
2 178 151 
3 78 65 
4 59 40 
5 26 26 
6 19 24 
7 9 19 
8 3 11 
9 or more 11 19 

1650 1650 


earned more than nine mentions from a respondent 
in either area. It appeared easy for respondents to 
list 15 acquaintainces by first name and to rank 
them. Yet most respondents had a smaller cadre 
of dose friends to whom they preferred to direct 
all favorable information about themselves? 
Figure 1 shows how this preference centered on 
those friends they most esteemed. Thus, friends 
ranked first in esteem socially received an average 
of 40 mentions in response to the 15 social ques- 
tions. Similarly, friends ranked first scholastically 
received 3.7 mentions of the 15 scholastic items. A 
sharply accelerating curve appears to describe the 
relation between the social esteem with which a 
respondent holds a friend and his tendency to 
single out the friend as a preferred recipient of 
favorable social information. The same holds for 
scholastic esteem and selection as a recipient of 
scholastic information. 

An unexpected and possibly significant reversal 
appeared. Note in Figure 1 that minimum mention 
asa preferred recipient occurs for those ranked 11th 
and 12th in esteem rather than 14th or 15th. 
Some kind of perversity in responding particularly 
in the social area seemed to develop among about 
10% of the respondents. For example, no respon- 
dent gave any friend ranked 11th, 12th, or 13th 
more than three mentions. Yet those friends 
ranked 14th or 15th in esteem received four 
mentions as targets for favorable communication 
from four respondents and five mentions from two 
respondents. 

2 Wherever we state “desired recipient of favorable 
information,” we also imply “undesired recipient of 
unfavorable information.” The 15-item lists were 
balanced almost equally with respect to favorable and 
unfavorable information. 


> 
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AVERAGE 
TIMES 

MENTIONED 

AS DESIRED 
RECIPIENT 

OF FAVORABLE 
INFORMATION 
ABOUT 

RATEE 


E 


Social 


? 


ESTEEM ACCORDING TO RANK OF 
NOMINATION OF FRIEND BY RATEE 


Fic. 1. Relation between esteem of friend according 
to rank of nomination and average times mentioned as 
desired recipient of favorable information. 


A few subjects may possibly have misinter- 
preted instructions and ranked as “15” their 
most esteemed friend and as “1” their least 
esteemed friend. Yet this "mechanical" failure 
seems most unlikely in view of the specific in- 
structions that were read word-for-word by the 
experimenter. 

In designating the preferred recipient of favor- 
able information about the respondent, one men- 
tioned a bottom-ranker in his own esteem hierarchy 
7 times, another 9 times, and one even 12 times! 
The latter was an extreme number of mentions 
even for top ranking friends. It seems doubtful 
that chance or haphazard responding is involved, 
in these bizarre results for no similar extreme 
numbers of mentions emerged anywhere else in 
the scatterplot other than here or for the most 
highly esteemed members. As many as 10% of 
the respondents may, thus, single out as recipients 
of favorable information those they esteem least 
perhaps as a means of self-enhancement through 
increased contempt or contrast. A suggested 
dynamic for these respondents might be: “I value 
myself. I do not value you. I tell you how good I 
am. This furthers my feeling how much better 
Iam than you by contrast.” 


General or Specific Relationship? 


As can be seen in Figure 1, the curvilinear re- 
lations between esteem and mentions were the 
same in both the social and ‘scholastic areas. But 
the question remains as to whether halo or general 
rating bias was responsible, or whether social 
esteem was associated specifically with social 


CRITIQUE AND NOTES 


rather than scholastic mentions and scholastic | 
esteem with scholastic rather than social mentions, 

The data were made linear by collapsing the 15 
esteem ranks into two classes: the upper 7 and the 
lower 8 ranks, and by collapsing into two classes 
the frequency with which a friend was mentioned: 
mentioned at least once vs. never mentioned, 
Phi coefficients were then computed for each of | 
the 110 subjects between the tendency for the 
subject to place a friend in the upper or lower 
half in esteem and the tendency to mention the 
friend as a desired recipient of favorable informa: 
tion. In collapsing these cells we undoubtedly 
reduced the strength of the associations displayed | 
in Figure 1, but at this point we were interested 
in examining the relative strength of associations 
by situation. Table 2 shows the average phi; 
coefficients, corrected for the collapse of 15 into, 
two classes, to provide estimates of product- 
moment correlations that would have been ob- 
tained had we not dichotomized the data. The 
standard error of the mean correlations based on| 
the distributions of 110 values are also “given | 
showing that the distance from the obtained means | 
and zero were about 10 times their standard errors 

The comparable mean correlation between the 
tendency to be selected as the recipient of social 
and scholastic information was .44 while the mean! 
rank-difference correlation between the 15 social 
esteem rankings and the 15 scholastic esteem 
rankings assigned by each of the 110 respondents 
was .39. 

Social esteem appeared to make a friend equally | 
desirable as the recipient of either socia y 
($ = .41) or scholastically ($ = 42) favorable. 
information about oneself. On the other hand | 
scholastic esteem seemed somewhat more specifi- 
cally associated with desirability as a target for 
favorable scholastic communications about on’ 
self (6 = .43) than favorable social: communica- 
tions about oneself ($ = .22). 

These results suggest that if we admire someone 
socially we are likely to want him to know of our 
academic and social successes. But if we esteem 
someone scholastically, we tend to be more con- 


TABLE 2 a 
CORRECTED PHI COEFFICIENTS BETWEEN SOCIAL 
Scuotastic EsrEEM AND TENDENCY TO e 
SELECTED AS THE DEsmED RECIPIENT © 
INFORMATION ABOUT THE RATEE 


Esteem 
Desired as recipient of 
favorable information mm Scholastic ` 
1 
SEI dae. 2 
Scholastic 21:00 4 09 


CRITIQUE AND NOTES 


cerned that he learn of our academic worth than 
our social reputation. The specific situations were 
selected, of course, not for their own sake but to 
test a proposition and they may not be equipotent. 
On a college campus, particulady in the post- 
Sputnik era, scholastic success may enhance 
social self-esteem but social success may have less 


effect on scholastic appraisal. 


Implications 

Extrapolation from these results suggests that 
we tend to behave in a manner directed at main- 
taining and enhancing our own esteem in the 
eyes of those we esteem rather than in the opinion 
of just anybody. That we are, thus, more respon- 
‘sive to the suggestions of those we esteem but- 
‘tresses the coercive effects of esteem on success as 
‘a leader. One of us (Bass, 1960) has noted else- 
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where that esteem can produce coercive as well 
as persuasive success as a leader, although the 
possible coerciveness arising from one’s power as 
a person (irrespective of position) has not been 
much appreciated or investigated (p. 289). We 
follow the esteemed person because we believe 
him to have the ability and the power as a person 
to reward or punish us. Part of this coercive power, 
these data suggest, derives from the effects of the 
esteemed figure in granting or denying to us words 
of support, encouragement, and admiration. 
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A CROSS-CULTURAL INVESTIGATION OF THE RELATIONSHIP BETWEEN 
ETHNIC PREJUDICE, AUTHORITARIAN IDEOLOGY, AND PERSONALITY! 


ARON WOLFE SIEGMAN? 
Bar-Ilan University, Israel 


pons major finding reported in The Authori- 
tarian Personality (Adorno, Frenkel-Bruns- 
wik, Levinson, & Sanford, 1950) is that prejudice 
is part of a larger attitudinal or ideological syn- 
drome, which can be measured by the F Scale, 
and which in turn is associated with certain per- 
sonality needs and defense mechanisms, e.g., 
aggression, repression, projection, etc. Although 
serious questions have been raised about the 
adequacy of the original sample (Hyman & Sheats- 
ley, 1954), subsequent studies with diverse groups 
have produced essentially similar results (Christie, 
1954, p. 155). In the United States, however, 
ethnic tolerance plays a prominent role in the 
ideologies of leftist and liberal political groups. 
Thus in selecting subjects who vary on the prej- 
udice variable one may automatically select 
along a political radicalism-conservatism dimen- 
sion, This raises the possibility that the obtained 
correlates of ethnic prejudice may really be re- 
lated to the radicalism-conservatism dimension 
rather than to prejudice as such. 

The present study of the relationship between 
ethnic prejudice, authoritarian ideology, and per- 
sonality in a group of Israeli college students in- 
cluded two measures of ethnic prejudice. One 
measure was designed to evaluate the subjects’ 
prejudice against Arabs, and the other the subjects’ 
prejudice against recent Jewish North African 
immigrants, Whereas political parties in Israel 
differ in their attitudes towards the Arabs—the 
leftist parties advocating greater equality for the 
Arabs—prejudice against the North African im- 
migrants is equally condemned by all political 
parties. Thus the present study afforded the op- 
portunity of investigating the correlates of ethnic 
prejudice that is independent of political commit- 
ment. 

METHOD 


Subjects. The subjects consisted of 38 female and 
16 male undergraduate psychology majors at Bar-Ilan 
University. One half of the group was European-born 
and the other half Israeli-born, but all subjects were of 


1 This study was aided by a grant from the Com- 
mittee for the Encouragement of Research, the Israeli 
Labor Organization. 

The author is grateful to Jacob Jonah and Tsadok 
Baruch for their help in the collection of the data of 
this paper. 

2 Now at the University of Maryland School of 
Medicine. 


European middle class background. The subjects' age 
range was from 19 to 24 with about 85% of the subjects 
in the age range 19-22. 

Authoritarian ideology scale. The 10 F Scale items! 
with the highest discriminatory power (Adorno et al, | 
1950, p. 260) were selected from a pool of F Scale items 
on which there was unanimous agreement by the 
members of an advanced seminar class that they 
possess transcultural meaning and do not refer to} 
historical events or situations with a special meaning 
either in the United States or in Israel. Each item had 
the following response categories: strongly agree, agree, | 
agree slightly, disagree slightly, disagree, strongly dis- | 
agree, with a possible score from 1 to 6. 

Authoritarian personality questionnaire. The Gough. 
Pr scale, consisting of MMPI items that differentiated 
significantly between prejudiced and nonprejudiced 
individuals and correlated significantly with the F | 
Scale, was used as an index of authoritarian personality | 
characteristics (Gough, 1951a, 1951b). An analysis of | 
the items of this scale reveals that they cover the 
various needs and adjustment mechanisms mention 
by Adorno et al. (1950) as typical of the prejudiced | 
individual (Gough, 1951b). T 

Prejudice scales. The subjects’ ethnic prejudice was. 
determined by means of two especially constructed. 
Likert-type scales: a 19-item scale of prejudice against 
Moroccan immigrants (NP scale), and a 17-item scale 
of prejudice against the local Arab population ( 
scale). The original version of each scale consisted of | 
especially constructed items, similar in content to the | 
items used by Adorno et al. (1950) for their Ethno- 
centrism and Negro prejudice scales. The final version | 
of each scale consisted of all items that correlated 
(r) .65 or higher with the subjects’ total score con the 
original version of each scale. Test-retest reliability 
(r) was .91 and .89, respectively, for the two scales. 
‘The scoring system was the same as for the F Scale. 


RESULTS 


Table 1 gives means and standard deviations | 
for the various scales and questionnaires. There. 
was no significant sex difference on any of the 
measures. (The ¢ values ranged from .13 to 38.) 

Table 2 presents the intercorrelations o) 
tween the various measures. A highly significant 
correlation was obtained between the two prejudio 
scales. Only one of the prejudice scales, the el 
scale (Arab prejudice) was significantly relat | 
to the F Scale. Both prejudice scales were signi" 
cantly related with the Gough Pr scale, measuring 

3 The following items were used: 1, 8, 13, 18, 2h 
26,27,31,37,42 (Adorno, Frenkel- Brunswik, Levinso™ 
& Sanford, 1950, p. 260). 
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TABLE 1 
SCALE STATISTICS 
Measure M SD 
F scale 39.98 9.67 
NP scale 45.55 14.44 
AP scale 57.89 18.35 
Pr scale 10.37 4.38 
TABLE 2 
INTERCORRELATIONS BETWEEN THE VARIOUS 
MEASURES 
AP scale F Scale Pr scale 
NP scale .66** .23 do e 
AP scale .44** .59** 
F Scale A ytd 


** Significant at .01 level. 


authoritarian personality traits. A. highly signifi- 
cant correlation was obtained between the F 
Scale and the Gough Pr scale. 


Discussion 


These results provide cross-cultural validation 
for the major conclusions of The Authoritarian 
Personality. The high correlation between the 
subjects’ Arab and North African prejudice scores 
is consistent with the hypothesis that ethnocen- 
trism is a generalized trait. The fact that the Gough 
Pr scale, a measure of authoritarian personality 


. needs and defenses, was a significant source of 


| 
| 


| 
| 
f 


variance in both types of prejudice is certainly 
Consistent with the basic theme of The Authoritar- 
ian Personality. The results of the present study 
do suggest, however, that not all ethnic prejudice 
is related to authoritarian ideology. The failure to 
obtain a significant correlation between the F 
Scale and North African prejudice, may be attribu- 


bf 
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table to the independence of the latter from politi- 
cal affiliation. 

The significant positive correlation between the 
F Scale and the Gough Pr scale is consistent with 
the hypothesis that personality needs and defenses 
determine attitudes, although the correlation 
cannot prove the supposed causal relation. An 
alternate possibility is that children learn an 
authoritarian ideology from parents who share it, 
and who, in keeping with their ideology, engage 
in socialization procedures that produce essentially 
authoritarian personalities, That is, rather than 
reflecting personality needs and defenses, ideology 
may influence socialization practices which in turn 
influence personality needs and defenses. The fact, 
however, that in the present study anti-Moroccan 
prejudice was related to the personality charac- 
teristics indexed by the Gough scale without 
being part of an authoritarian ideology as meas- 
ured by the F Scale does suggest the former 
causal sequence. 
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EFFECTS OF FAMILIARITY WITH A CONTROVERSIAL ISSUE ON ACCEPTANCE 
OF SUCCESSIVE PERSUASIVE COMMUNICATIONS 


EDWIN J. THOMAS, SUSAN WEBB, AND JEAN TWEEDIE! 
University of Michigan 


NE of the significant conclusions of The Order 
O of Presentation in Persuasion by Hovland, 
Mandell, Campbell, Brock, Luchins, Cohen, Mc- 
Guire, Janis, Feierabend, and Anderson (1957) 
denies the existence of a general law of primacy in 
persuasion—of any universal tendency for the 
first of two successive, persuasive communica- 
tions to be more effective than the second. This 
conclusion, however, is based largely upon ex- 
periments in which familiar topics were used 
(e.g., protective tariffs, reduction of the legal vot- 
ing age, socialized medicine), and familiarity with 
controversial topics is most generally accompanied 
by previous exposure to persuasive communica- 
tion concerning the issues. Clearly it is not possible 
to study the effects of the first exposure to a per- 
suasive message if a person is already familiar 
with the topic and its controversial features, al- 
though the immediate serial effects of two-sided 
persuasion can be studied with familiar topics. 
Thus the question of whether there is a general 
law of primacy in persuasion must be examined in 
experiments making use of unfamiliar topics. 

With a new topic, the first persuasion has the 
possible advantages over the second of being more 
fully attended to, of being learned with less inter- 
ference, of being accepted more readily, and of 
being established as a first impression that struc- 
tures the perception of the subsequent persuasive 
message. The operation of any or all of these could 
generate primacy. Particularly relevant to this 
problem are studies by Luchins (Hovland et al., 
1957), who attributed to set or Einstellung the 
marked primacy effect found in forming concep- 
tions of an imaginary person from a limited de- 
scription of him, While Luchins’ studies had the 
virtue of dealing with the unfamiliar topic of a 
previously unknown person, they did not involve a 
controversial issue. The experiments reported 
here compare the effectiveness of successive pro 
and con persuasions with unfamiliar and familiar 
controversial topics to test the hypothesis that 
primacy should occur with an unfamiliar topic 
and that there should be no primacy or recency 
with a familiar topic. 


l'The authprs wish to express their gratitude to 
Lynette Beall, Sharon Edwards, Cheryl Howard, Louis 
Jaffe, Daniel Marcus, Gail Smith, James Schreuder, 
Fred Steel, Terry Sokay, and Elizabeth Sweebe for 
help provided in various phases of the experiments. 
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EXPERIMENT I 
Method 


The subjects were 38 male and 24 female, high 
school seniors who participated in the experiment as 
“captive volunteers” while attending one of their 
classes. 


Topic. The issue was whether to use Dr. Murray's 
new anti-cancer serum routinely in medical practice. 
To familiarize the subjects with it, they read a report 
on Dr. Murray's research essentially as it had appeared 
a few months earlier in an issue of a popular weekly 
news magazine; the brief described general facts con- 
cerning cancer, cancer research, how Dr. Murray de- 
veloped the serum from horse blood, and some elements 
of the controversy concerning the wisdom of using this 
serum in medical practice. In the nonfamiliar condition 
the filler article was one on the discovery of a new 
tranquilizing tea, taken from the same magazine. 

The arguments were not labeled as controversial; 
the pro position was entitled “An Anti-Cancer Serum” 
in which it was advocated that Dr. Murray's anti- 
cancer serum be used routinely in medical practice, 
and the con arguments were entitled “Information 
On the Use of Dr. Murray’s Anti-Cancer Serum” and 
advocated withholding the use of serum from use M 
medical practice until further inquiry into its effects 
had been undertaken. T 

Response measures. The three measures of attitude 
regarding the use of the serum routinely in medical 
practice were nine-point Likert-type scales ranging 
from strongly disagree at 1 to strongly agree at 9, with 
5 being neutral. The crucial question was “An anti- 
cancer serum recently developed by Dr. Murray of 
Canada should be routinely used in medical practice 
and was embedded among six other questions, some of 
which were unrelated to the serum. 

A multiple-choice knowledge test composed of 14 
items was given to determine the effectiveness of the 
familiarity induction. The knowledge score consis! 
of the number of correctly endorsed items of the 10 
dealing with significant facts of the familiarity induc- | 
tion; the remaining 4 items related to the tran! 
tea. 

Procedure. There were 17 subjects who received the 
pro-con order and 14 the con-pro in the familiar condi- 
tion, and 16 who received the pro-con order and 15 the. 
con-pro order in the nonfamiliar condition; males an 

cr were represented approximately equally ™ 
Subjects took one hour to complete the following 
tasks in a booklet: Attitude Questionnaire 1, f 1 

induction (article on Dr. Murray's serum oF filler 
article), Attitude Questionnaire 2, knowledge tetr 
persuasive communications, and Attitude Question 
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naire 3. While sitting one seat apart in the classroom, 
they went through the booklet without knowing what 
was to come subsequently, and after each page of 
responses was completed, the subject was directed to 
raise his hand so that it could be picked up. 


Results 


Success of inductions. The score on the knowledge 
test was 5.65 in the familiar groupand 1.6 in thenon- 
familiar group (p < .01 by ¢ test), indicating that 
subjects in the familiar group generally became fa- 
miliar with the topic and that subjects in the non- 
familiar group were previously unfamiliar with the 
topic. The familiarity induction also changed atti- 
tudes ina con direction; in the familiar condition the 
means on Attitude Questionnaire 1 and 2 were 5.06 
and 4.39, respectively (¢ = 59.5, p < .05 by Wilcox- 
on’s test); the analogous means for the nonfamiliar 
condition were 4.61 and 4.55, respectively. How- 
ever, the scepticism regarding the use of the serum 
caused by the familiar induction cannot be re- 
garded as disposing subjects in that condition to- 
ward either a primacy or recency effect. 

The persuasive communications succeeded in 
moving the majority of the subjects from their 
original opinions. In the familiar condition, 68% of 
the subjects changed their opinion on Attitude 
Questionnaire 2, 71% changed from Attitude 
Questionnaire 2 to 3, and 77% had different posi- 
tions on Questionnaire 3 as compared to Ques- 
tionnaire 1; analogous percentages for the non- 
familiar condition were 45, 87, and 87, respectively. 
The mean amount of movement fell between one 
and three scale steps, with the nonfamiliar con- 
dition bringing about somewhat more movement 
than the familiar condition ( is not significant). 

Altitude change. The primacy score computed 
for each subject was the amount of change, in the 
direction of the position advocated first, from the 
Score obtained before the persuasive communica- 
tions to the score gotten afterwards. When these 
Scores were ranked over both conditions and the 
familiarity and nonfamiliarity groups compared by 
the Mann-Whitney U test, there was no differ- 
ence found for changes from Attitude Question- 
naire 1 to 3 or from Attitude Questionnaire 2 to 3. 


ExPERIMENT II 


One reason why the expected primacy effect did 
hot appear in the nonfamiliarity condition in the 

st experiment is that the measurement of the 
effects of the persuasive communications occurred 
too soon after the second persuasive communica- 
tion, Miller and Campbell (1959) have shown that 
if a general primacy effect exists it is most likely 
to be manifested with delayed rather than imme- 
diate measurement. The Miller-Campbell theory 
assumes that a persuasive communication may be 
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forgotten like anything else and that persuasive 
messages are forgotten much more rapidly im- 
mediately after their initial learning than at later 
periods. Like the Ebbinghaus forgetting curves 
that served as models for the analogy, these for- 
getting curves take the form of a negatively ac- 
celerated function with time. Given this shape of 
the forgetting curve and the additional assumption 
that the strength of the first persuasive communi- 
cation is stronger than the second (thereby ele- 
vating the curve for the first message), then the 
early measurement of the effects of opposing, 
persuasive communications may obscure the 
effects of the first message because of the relatively 
high asymptote of the more recent second message. 
Later measurement of these effects would reveal 
primacy because the asymptote for the first mes- 
sage would have settled at a higher level than that 
for the second message (Miller & Campbell, 1957, 
p. 2, Figure 2). To examine the possibility. that a 
primacy effect was obscured in the nonfamiliar 
condition by too early measurement a second ex- 
periment was done replicating the first and adding 
delayed measurement of the effects of the per- 
suasion communications. 


Method 


Subjects were 75 female undergraduates at the 
University of Michigan who participated in the ex- 
periment as part of a requirement for an introductory 
course in psychology. In the condition of immediate- 
measurement there were 17 subjects in the nonfamiliar 
group and 19 in the familiar group; in the condition of 
delayed measurement there were 17 subjects in the 
nonfamiliar group and 22 in the familiar group. The 
topic, procedure, and response measures were the same 
as in Experiment I, with the following exceptions: 
Attitude Questionnaire 2, given in the first experiment 
following the familiarity induction, was eliminated to 
reduce the number of times the subject had to give his 
attitude, thereby hopefully reducing any possible con- 
sistency effect. Two additional attitudinal items were 
added to the questionnaires given to the subjects 
before and after the persuasive communications, and 
five additional items were inserted into the final 
questionnaire. All of the items were nine-point Likert 
scales having the same form as described for those 
used in Experiment I. A question was inserted concern- 
ing the extent to which subjects perceived the pro and 
con arguments as controversial. Luchins’ work (Hov- 
land et al., 1957) suggests that subjects who perceive 
the statements as noncontroversial would be more 
likely to view the communications as a single unit than 
subjects who view the arguments as controversial. 
Communications seen as comprising a noncontradic- 
tory whole should be more subject to Einstellung 
structuring, and hence a primacy effect, than com- 
munications viewed as Separate arguments. 


Results 


Success of inductions. The mean score on the 
knowledge test was 6.75 in the familiar condition 
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and .54 in the nonfamiliar condition ($ < .0001 by 
1 test), again indicating the effectiveness of the 
manipulation of familiarity. The persuasive com- 
munications also were again effective in moving a 
majority of the subjects from their original posi- 
tion; percentages of subjects changing ranged from 
51 to 83. 

Attitude change. Using the same method to test 
the significance of the primacy scores as in Experi- 
ment I, no differences were found on any of the 
three items given before and after the persuasive 
messages for the familiar and nonfamiliar groups 
in the immediate condition, the familiar and non- 
familiar groups in the delayed condition, and the 
nonfamiliar groups in the immediate and delayed 
conditions; the five questions used in the post 
questionnaire only were also analyzed, and again 
no differences were found. 

Effect of perceived controversy. Exploration of 
Luchins’ Einstellung interpretation of primacy 
was undertaken by comparing subjects who per- 
ceived the two sides of the argument concerning 
Dr. Murray’s serum to be controversial versus 
those who did not. If Luchins’ hypothesis is cor- 
rect, then those subjects who did not perceive 
controversy between the pro and con arguments 
would be most likely to evidence a primacy effect 
and those who perceived that the issues were con- 
troversial would be least likely to evidence 
primacy. Fifty-three of the subjects saw the argu- 
ments as conflicting and 23 did not; there were no 
differences in primacy scores between these groups 
on any of the attitude items for either delayed or 
immediate measurement. We also examined the 
possibility that the primacy effect predicted for 
the nonfamiliar condition had occurred for sub- 
jects not perceiving controversy, but had not oc- 
curred for subjects perceiving controversy. The 
analysis revealed no unequivocal primacy effect 
for subjects not perceiving controversy in the non- 
familiar condition. 


Exrerment III 


The condition of immediate measurement of the 
second experiment was replicated with 216 female 
undergraduates of the University of Michigan. 
with negative results. 


Discussion 


Suggestions concerning how to interpret the 
failure to find a primacy effect in our studies can 
be derived from comparing our results with those 
of Miller and Campbell (1959), whose experiment 
appeared in print after the first of our experiments 
was conducted. The results of their experiment 
directly comparable to ours involve the comparison 
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Vy 
of the attitude-change scores for the conditions of 
immediate and delayed measurement. The topic 
was the question of who to favor in a suit for 
damages resulting from a defective vaporizer—an 
unfamiliar topic. They found no tendency toward 
primacy or recency in the condition of immediate 
measurement but a primacy effect for the condi- 
dition of delayed measurement. Thus this study 
provides results corroborating the hypothesis of | 
the present studies. 

A critical difference between our studies and 
theirs was the topic. Although both topics were 
unfamiliar to the subjects, ours could readily have . 
aroused existing attitudes toward the treatment of 
cancer, the chronically ill, taking risks in medical 
practice, and the related topic of euthanasia, 
whereas theirs involved a neutral and unknown 
subject. Thus the failure to obtain a difference in 
the condition of delayed measurement of our 
second experiment may have been due to the rela- 
tive weakness of the persuasive communications 
in the face of the existing attitudes. This interpre- 
tation suggests the necessity to distinguish between 
familiarity with the topic and familiarity with the 
attitudes that may apply to it. The topic as well 
as attitudes relating to it should be new for the | 
subjects, as presumably was the case in the Miller- 
Campbell experiment. If there is a new topic but it 
is similar to others known to the subjects and 
arouses attitudes that subjects already have to- | 
ward the related topics, then the reactions of the 
subjects to the new topic should be a function of 
the extent to which it is similar to the already 
known topics and of the intensity of the attitudinal 
affect associated with the related attitudes. 


SUMMARY 


The hypothesis of these experiments was that | 
the first of two successive persuasive communica- 
tions would be more effective than the second, 
where subjects were not familiar with the topió 
and that no primacy or recency effect would be 
found where subjects were familiar with the topic 
In the first experiment, in which conditions 0 
familiarity and nonfamiliarity with a contro- 
versial topic were established, no tendency was 
found toward either a primacy or a recency He 
in either condition. In the second experiment the 
conditions of the first were replicated with the 
addition of delayed measurement added to that of | 
immediate measurement. Again no tendency to- | 
ward primacy or recency was found. A third & | 
periment replicated the condition of immediate 
measurement of the second, again finding v 
primacy effect. Although the topic in these D 
periments was not familiar to the subjects, strong 
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ANXIETY AND THE EXPERIMENTAL AROUSAL OF AFFILIATION NEED! 


DONN BYRNE* 
University of Texas 


effect of experimentally aroused affilia- 
tive motivation on responses to projective 
tests has been investigated in order to develop a 
TAT scoring system for n Affiliation. Shipley and 
Veroff (1952) employed a sociometric procedure 
as the arousing condition. Their scoring system 
focused on imagery involving affiliation-depriva- 
tion. They cross-validated their scoring cate- 
gories by comparing responses of freshmen ac- 
cepted and rejected by college fraternities. 
Atkinson, Heyns, and Veroff (1954) attempted to 
broaden the definition of affiliation imagery by in- 
cluding any statement of concern over establish- 
ing, maintaining, or restoring a positive affective 
relationship with another person or persons. They 
also maximized the difference between the ex- 
perimental and control groups used in the pro- 
cedure for analyzing the responses. Again, a 
sociometric rating experience constituted the 
arousal condition. In both studies, and with some- 
what different scoring systems, more affiliation 
responses occurred in the stories of the aroused 
subjects. 

The experimental conditions that have defined 
affiliation arousal have been such that both positive 
and negative cues probably are present. Affiliation 
need is conceptualized as an approach tendency, 
and the score as it is defined by the Michigan group 
is largely oriented toward approach responses. 
However, imagery motivated primarily by fear of 
rejection may have been confounded with that 
motivated primarily by hope of affiliation because 
of the nature of the arousal conditions that have 
been used. If arousal conditions that involve 
ratings by others actually do evoke avoidance re- 
sponses, such conditions should evoke feelings of 
anxiety as well as affiliation need. 

A further consideration is whether affiliation 
arousal would be expected to evoke anxiety to the 
same extent in all subjects. When TAT protocols 
are obtained in a neutral situation and scored for n 
Affiliation, the resulting individual differences are 
assumed to indicate variations in characteristic 
motivational states. Persons who are found to 
have high need for affiliation should be most re- 

1 This investigation was supported in by re- 
search grants (EF-140, EF-143) from a uei 
of Texas Research Institute. 

2 The author would like to express his appreciation 
to John Sheffield and Carole Calvert who served as 
research assistants and to Elaine Abbott who scored 
the affiliation need protocols. 
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sponsive to affiliation arousing cues in their en- 
vironments. That is, external stimulus conditions 
and internal motivational state should interact, 
French and Chadwick (1956), using their own pro- 
jective measure of affiliation need, found the great- 
est increase in n Affiliation in an arousal condition 
for subjects whose scores were highest in a neutral 
condition. Perhaps experimental manipulations 
designed to arouse affiliative motivation are only 
effective with those subjects for whom such en- | 
vironmental cues are initially salient. Therefore, if 
anxiety is evoked by affiliation arousal, one might 
reason that its intensity should be a function of an 
individual's characteristic level of n Affiliation. 
These two considerations lead to the hypotheses 
that (a) anxiety is evoked by the experimental — 
arousal of affiliation motives and (b) the amount of 
anxiety is a positive function of n Affiliation de- 
termined in a neutral situation. 


METHOD 


Procedure. As part of another study, 84 students (4 
males, 40 females) enrolled in introductory psychology 
courses at the University of Texas were seen in groups 
of 5-10. They were seated in a small amphitheater 
equipped with a large one-way mirror. They were 
given a paper-and-pencil task requiring about 15 
minutes time. 

For the experimental group, the following instruc 
tions were given: 

Before you begin, there is something else I 
would like to tell you about. You are actually taking 
part in two experiments at the same time. A 

One way to make judgments about people is to 
observe them carefully. The study of expressive 
movements has taught psychologists a lot about the 
meaning of the many little unconscious gestures, 
facial expressions, body movements, etc. in Ww 
all of us engage in our everyday behavior. We have 
some evidence that such things have a lot to do wi 
a person's popularity. So, during the rest of the 
session today, I and my two assistants 
watching each of you very closely. 

This mirror here is actually a one-way screen. D 
the room next to this one, two research assistants 
are already taking notes on various aspects of your 
behavior. They are both graduate students ip 
psychology. Il show you where they are. (Lights ID 
experimental room are turned off and those in the 
observation room turned on momentarily.) _ 

Before the hour is over, we will have assigned 8 
rating to each of you with respect to your general 
popularity, your attractiveness to the opposite 
how likeable you seem, etc. 
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Following this, the experimenter left the subjects 
alone and noisily entered the observation room. 

Anxiety ratings. Immediately following the paper- 
and-pencil tasks, the experimenter returned and ad- 
ministered a questionnaire dealing with reactions to 
the experiment. Two six-point self-rating scales dealt 
with their feelings of nervousness and uneasiness during 
the preceding 15 minutes and their estimates as to how 
nervous or uneasy most of the other students in the 
room probably felt. On each scale, the descriptive 
points were “completely calm," “relatively calm,” “a 
little uneasy,” “quite uneasy,” “very uneasy,” and 
“extremely uneasy.” 

The anxiety score consists of the combined responses 
to these two scales. The split-half reliability of this 
instrument was found to be .72, corrected by the Brown- 
Spearman formula. 

Affiliation need. One week later, a different experi- 
menter using a different experimental room admin- 
istered four TAT slides according to Atkinson’s (1958, 
pp. 836-837) instructions. The slides consisted of 
Pictures 28, 83, 102, and 103 in the list presented by 
Atkinson (1958, pp. 832-834). A research assistant 
used the Heyns, Veroff, and Atkinson (1958) manual to 
score the protocols. 

A subsample of 29 protocols was scored inde- 
pendently by an experienced scorer at the University 
of Michigan. The correlation of .95 between the two 
sets of scores indicates high interjudge reliability. For 
the 84 subjects, the mean n Affiliation score was 2.67 
with a standard deviation of 2.54. Subjects were 
divided approximately at the median into low (N = 
38, scores of 0-2) and high (N = 46, scores of 3-10) 
groups. 


RESULTS 


The means and standard deviations of the 
anxiety ratings are presented in Table 1. The in- 
teraction between experimental conditions and 
affiliation need in evoking anxiety was examined in 
a two-way analysis of variance. Disproportionality 
among the number of cases in the four cells was 
Corrected using the procedure outlined by Wert, 
Neidt, and Ahmann (1954). The results of this 
analysis are reported in Table 2. 

„Neither n Affiliation nor the experimental con- 
ditions were related to self-ratings of anxiety.* The 
Interaction between the two independent variables, 
however, was statistically significant. The major 
Source of the difference is between the High and 
Low n Affiliation groups in the experimental con- 
dition. Subjects high in affiliation need indicated 
Significantly greater anxiety in response to affilia- 
tion arousing cues than subjects low in affiliation 
need (| = 2.61, df = 34, p « .02). Also, the 
difference between the experimental and control 
Conditions approaches significance for the sub- 
jects high in n Affiliation (t = 1.90, df = 44, p 
between .05 and -10). 


*It was the failure of the arousal instructions to 
evoke anxiety that led to the present investigation. 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS OF 
RATINGS 
Experimental 
Control group Toup! 


High n Affiliation 27| 3.81| 1.16| 19| 4.42) .88 
Lown 21| 4.00) 1.23) 17| 3.59) .97 
TABLE 2 


Summary OF ANALYSIS OF VARIANCE OF ANXIETY 
RATINGS AS BY EXPERIMENTAL 


CONDITIONS AND AFFILIATION NEED 


Source SS dí MS F ? 
Experimental condi- .4072*| 1| .4072| .33| ns 
tions 
Affiliation need 1.3438*| 1/1.3438/1.09| ns 
Interaction 5.2843*| 1/5.2843/4.28| < .05 
Within (error) 98.8233 1.2353) 
* Adjusted. 


Another way of examining the data is through a 
correlational approach. Anxiety ratings and n 
Affiliation scores correlated .45 (p « .01) in the 
experimental condition and — .12 (ms) in the con- 
trol condition. 


Discussion 


The results suggest that manipulations de- 
signed to arouse affiliation need evoke anxiety, at 
least in persons whose fantasy productions give 
evidence of concern over establishing, maintain- 
ing, or restoring positive affective relationships. 
Arousal conditions that involve a high probability 
of success should be less likely to evoke such 
anxiety than arousal conditions that involve the 
strong possibility of failure. A separation of these 
positive and negative qualities in the arousal condi- 
tion might lead to a more specific differentiation 
of hope of success and fear of failure with respect 
to affiliation imagery in thematic material. As 
Shipley and Veroff (1952) have suggested, the 
ability to specify whether individuals are moti- 
vated primarily by positive, negative, or mixed 
affiliative needs would seem to permit more ac- 
curate behavior predictions in affiliation relevant 
situations. 

From another viewpoint, the findings suggest 
that the present n Affiliation scoring system identi- 
fies those individuals who are likely to be made 
anxious by a situation containing an affiliation 
threat. Presumably these same high n Affiliation 
subjects would respond with greater pleasure to the 
promise of an affiliative reward than would subjects 
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low in n Affiliation. Other motivational scoring 
schemes might also be used in a similar way to pre- 
dict those who would be made anxious by threat 
relevant experimental conditions or made pleased 
by reward relevant conditions. 

A word should be added about the failure of the 
experimental condition to evoke more anxiety than 
the control condition. Remarks by various sub- 
jects suggested that the room in which the experi- 
ment was conducted was somewhat anxiety pro- 
voking in and of itself. If so, perhaps the instruc- 
tions only served to define the source of anxiety 
cues. Without those instructions (in the control 
group), the array of individual hypotheses about 
the threat presented by the room was un- 
related to n Affiliation. However, when other 
subjects (in the experimental group) were in- 
formed that they were being observed and rated, 
the source of concern became relatively specific. 
"Those who feared, for example, the possibility of 
electric shock were now able to relax while those 
with strong affiliation motives perceived the cues 
as increasingly stressful. 


SUMMARY 


‘The experimental conditions employed in studies 
of n Affiliation probably contain a mixture of 
positive and negative cues. If so, such arousal con- 
ditions should evoke anxiety as well as n Affiliation. 
The hypothesis was tested that anxiety is evoked 
by the experimental arousal of affiliation motives 
and that the amount of anxiety is a positive func- 
tion of n Affiliation determined in a neutral situa- 
tion. 

In the experimental group, subjects were aware 
of being watched through a one-way mirror and 
rated for popularity, attractiveness, etc. while 
completing a paper-and-pencil task. The control 
group simply completed the paper-and-pencil 
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task. All subjects then rated themselves and others 

in the group with respect to feelings of nervousness | 

and uneasiness. At another testing session, TAT 
slides were shown, and the protocols were scored 
for n Affiliation. 

Experimental conditions and affiliation need 
interacted to influence the anxiety self-ratings. In 
the experimental group, subjects high in n Affilia- 
tion rated themselves significantly more anxious 
than did those subjects low in n Affiliation; in the 
control group, affiliation need was unrelated to 
anxiety ratings. Manipulations designed to arouse | 
affiliation need thus appear to evoke anxiety in | 
persons whose fantasy productions give evidence 
of concern over affiliation. Also, the present n 
Affiliation scoring system identifies those indi- 
viduals who are made anxious by an affiliation | 
threat. 
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PSYCHOGENIC DEAFNESS, PERCEPTUAL DEFENSE, AND PERSONALITY 
VARIABLES IN CHILDREN 
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N a recent discussion, Kodman and Blanton 
I (1960) have suggested examination of the per- 
ceptual defense hypothesis, using as subjects 
children who manifest psychogenic auditory be- 
havior. Indicating the theoretical difficulties in- 
volved, they observe that a preponderance of the 
cases encountered by them were children whose 
normal thresholds for speech contrasted with losses 
on the pure tone audiogram, i.e., in response to 
meaningless stimuli. 

It may be postulated that conventional pure- 
tone audiometry presupposes conditions of set and 
attention different in quality from those obtaining 
in normal speech communication. If this is so, 
then personality may well be an important variable 
in the determination of responses in the two situa- 
tions, The present study reports a preliminary 
check on this prediction, with some observations 
on the applicability of perceptual defense thories 


to psychogenic deafness. 


PROCEDURE 


The files (totaling over 3,000) of a large city child 
guidance clinic and school psychological service were 
searched for cases presenting psychogenic or functional 
deafness for speech where responses to pure-tone 
audiometric testing had been judged to be normal. A 
further group of children was found whose thresholds 
for pure tones had been relatively high but whose 
responses to speech had been within normal limits. In 
both groups the age range was restricted to 8-14, and 
Cases of below 75 IQ, those where there was any evi- 
dence of brain injury, and those where the psychiatric 
diagnosis was in doubt were excluded. All cases were 
Psychiatrically diagnosed, independently of this in- 
vestigation, as being predominantly "hysteric" or 

anxiety state." The children in the first group pre- 
Sented outgoing behavior problems, whilst those in 
the second were characterized by timidity, withdrawal, 
or mild phobic symptoms. 

Both groups were roughly equal in distributions of 
age, sex, and IQ, 


RESULTS AND DISCUSSION 


Findings are summarized in Table 1. Despite 
the statistically high significance (p = .005), con- 
fidence in generalization is, of course, diminished 
by considerations of the small numbers involved 
and the unreliability of psychiatric diagnosis. But 
there seems to be a clear tendency in this sample 
for those children whose auditory threshold for 
pure tones is high to be categorized as hysteric 
(as opposed to anxious) and some tendency to- 


wards the reverse among those who display a lack 
of response to speech. 

Kleinman (1957), in a careful experimental 
comparison of subjects with psychogenic hearing 
loss and a control group with organic hearing loss, 
found a decreased practice effect rather than 
heightened auditory thresholds in the experi- 
mental group. However, Kleinman pointed out 
that “individuals whose auditory thresholds are 
characteristically raised as a defense against 
anxiety-provoking stimuli can be identified on the 
basis of their perceptual behaviour,” and inter- 
preted his results as being generally consistent 
with perceptual defense theory. Other workers 
have concurred with this view, and it may well be 
presumed that some relationship does exist be- 
tween psychogenic deafness and perceptual de- 
fense. Indeed, investigations into the latter have, 
to some extent, been stimulated by clinical find- 
ings. But theoretical difficulties at once become 
apparent, as Kodman and Blanton (1960) have 
pointed out. 

Most relevant experimental studies have been 
designed to show whether there are differences 
between subjects’ responses to “threatening,” as 
opposed to neutral, words or sentences. In practice, 
our cases reacted abnormally to situations. Group 
A showed high or inconsistent thresholds for pure 
tones during audiometric testing, but not for 
speech, Group B members were hard of hearing 
among strangers or in class, but responded 
normally to pure-tone testing. Thus it is difficult 
to explain the behavior of Group A in terms of 
perceptual defense as presented in Eriksen’s (1954) 
review or in Blum’s (1955) more specifically psy- 
choanalytic account unless it is postulated that 
pure-tone audiometry constitutes a threatening 
stimulus. 

On the other hand, Solomon and Howes’ (1951) 
argument regarding the relative lowness of ap- 
propriate responses in the subjects’ response 
hierarchies might well explain the high thresholds 
of Group A, but not those of Group B. In what to 
them are presumably anxiety provoking situa- 
tions, these children show a relative lack of 
response, not only to unfamiliar or “tabu” words, 
but to the bulk of verbal stimuli. 

In short, the concept of perceptual defense in 
its original forms is not directly applicable to the 
behavior of children in Group A. Their type of 
response is more readily studied in terms of labora- 
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TABLE 1 


DISTRIBUTION OF CHILDREN WITH PSYCHOGENIC 
DEAFNESS BY PERSONALITY TYPE 


Anxiety Hys- 


state terio Totals 
Elevated thresholds for: 
Pure Tones (Group A) 1 13 14 
Speech (Group B) 9 3 12 
10 16 26 


x? (after Yates’ Correction) = 9.86 


tory and industrial studies of “vigilance” as 
defined by Mackworth (1957). Furthermore the 
early experiments on perceptual defense do not 
seem to have a direct bearing upon the behavior 
of children in Group B unless wide generalizations 
from the early theory are to be permitted. If the 
behavior of both groups is in fact explicable in 
terms of perceptual defense, then we would 
suggest, as did Singer (1956) in a lucid review and 
experimental study, that the most rewarding 
type of approach is one using “hypothesis” 
concepts as presented, for instance, by Postman 
(1953). Crudely put, it may be argued that some 
children adopt a generalized anxious set for certain 
social situations, and thus show elevated thres- 
holds for all verbal stimuli in those situations. 
Other children adopt faulty expectations in 
perceptual tasks demanding heightened attention, 
and thus produce insensitive or inconsistent 
responses. It remains now to attempt to account 
for the individual differences in response to these 
two types of situation, which are highlighted by 
the distributions of clinical categories in the 
present data. 

Where personality variables are concerned, 
the present findings, for reasons given above, 
cannot be directly compared with those of Lazarus, 
Eriksen, and Fonda (1951), who set out to study 
personality mechanisms as reflected in the “audi- 
tory recognition of sexual and aggressive material.” 
It is of interest to note, however, that they found 
that their freely verbalizing or "intellectualizer" 
group, which included obsessive-compulsives, 
showed greater accuracy in the perception of 
threatening material than did their “repressor” 
group, which included conversion hysterics. 


Objection may be made to the clinical aptness of ~ 


these workers’ classificatory procedure. But their 
empirical results are clear and seem to point in 
the opposite direction to that indicated by the 
present data, Kurland (1954), on the other hand, 
found no difference between obsessional-compul- 
sives and hysterics in their recognition threshold 
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4 
of emotional words. Again these results are not 
directly comparable with the present ones, but 
would appear to be different in implication. 

The different distributions of clinical classifica- 
tions in our two groups are predictable, however, 
from Eysenck’s (1957) formulations. Pure-tone | 
audiometry is a task which might be expected to | 
discriminate between dysthymic individuals, with 
their relatively high levels of drive, aspiration, 
and persistence, and hysterics, who have been 
found to build up reactive inhibition more rapidly 
and to be behaviorally more distractible. In this 
connection, Kodman and Blanton (1960) point 
out the importance for accurate signal detection 
of freedom from competing response tendencies, 
and a most stimulating account of the differential 
performance of introverts and extraverts in 
relevant experimental tasks is given by Broadbent 
(1958). 

A similar explanation, coupled with considera- 
tions of attention gaining behavior, might be 
given for the existence of a proportion of hysterics | 
in Group B. As far as the majority of children in 
this group is concerned, however, a different set 
of mechanisms must be postulated; but, again, à 
plausible explanation is readily derived from 
Eysenckian theory. Dysthymic children are highly 
reactive and readily conditionable. They tend to | 
demonstrate strong fear reactions, which, linked | 
with oversocialization, may be triggered off by | 
apparently neutral stimuli in social situations. 
Heightened thresholds for speech may thus be | 
acquired as an anxiety reducing avoidance mecha- 
nism by these children more readily than by 
hysterics. 


SUMMARY 


Survey of a child guidance population showed 
that of 26 functionally hard-of-hearing cases, i 
presented elevated thresholds for pure tones} 
but were within normal limits for speech. The 
other group, though normal on pure-tone testing | 
presented elevated thresholds for all speech 
rather than for relatively threatening stim 
It would appear that both groups were reactive to 
situations, as opposed to specific auditory stim! 
within those situations. Furthermore the majority 
of the pure-tone group were clinically classified 45) 
“hysteric,” whereas the majority of the spe 
group was classed as “anxiety state.” 3 

The applicability of “defense” theories tt 
such cases is discussed, and it is suggested thal 
the behavior of both groups may best be studii 
in the light of a “hypothesis” approach. 4 

It is pointed out that the pure-tone kar 
situation demands “vigilance” in the experimen, 
sense, and it is then shown that the diffe 
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distributions of clinical categories in the two 
groups is predictable from Eysenck’s formulations. 
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AN OUTCOME INDEX FOR MENTAL HOSPITAL PATIENTS! 


E. I. BURDOCK anp ANNE S. HARDESTY 
New York State Department of Mental Hygiene and Columbia University 


tany particular point in time, the patients in 
a mental hospital vary with respect to 
length of stay, number of admissions, number of 
releases, and time since first admission. A single 
index number combining these factors could 
provide a criterion for assessing the predictive 
validity of psychiatric prognoses, psychological 
tests, and treatment regimens. 

Previous attempts to describe the fates of 
institutionalized patients by Crandell, Zubin, 
Mettler, and Logan (1954) and by Crandell, Zubin, 
Mettler, and Kugelmass (1956) culminated in an 
“immobility index,” which was the ratio of number 
of months of hospitalization to number of admis- 
sions during the two years following first admission. 
This immobility index was found to correlate 
negatively with subsequent outcome. 

The Outcome Index, 


36.) 2A 
eTA 


represents a refinement of the earlier immobility 
index in the direction of greater sensitivity and 
convenience. The four elements are: 


} = length of time in days between first admission 
and a selected anniversary date. 
1, = total number of nights slept outside the hospi- 
tal on official leave or discharge status. 

a = number of admissions. 

r = number of releases. 
The relative weights given to admissions and 
releases. were determined empirically from a 
sample of patients ranked in order of length and 
frequency of hospital experience. Previous studies 
as well as clinical observation suggest that each 
successive readmission increases the expectation 
of subsequent admissions, but each successive 
release is less indicative of genuine remission. 

The Outcome Index is like a probability in 
that it ranges from zero for a patient with no 
significant releases to a hypothetical maximum of 
one for a patient who spent no days in the hospital. 

The Outcome Index has been applied to three 
samples of patients first admitted to a New York 
State mental hospital in 1953, 1955, and 1956, 
respectively. They were followed up at four 
successive intervals: 6 months, 1 year, 2 years, 
and 3 years. The long-term follow-up was greatly 

1 This research has been supported by Research 


Grant M-3546 from the National Institute of 
eae Mental 


facilitated by the fact that the patients had been 
located on the earlier occasions. The data were 
tabulated and Outcome Indices were computed 
for each patient at every period. 

The 1953 sample consisted of 35 first-admission 
patients who were all young schizophrenics (mean 
age 29) and who had been selected by the pro- 
fessional staff for accessibility and amenability to 
psychological testing. The 1955 group of 60 first- 
admission patients was a sample of young schizo- 
phrenics (mean age 30) chosen by a survey of the 
case records, The 1956 group of 85 first-admission 
patients successively admitted to the same institu- 
tion were heterogeneous from the standpoint of 
diagnostic category, age, sex, etc., being alike 
only with respect to time of entry into the hospital 
and first-admission status. These 85 patients were 
the survivors of a somewhat larger cohort reduced 
to have been a dispro- 
f geriatric admissions 
in this cohort.) No Outcome Yndices were computed | 
for those who failed to surviye to the end of the 
final follow-up period. 

Previous studies (Kramer, Goldstein, Israel, & 
Johnson, 1955) have suggested that by the end of 
the second year of hospitalizition, the careers of 
mental patients have become stabilized. Therefore, 
2 years has been taken as a basal follow-up period 
for dividing patients into homogeneous group? 
with respect to the Outcome Index. Figuzes 1 
through 3 represent the trends of individual 
Outcome Indices from 6 months to 3 years for 
each sample successively. In each sample, the 
patients have been divided into four groups 
according to the magnitude of the 2-year index. 
Chronic residents with zero indices at 720 days 
are not shown on the figures. There were 4 of these 
in the 1953 sample, 4 in the 1955 sample, and 17 in 
the 1956 sample. Only 3 of these 25 with zero index 
showed any change by the third year. One of the 
1953 patients rose from zero to .004 and 2 of the 
1956 patients rose from zero to .013 and to .222, 
respectively. 

In each of the three figures, the right-hand 
graph, Early Release, represents patients with à 
2-year Outcome Index of .50 or higher. To obtain 
basal index this high, these patients must have 
had only a single admission and must have been 
released within one year. Some of these, with 
a 2-year index of .75 or higher, were rel 
within 6 months of admission. In each of the 
three samples more than half the patients fit this 
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Fic. 1. Individual lines of trend of successive Outcome Indices from time of first admission to a 


state mental hospital in 1953. 
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Fic. 2. Individual lines of trend of successive Outcome Indices from time of first admission to a state 


1080. 180 360 
DAYS 
mental hospital in 1955. 
definition for early release: 
In 1953—18 out of 35 


In 1955—36 out of 60 
In 1956—44 out of 85 


Total—98 out of 180 


Only 4 of the 98 relapsed during the third year, 
2 from the 1955 sample and 2 from the 1956 
sample, 


The middle graphs in Figures 1 through 3 are 
for patients with Delayed Release. The 2-year 
Outcome Index ranging from .125 to .50 again 
reflects a single admission, but with between 12 
and 21 months of hospitalization. There were only 
three such patients in each sample. 

The three left-hand graphs, Late Release or 
Relapse, are for patients with 2-year Outcome 
Index between zero and .125. A few patients with 
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Fic. 3. Individual lines of trend of successive Outcome Indices from time of first admission to 4 


state mental hospital in 1956. 


Outcome Index less than .125 had a single admis- 
sion but were released only after 21 months of 
hospitalization. There were two such patients in 
the 1955 sample and one in the 1956 sample. But 
the majority of those with an Outcome Index 
between zero and .125 were released during their 
first year and readmitted before the end of the 
2-year basal period. é 

Figures 1-3 also suggest that the frequencies 
for each of the four categories of outcome are 
unequal. The disparity is more clearly revealed 
in Table 1, which presents the data in summarized 
form and includes the appropriate statistical 
tests of significance. The tests support the con- 
clusion that in each sample the frequencies of the 
four different outcome categories are significantly 
different from each other. Moreover, an overall 
test of the pooled data, as shown in Table 1, 
indicates that the differences among the categories 
tend to be consistently in the same direction 
from sample to sample. Finally, a test of independ- 


ence encourages the presumption that all three 
samples are drawn from the same population of 
outcomes, 3 

The stability of the basal Outcome Index is 
most graphically demonstrated by dividing each 
sample into two subgroups, those with a 2-year 
index equal to or greater than .5 vs. those with à 
2-year index less than .5. If separate frequency 
distributions of the 3-year Outcome Indices Íor 
each subgroup are then plotted on the same axes 
the extent of overlap is seen to be negligible 
Figures 4 through 6 show the data arranged in. 
this way. 3 

"The evidence so far presented suggests that, m 
general, when it is applied to individual patients 
the basal Outcome Index computed for the 2-year 
period following first admission yields a maximum 
amount of information in a minimum follow-up 
time. However, it will often be of interest 1 
examine the records of a cohort at the end of 
their first year so as to get some notion as to the 


TABLE 1 


Cat SQUARE ANALYSIS OF FREQUENCIES IN FOUR CATEGORIES OF A Two-YEAR OUTCOME INDEX 
APPLIED TO THREE SAMPLES OF FIRST-ADMISSION PATIENTS AT A STATE MENTAL HOSPITAL 


Sample Olmo 6S OE OUS So E Total E a $ 
1953 Coa NOD ERN 35 16.312 s «0 
1955 dO POT7) 31965 26 60 47.332 3 <-00! 
1956 rune Wiss A4. 85 40.879 3 «00 

ex) 104.5233 9 | «9 
Pooled Dona CN TORN Og 180 10.30 3 <.00t 
(Test of Independence) 4.213 6 m 
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Fic. 4. Frequencies of various 3-year Outcome In- 
dices among two subgroups of a 1953 sample of first- 
admission patients. 


Ql 


types of patients and their disposition. For this 
purpose the Outcome Index has been applied to a 
sample of 83 first-admission patients who entered 
New York Psychiatric Institute in 1958-1959. 
The New York Psychiatric Institute is a small 
intensive treatment facility which selects patients 
suitable for teaching and research and which does 
not relegate any patient to custodial care. It has a 
high staff-to-patient ratio. As it is still less than 2 
years since admission, a basal Outcome Index 
could not be computed for these patients. But it is 
of interest to compare them with unselected state 
hospital patients for early outcome. The propor- 
tion of institute patients with zero index for the 
first 6 months is almost twice as large as the 
corresponding proportion for the state hospital; by 
the end of the year, the proportion of institute 
patients with zero index has increased so that the 
ratio is 5:2, The reason why fewer patients 
are released from the Psychiatric Institute during 
their first year appears to be that almost all 
patients become involved in intensive and pro- 
longed psychotherapy (Burdock, Glass, Hardesty, 
& Beck, 1961). 


OUTCOME INDEX 


2-YR. FOLLOW-UP (BASAL PERIOD) 


€ 


669 


N 


25 ---2-YEAR OUTCOME INDEX <5 
YEAR OUTCOME INDEX2 .5 


20 


FREQUENCY 


MM À 


e 3 4567 8 9 I0 
3-YEAR OUTCOME INDEX 


Fic. 5. Frequencies of various 3-year Outcome In- 
dices among two subgroups of a 1955 sample of first- 
admission patients. 
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Fic. 6. Frequencies of various 3-year Outcome In- 
dices among two subgroups of a 1956 sample of first- 
admission patients. 


only one year of follow-up, the ranges of the three 
categories above zero are correspondingly reduced 
as follows: 


1-¥YR. FOLLOW-UP 


Ol =0 Continuous residence Continuous residence 

0< OI < .125 Late release (only after 21 months) or Relapse Release only after 10.5 months or 
(2 or more admissions) Relapse 

125 < OI < .50 Delayed release (only after 1 year) Release after 6 months 

50 < OI < 1.00 Early release (within first year) Release within 6 months 


The difference between the state hospital and the 
Psychiatric Institute is pointed up by tabulating 
the corresponding frequencies for each outcome 
category for the end of the first year after ad- 
mission. When the Outcome Index is based on 


Table 2 shows the relative frequencies of the 1- 
year outcome categories for each of the three 
samples from the state hospital and for the sample 
from the Psychiatric Institute. The accompanying 
statistical tests again support the hypotheses 
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TABLE 2 
Crt SQUARE ANALYSIS OF FREQUENCIES IN Four CATEGORIES OF A ONE-YEAR OUTCOME INDEX 
APPLIED TO THREE SAMPLES OF First-ADMISSION PATIENTS AT A STATE MENTAL HOSPITAL 
AND ONE SAMPLE oF FIRST-ADMISSION PATIENTS AT A PSYCHIATRIC INSTITUTE 


Sample Oro eer OD SE Or S Total x df > 
1953 7 8 5 15 35 6.485 3 ns 
1955 9 7 8 36 60 39.337 3 <.001 
1956 21 16 9 39 85 23.186 3 «.001 
(zx) 69.008 9 <.001 
Pooled 7 BL ay Be 90 18 62.532 3 <.001 
PI. 41 8 12 22 83 31.361 3 <.001 
(zx? 93.893 6 <.001 
Pooled 78 39 34 112 263 61.029 3 <.001 
Tests of Independence 
Among state hospital samples: Zx? — Pooled x? = 6.476 6 ns 
State hospital vs. Psychiatric Institute: £x? — Pooled x? = 32.863 3 <.001 


that the four outcome categories are on the whole 
significantly different and that the variations are 
in the same direction from sample to sample, but 
the 1953 sample, the smallest, fails to provide a 
statistically significant difference. However, a 
noteworthy contrast emerges between the institute 
sample and the combined state hospital samples 
when the tests of independence are examined. 
While all three state hospital samples appear to 
come from the same population of outcomes, the 
Psychiatric Institute sample registers as in all 
probability independent. The institute’s policy of 
holding patients for intensive psychotherapy is 
reflected in a slower release rate during the first 
year after admission. 

From the data presented here it can be con- 
cluded that the Outcome Index, based on the first 
2 years after admission, provides a reasonably 
stable criterion for assessing effectiveness of 
treatment and for distinguishing institutional 
policies. Three successive samples of first-admission 
patients taken from a regular state mental hospital 
show basically similar patterns of outcome despite 
different diagnoses, different age groups, and the 
varying effects of the current trend in drug therapy. 

The Outcome Index can be used to characterize 
institutions as well as to describe individual 
patients, This latter application is illustrated by 
the comparison of the three state hospital samples 
with the sample from the Psychiatric Institute, 


SUMMARY 
An Outcome Index for mental hospital patients 
is described. The index relates time out of hospital 
to follow-up period since first admission and 
weights their ratio differentially by number of 
readmissions and releases. This index becomes 
relatively stable on 2-year follow-up. It can be 
used as a criterion for prognostic efficiency, for 
efficacy of treatment, and for characterizing 
institutional policy. 
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STATUS AND THE JUDGMENT OF OCCUPATIONAL PRESTIGE! 


SIDNEY I. PERLOE 
Haverford College 


HEN people are asked to rate a series of 
W stimuli on an arbitrary category scale, 
their judgments are made in relation to one or 
more reference points or anchors. The rating 
assigned to a stimulus depends upon the direction 
and magnitude of the distance between the stimu- 
lus and the anchor (Volkmann, 1951). A shift in 
the location of the anchor toward the high end of 
the rating dimension leads to a displacement of 
the stimulus judgments toward the low end. A 
displacement of stimulus judgments in an upward 
direction is found when the anchor is shifted 
toward the low end of the judgment dimension? 
These judgmental phenomena have been observed 
with both physical and verbal stimuli (Guilford, 
1954; McGarvey, 1943). 

There are a variety of ways in which the loca- 
tions of anchors are determined. Sometimes 
subjects spontaneously choose the most extreme 
stimuli in a series as anchors because they are 
most easily discriminated from the other stimuli. 
At other times the experimenter provides special 
stimuli that subjects are instructed to use as 
anchors. In both of these cases the anchor is an 
external stimulus that is continuously or inter- 
mittently present in the judgment situation. 
There have, however, been studies which suggest 
that people may also use internal anchors to 
guide their judgments. Hunt (1941) found that 
asking subjects to imagine the most extreme 
stimulus they could think of, on a given dimension, 
had the same effect as employing an external 
anchor, Tresselt (1948) compared the weight 
judgments of weight lifters, watchmakers, and 
students. She found that the weight lifters initially 
judged the experimental weights as lighter than did 
either of the other groups. Presumably this differ- 
ence was due to heavier internal anchors on the 
part of the weight lifters than on that of the 
watchmakers or students. 


1 Supported in part by NSF Grant G.3023. 

The author wishes to express his appreciation to 
P. Billard, G. Brauner, H. Stick, T. Warne, and W. 
Welsh of Yale College, for their help in collecting part 
of the Yale data. 

? A related theory proposed by Helson (1948) states 
that the rating assigned to a stimulus is determined by 
the distance between the stimulus and the adaptation 
level or neutral point of the series. Helson holds that 
anchors displace judgments only insofar as they change 
the adaptation level. In the present case, Volkmann's 
(1951) and Helson's theory make the same predictions. 


Tnternal anchor effects have also been found by 
Hovland, Harvey, and Sherif (1957); Hovland 
and Sherif (1952); Kelley, Hovland, Schwartz, and 
Abelson (1955); Manis (1960); Segall (1959); 
Sherif and Hovland (1953); and Weiss (1959). In 
these studies, the subjects’ opinions acted as 
internal anchors in their judgments of the position 
advocated by a series of opinion statements. 
Subjects with extremely favorable positions on an 
issue tended to displace their judgments of items 
that deviated from their views toward the un- 
favorable end of the judgment scale. Those with 
extremely unfavorable positions tended to displace 
their judgments of statements that deviated 
from their views toward the favorable end. Similar 
results have been obtained by Hinckley and 
Rethlingshafer (1951) in a study of the effects of 
the subjects! own heights on judgments of height; 
a series of heights were judged to be shorter by 
tall men than by short men. 

.'The present paper is concerned with social 
status as an internal anchor in the judgment of 
occupational prestige. The hypothesis is that 
high status subjects should displace their judg- 
ments of medium and low prestige occupations 
toward the low end of the occupational prestige 
dimension. 

METHOD 


The data to be presented have been assembled from 
two sources. The first is a survey conducted by the 
National Opinion Research Center (1953). The survey 
sample covered a nationwide cross-section of Americans 
whose occupations ranged from laborer to professional 
man. The respondents were asked to give their own 
personal opinions of the general standing, i.e., prestige, 
of each of a series of 90 occupations. The ratings, made 
on a five-point scale, were converted into prestige 
scores by assigning a weight of .2 to the percentage of 
respondents rating a given occupation in the lowest 
category, a weight of .4 for the next lowest, .6 for the 
middle, .8 for the next highest, and 1.0 for the highest. 
The weighted percentages were then summed across 
the categories to yield the prestige score for the oc- 
cupation in question. By this procedure the minimum 
score was 20, assigned to an occupation unanimously 
placed in the lowest rating category; the maximum was 
100, assigned to an occupation unanimously placed in 
the highest rating category. 

The data for high status subjects were taken from 
three unpublished studies performed by the author 
with Vale undergraduates in which incidental ratings 
of occupational prestige were collected. All three re- 
quired subjects to rate the amount of prestige com- 
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TABLE 1 
‘PRESTIGE SCORES 

Nationwide decile — N* Sample occupation Nationwide Yale mean Diference p 

10 6 Diplomat 92.17 93.33 — 1716) ns 

9 5 Judge 86.60 88.90 —2.30 ns 

8 5 Psychologist 84.00 80.00 4.00 .05 

1 4 Symphony musician 79.75 69.00 10.75 ‘03 
6 5 Farm owner and operator 75.00 58.60 16.40 .004 
5 8 Policeman 70.13 56.37 13.76 001 
4 5 Plumber 63.80 41.40 22.40 004 
3 7 Restaurant cook 56.71 41.89 14.82 .001. 
2 5 Farm hand 49.20 27.00 22.20 004 

1 6 Shoeshiner 38.83 32.17 6.66 ns 


a WV refers to the number of occupations used in each decile of the complete nationwide distribution. 
b The sample for each decile lies as close as possible to the median of the Yale ratings of occupations in that decile. 


? One-tailed values. 


manded by each occupation in the population at 
large.* The first study included 14 subjects whose data 
were appropriate for the present problem, the second 
51, and the third 41. In the first study, 31 occupations 
were the same as the ones in the nationwide sample; 
in the second study 44; and in the third 14. There was 
considerable overlap in the occupations used in each of 
the studies so that, taken as a whole, the Vale sample 
rated 56 of the 90 occupations used in the nationwide 
survey. The data from each of the Yale samples were 
converted into the same kind of prestige scores em- 
ployed in the survey. Where the same occupation was 
used with more than one Yale sample, the mean of the 
Several scores was used to arrive at an overall value 
for the Yale group. 

The hypothesis that high status subjects would 
displace their ratings of moderate and low prestige 
occupations toward the low end of the prestige dimen- 
sion was tested by comparing the prestige scores based 
on the judgments of the nationwide sample to ones 
based on estimates of these judgments by Yale students. 
The comparison was made for the 56 occupations which 
were rated by both groups. The prestige scores of the 
nationwide sample can be considered an objective 
measure of social reality, much in the same way as the 
readings on a balance are an objective measure of 
physical reality. The Yale students’ judgments of the 
prestige. commanded by a given occupation in the 
population at large may be thought of as equivalent to 
judgments of the weight of a lifted stimulus. In the 
case of the latter, accuracy is defined in terms of agree- 
ment between the judgments and the reading on the 
balance. In the former, accuracy is defined by the agree- 
ment between the judgments and the scores of the 
nationwide sample. 


RESULTS 


Ten occupational prestige levels were created by 
grouping the occupations falling in the same 
decile of the prestige score distribution as found 
with the nationwide sample (see Table 1 for a 


3 The specific instructions were based on those used 
by McGarvey (1943, p. 28). 


sample occupation from each decile), The mean 
prestige scores of the nationwide and of the Yale 
samples were computed for each decile. These | 
means are presented in Table 1. 1 
Tn general, the data support the hypothesis that 
high status subjects displace their judgments of 
middle and low prestige occupations toward the ~ 
low end of the prestige dimension. With the 
exception of the two highest deciles, the Yale 
sample produced lower prestige scores than the 
nationwide sample. That is, the Yale students 
erred by displacing their judgments of the occu 
pations in the bottom eight deciles toward the low 
end of the scale. They gave lower prestige scores 
to 43 of the 45 occupations located in this region: 4 
An analysis of variance! showed that the over 
difference between the Yale and nationwide 
scores was significant (F = 124.66, p < .005, df = 
1/92) and that the interaction between the status 
of the judges, i.e., Yale or nationwide, and the 
prestige of the occupations judged was also signi 
cant (F = 8.34, p < .005, df = 9/92). : 
In order to interpret the significant interaction, 
the difference between the scores assigned by the 
two samples to occupations in each decile was 
tested for significance using the Mann-Whitney 
U test. In Table 1 it can be seen that the differencts 
between the samples with occupations from deciles 
2 through 8 were significant. The lack of si 
cance for the lowest decile may have been due to 
the unambiguous nature of the prestige of 
lowest occupations, which may have preveni 
judgmental displacements from occurring. 
suggestion is similar to one offered by Hinckley 
and Rethlingshafer (1951) as an explanation for 


* The analysis was based on the technique for e j 
equal frequencies presented by Walker and Lev uu d 
P- 381). The error term was computed from the d ia 
among the scores of the several occupations locate 
each of the 20 cells of the design. 


"their finding that tall and short men differ signifi- 
cantly in their judgments of moderate heights, 
- "but not of very tall or very short ones. 

Yale students appeared to be able to make finer 

discriminations near the high prestige end of the 
scale than near the low end. The differences 
— between the ratings assigned to occupations in 
adjacent deciles were significant at p < .05 or 
better, two-tailed test, for the following pairs: 
8 and 9, 7 and 8, 4 and 5, 2 and 3; the difference 
between deciles 6 and 7 approached significance 
(p < .12). This result is generally consistent with 
previous findings of greater discrimination among 
stimuli near anchors than among those farther 
away (Volkman, 1951). However, contrary to 
what might be expected on the basis of previous 
findings, Yale subjects did not discriminate 
significantly between the occupations in Categories 
= 9 and 10, even though these were near their 
| presumed anchors. 


Í 


DISCUSSION 


|. The assumption underlying the present study is 
that an anchor affects the metric of the subjective 
reference scale that subjects use in locating 
[stimuli along a judgment dimension. The anchor 
lis seen as exerting an assimilative pull on the 
reference scale that leads to a compression of the 
scale. The compression of the scale around the 
anchor increases the apparent distance between 
the anchor and new stimulus inputs that differ 
from the anchor and are referred to the scale. 
On the response level, this change is manifested as 
a displacement of the judgments of nonanchor 
stimuli away from the value assigned to the 
anchor. A fuller discussion of this view of anchor 
effects, with some supporting data, may be found 
in Perloe (1961). 

| Although the present results support the hypoth- 
esis drawn from the approach just described, and 
are consistent with previous findings related to 
this approach, another interpretation of the data 
_ is possible. This explanation holds that the anchor 
has its effect by changing the subject’s selection of 
the specific subjective reference scale to be used 
in judging a set of stimuli rather than by changing 
the metric of a single scale (McGarvey, 1943; 
Kelley et al., 1955). According to this view, 
anchors often determine which aspect of a set of 
complex stimuli will be salient and therefore which 
subjective reference scale will be selected for 
making judgments. 

In the present case, the contrast between the 
extremely desirable statuses held or aspired to by 
the Yale subjects and the relatively low desirability 
of the majority of middle and low status occupa- 


* The Mann-Whitney U test was employed. 
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tions they were asked to judge may have led 
them tp make their judgments in terms of their 
personal view of the desirability of the occupations 
rather than in terms of their “objective” prestige. 
1f subjects did judge the occupations in this way, 
it is not surprising that they should find middle 
and lower class occupations less desirable than 
the population. These occupations fall much 
farther below Yale subjects’ occupational aspira- 
tions than they do below the aspirations of the 
population at large. 

It is interesting to note, in this connection, that 
the original National Opinion Research Center 
(1953) study, which required respondents to 
give “their own personal opinion of the gen- 
eral standing” of occupations, found higher 
ratings given to most occupations by unskilled 
laborers, domestic, and service workers than by 
persons in more prestigeful occupations. This may 
have been due to the complement of the process 
suggested for the high status Yale subjects by the 
alternative explanation. On the other hand, it 
may also have been due to a metric distortion. 

The ambiguity of the quality being judged in 
the present experiment and in all experiments 
that ask subjects to locate meaningful stimuli 
along valued dimensions lends weight to the 
alternative explanation. It is of course possible 
that both’ scale distortion and substitution are 
involved in these anchor effects. The final evalu- 
ation of the importance of the two processes waits 
upon the application of more adequate techniques 
for measuring the metric and semantic properties 
of subjective reference scales. 


SUMMARY 


A test was made of the hypothesis that high 
status subjects would displace their judgments of 
moderate and low prestige occupations toward the 
low end of the prestige dimension. A comparison 
of the judgments of Yale students and previously 
published judgments of a nationwide sample 
supported the hypothesis, The results were 
attributed to the high status (Yale) subjects 
using their own actual or expected positions on 
the occupational prestige dimension as internal 
anchors. Two explanations of the anchor effect 
were discussed. One focused on the distorting 
effects that anchors have on the metric properties 
of subjective reference scales; the other stressed 
the role played by anchors in determining which 
of several potential subjective reference scales 
would be used in judging a set of stimuli. 
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